Assessment

Critical Thinking Test Vol.2

1 sections16 questions

Quantify critical thinking across four sub-skills — recognizing assumptions, evaluating inferences, spotting fallacies, and judging reasons — in 16 questions built on everyday arguments, evidence, and hidden assumptions. Correctness is graded into an overall score and level, with a per-sub-skill breakdown, per-question results and explanations, strengths and growth areas, and study steps.

[Vol.2] A new 16-question set. A critical-thinking test about real-world arguments, measuring four sub-skills — recognizing assumptions, evaluating inferences, spotting fallacies, and judging reasons — across 16 questions. It grades your answers into an overall score and level, and shows a sub-skill profile, per-question results with explanations, strengths and growth areas, and a study plan. About 5–8 minutes.

Start answering

What this assessment measures

Test result

Your level inferred from your score on 16 questions

Proficient

PROFICIENT

Stable critical thinking that weighs evidence and assumptions to evaluate arguments.

Developing

DEVELOPING

The foundation is there; sharpening how you read evidence and assumptions will lift you.

Emerging

EMERGING

Begin by getting used to the anatomy of an argument; separating claim from reason will steady you.

Example result report

PROFICIENT

Proficient

You weigh evidence and assumptions and evaluate the strength of arguments calmly — stable and reliable.

Pass

Overall score

100/ 100

Correct

16/ 16

Top

95%ile

Accuracy

100%

Overall score and pass line

100

Ability profile

Your score across ability domains (out of 100)

Domain-by-domain analysis

Spotting the unstated assumption an argument relies on. The key is to look for the condition that, if it failed, would break the conclusion.

Judging how firmly a conclusion follows from the given evidence. The key is to avoid overreach and not to mistake correlation for causation.

Spotting the flaw in everyday arguments. Knowing patterns like ad hominem, false dilemma, and circular reasoning keeps it steady.

Judging whether a reason is relevant and strong for a claim, or weak and irrelevant. The key is to weight systematic evidence over anecdote or popularity.

Your strengths

You separate the substance of a claim from the strength of the reasons behind it.

You distinguish correlation from causation and judge what the evidence supports.

You have an eye for hidden assumptions and fallacies lurking in an argument.

Next challenges

A perfect score — well done. Next, try items that weigh competing, more tangled reasons.

Make a habit of spotting one hidden assumption or fallacy in the news and ads you read.

Turn the same critical eye on your own views and examine the reasons from the opposing side.

Detailed analysis

This test measures critical thinking across several sub-skills. You reached the proficient level, with a steady grip on spotting assumptions, judging what evidence supports, and evaluating arguments. Your sub-skill strengths and weak spots are shown directly in the profile and the per-question results above. For any item you missed, retrace 'why this option is best supported' in its explanation, and a perfect score is within reach. Keep resisting persuasive phrasing and appeals to popularity or authority.

Question review

○

“Let's take an umbrella, because the forecast said it would rain.” What must the speaker be taking for granted for this to make sense?

Correct: B) The weather forecast is generally reliable.

To go from 'the forecast said rain' to 'take an umbrella', you must assume the forecast is at least somewhat reliable (B); if it were never right, it would give no reason to bring one. A doesn't claim it is already raining, C isn't the point (whether an umbrella is best), and D isn't stated — none is the assumption the argument depends on.

○

“This morning I saw five cars parked in the lot, and all five were white.” From this alone, what can be said most defensibly?

Correct: D) At least the five cars I saw this morning were white.

The only thing established is that the five cars seen this morning were white (D). Concluding a parking rule (A), local popularity (B), or the rarity of other colors (C) from just five cars is an unsupported leap from a tiny sample.

○

“That commentator talks about environmental policy, yet he himself drives a big car. So his argument isn't worth hearing.” What is the flaw?

Correct: A) It dismisses the claim because of the speaker's conduct rather than its substance.

Whether a claim is right should be judged on its content, but this rejects it because of what the speaker does ('drives a big car'). That's an ad hominem (a tu-quoque / hypocrisy attack) (A). His behavior and the truth of his claim are separate matters. There's no false dilemma (B), hasty generalization (C), or circular reasoning (D).

○

Which is the strongest (most relevant) reason to support the claim “this used car has been well maintained”?

Correct: C) It comes with a service logbook recording all scheduled inspections and part replacements.

What most directly supports 'well maintained' is a service logbook documenting inspections and replacements (C). Liking the color (A), a celebrity former owner (B), or a pleasant seller (D) are irrelevant or weak as evidence of maintenance.

○

“After we switched the office lighting to LED, employees' overtime hours fell. So the other branches should switch to LED too.” What is an unstated assumption this argument depends on?

Correct: C) The drop in overtime was mainly due to the lighting change.

The conclusion 'other branches should switch too' depends on the overtime drop being caused by the lighting change rather than by, say, a change in workload (C). If that fails, there's no basis to expect the same effect elsewhere. A, B, and D can be false while the argument still stands, so they aren't the assumption it depends on.

○

“A regional survey found that households with a pet dog tend to have residents who are more active in the neighborhood.” What is the most careful, defensible inference?

Correct: B) There's an association between owning a dog and neighborhood activity, but this alone can't establish cause and effect.

Correlation doesn't imply causation — the reverse (already-outgoing people are more likely to get a dog) or a common factor could explain it, so this survey alone can't establish cause (B). A assumes causation, D overstates a tendency into 'always', and C reverses the finding — all overreach.

○

“This investment method is a sure moneymaker — a famous investor once commented that it's a good method.” Which best describes the flaw?

Correct: D) It treats one offhand remark by a celebrity as proof of certainty.

Even from a famous investor, a one-line comment doesn't guarantee a 'sure moneymaker'. With no supporting data or conditions, the argument leans on authority (a famous name). That's an inappropriate appeal to authority (D). No false dilemma (A), topic shift (B), or appeal to popularity (C) is used.

○

Which is the most effective objection to the claim “this company should relocate its office to the city center”?

Correct: A) Data shows most employees chose the current location for its short commute, and a city-center move would greatly lengthen many employees' commutes.

The strongest objection uses data to show a concrete, serious harm — the move would greatly worsen many employees' commutes (A) — directly undercutting the case for relocating. B merely notes mixed opinions, C is an ad hominem against the proposer, and D is a vague prediction — all weak as objections.

○

“This password can never be cracked, because it has been targeted by many break-in attempts and none has ever succeeded.” What must be assumed for this to hold?

Correct: B) The attacks tried so far are representative of every attack that could ever be attempted.

To move from 'never cracked so far' to 'can never be cracked', you must assume past attempts represent every possible future method (no new technique will ever appear) (B). If they don't, an unknown method could still succeed. A, C, and D could be false without changing the logic, so they aren't the required assumption.

○

Q10

“This summer, in this town both air-conditioner sales and heatstroke hospitalizations rose sharply over last year.” What can be concluded?

Correct: C) Nothing beyond the fact that both AC sales and heatstroke cases rose can be concluded from this alone.

Two figures both rising doesn't reveal the reason (a common factor like an unusually hot summer is likely, among others), so the careful conclusion is just the fact itself (C). A and B mistake correlation for causation (in fact extreme heat likely drove both), and D is a baseless guess — all are leaps.

○

Q11

“We must not revise the current system even slightly. Once we touch it, exception after exception will be allowed, and eventually the whole system will collapse.” Which best describes the flaw?

Correct: A) It assumes, without warrant, that a small step must lead to an extreme outcome.

'A small revision → endless exceptions → total collapse' asserts that the first small step must inevitably lead to an extreme end, without showing the intervening causal links. That's a slippery slope (A). It isn't an ad hominem (B), circular reasoning (C), or appeal to popularity (D).

○

Q12

Which is the strongest reason supporting the claim “using this study material raises your English score”?

Correct: B) In a controlled trial that randomly split many learners into two groups and gave the material to only one, that group scored significantly higher.

The strongest reason is systematic evidence — a randomized, controlled comparison (B), which isolates the material's effect from other factors. A is a single anecdote, C is an appeal to popularity, and D is an irrelevant celebrity (misplaced authority); all are weak support.

○

Q13

“In our customer-satisfaction survey, stores with faster service had higher satisfaction. So if we add staff at every store to speed up service, satisfaction should rise.” Which assumption is most essential for this conclusion?

Correct: C) The higher satisfaction was due to fast service itself, not to other features of those stores (like location or product range).

To claim adding staff will raise satisfaction, you must assume the satisfaction came from fast service itself and not from other store-specific traits (location, product range) (C). If it came from those, speeding up service won't help. A only complicates the causal reading, and B and D aren't needed for satisfaction specifically to rise.

○

Q14

A school tried a new teaching method in two classes. After it was introduced, both classes' average scores rose. But three other classes that did NOT use the new method saw their averages rise by about the same amount over the same period. What is the most defensible inference about the new method?

Correct: B) Since the other classes rose by about the same amount, the increase may be due to a common factor other than the method, and the method's effect can't be confirmed from this data.

If the classes that didn't use the method rose by about the same amount, the increase likely stems from a schoolwide common factor (easier test, cohort-wide progress), so the method's specific effect isn't shown here (B). Claiming it's 'proven' (A) ignores that the comparison group undercuts it; 'no effect whatsoever' (C) overstates the other way; and D dismisses the inconvenient data.

○

Q15

“These rules are fair, because they were made through a fair procedure. And why is that procedure fair? Because it produced these fair rules.” What is the main flaw?

Correct: D) It runs in a circle, using the conclusion as its own support (circular reasoning).

The rules are fair because the procedure is fair, and the procedure is fair because it produced these fair rules — the argument uses the very conclusion it's trying to prove as its support, forming a loop. That's circular reasoning / begging the question (D). It isn't hasty generalization (A), a slippery slope (B), or a red herring (C).

○

Q16

Consider the claim “the school should stop opening its playground after hours,” weighing reasons for and against. Which is the weightiest (potentially decisive) objection?

Correct: B) Data shows there's no other safe place for children to play in this area, so ending the opening would force many children to play on busy roads, raising the risk of accidents.

The weightiest objection shows that stopping would cause a serious, concrete harm — children's safety, forced onto busy roads — with no alternative (B), a strong consideration that can be weighed directly against any benefit of stopping. A is a weak minority preference, C is an ad hominem against the proposer, and D is an appeal to tradition — none is decisive.

What to do next

For each item you missed, retrace 'why this option is best supported' in its explanation.

Drill distinguishing fallacy types — ad hominem, false dilemma, circular reasoning — to stop slips.

Practice spotting hidden assumptions and gauging reason-strength in daily articles and ads.

This test is reference information about critical-thinking tendencies, not a formal qualification or a guarantee of ability.

Who it's for

Anyone who wants to sharpen how they weigh evidence and evaluate arguments, or read information more critically at work or in study.

What the result looks like

Shows an overall score and level, a four-sub-skill profile, per-question results with explanations, strengths and growth areas, and next study steps.

This assessment has 1 sections and 16 questions.

Once you start, you cannot change the language. Switch beforehand if needed.