Stable critical thinking that weighs evidence and assumptions to evaluate arguments.
Critical Thinking Test Vol.2
Quantify critical thinking across four sub-skills — recognizing assumptions, evaluating inferences, spotting fallacies, and judging reasons — in 16 questions built on everyday arguments, evidence, and hidden assumptions. Correctness is graded into an overall score and level, with a per-sub-skill breakdown, per-question results and explanations, strengths and growth areas, and study steps.
[Vol.2] A new 16-question set. A critical-thinking test about real-world arguments, measuring four sub-skills — recognizing assumptions, evaluating inferences, spotting fallacies, and judging reasons — across 16 questions. It grades your answers into an overall score and level, and shows a sub-skill profile, per-question results with explanations, strengths and growth areas, and a study plan. About 5–8 minutes.
What this assessment measures
Test result
Your level inferred from your score on 16 questions
The foundation is there; sharpening how you read evidence and assumptions will lift you.
Begin by getting used to the anatomy of an argument; separating claim from reason will steady you.
Example result report
Proficient
You weigh evidence and assumptions and evaluate the strength of arguments calmly — stable and reliable.
PassOverall score and pass line
Ability profile
Your score across ability domains (out of 100)
Domain-by-domain analysis
Spotting the unstated assumption an argument relies on. The key is to look for the condition that, if it failed, would break the conclusion.
Judging how firmly a conclusion follows from the given evidence. The key is to avoid overreach and not to mistake correlation for causation.
Spotting the flaw in everyday arguments. Knowing patterns like ad hominem, false dilemma, and circular reasoning keeps it steady.
Judging whether a reason is relevant and strong for a claim, or weak and irrelevant. The key is to weight systematic evidence over anecdote or popularity.
Your strengths
You separate the substance of a claim from the strength of the reasons behind it.
You distinguish correlation from causation and judge what the evidence supports.
You have an eye for hidden assumptions and fallacies lurking in an argument.
Next challenges
A perfect score — well done. Next, try items that weigh competing, more tangled reasons.
Make a habit of spotting one hidden assumption or fallacy in the news and ads you read.
Turn the same critical eye on your own views and examine the reasons from the opposing side.
Detailed analysis
This test measures critical thinking across several sub-skills. You reached the proficient level, with a steady grip on spotting assumptions, judging what evidence supports, and evaluating arguments. Your sub-skill strengths and weak spots are shown directly in the profile and the per-question results above. For any item you missed, retrace 'why this option is best supported' in its explanation, and a perfect score is within reach. Keep resisting persuasive phrasing and appeals to popularity or authority.
Question review
Q1
“Let's take an umbrella, because the forecast said it would rain.” What must the speaker be taking for granted for this to make sense?
Correct: B) The weather forecast is generally reliable.
To go from 'the forecast said rain' to 'take an umbrella', you must assume the forecast is at least somewhat reliable (B); if it were never right, it would give no reason to bring one. A doesn't claim it is already raining, C isn't the point (whether an umbrella is best), and D isn't stated — none is the assumption the argument depends on.
Q2
“This morning I saw five cars parked in the lot, and all five were white.” From this alone, what can be said most defensibly?
Correct: D) At least the five cars I saw this morning were white.
The only thing established is that the five cars seen this morning were white (D). Concluding a parking rule (A), local popularity (B), or the rarity of other colors (C) from just five cars is an unsupported leap from a tiny sample.
Q3
“That commentator talks about environmental policy, yet he himself drives a big car. So his argument isn't worth hearing.” What is the flaw?
Correct: A) It dismisses the claim because of the speaker's conduct rather than its substance.
Whether a claim is right should be judged on its content, but this rejects it because of what the speaker does ('drives a big car'). That's an ad hominem (a tu-quoque / hypocrisy attack) (A). His behavior and the truth of his claim are separate matters. There's no false dilemma (B), hasty generalization (C), or circular reasoning (D).
Q4
Which is the strongest (most relevant) reason to support the claim “this used car has been well maintained”?
Correct: C) It comes with a service logbook recording all scheduled inspections and part replacements.
What most directly supports 'well maintained' is a service logbook documenting inspections and replacements (C). Liking the color (A), a celebrity former owner (B), or a pleasant seller (D) are irrelevant or weak as evidence of maintenance.
Q5
“After we switched the office lighting to LED, employees' overtime hours fell. So the other branches should switch to LED too.” What is an unstated assumption this argument depends on?
Correct: C) The drop in overtime was mainly due to the lighting change.
The conclusion 'other branches should switch too' depends on the overtime drop being caused by the lighting change rather than by, say, a change in workload (C). If that fails, there's no basis to expect the same effect elsewhere. A, B, and D can be false while the argument still stands, so they aren't the assumption it depends on.
Q6
“A regional survey found that households with a pet dog tend to have residents who are more active in the neighborhood.” What is the most careful, defensible inference?
Correct: B) There's an association between owning a dog and neighborhood activity, but this alone can't establish cause and effect.
Correlation doesn't imply causation — the reverse (already-outgoing people are more likely to get a dog) or a common factor could explain it, so this survey alone can't establish cause (B). A assumes causation, D overstates a tendency into 'always', and C reverses the finding — all overreach.
Q7
“This investment method is a sure moneymaker — a famous investor once commented that it's a good method.” Which best describes the flaw?
Correct: D) It treats one offhand remark by a celebrity as proof of certainty.
Even from a famous investor, a one-line comment doesn't guarantee a 'sure moneymaker'. With no supporting data or conditions, the argument leans on authority (a famous name). That's an inappropriate appeal to authority (D). No false dilemma (A), topic shift (B), or appeal to popularity (C) is used.
Q8
Which is the most effective objection to the claim “this company should relocate its office to the city center”?
Correct: A) Data shows most employees chose the current location for its short commute, and a city-center move would greatly lengthen many employees' commutes.
The strongest objection uses data to show a concrete, serious harm — the move would greatly worsen many employees' commutes (A) — directly undercutting the case for relocating. B merely notes mixed opinions, C is an ad hominem against the proposer, and D is a vague prediction — all weak as objections.
Q9
“This password can never be cracked, because it has been targeted by many break-in attempts and none has ever succeeded.” What must be assumed for this to hold?
Correct: B) The attacks tried so far are representative of every attack that could ever be attempted.
To move from 'never cracked so far' to 'can never be cracked', you must assume past attempts represent every possible future method (no new technique will ever appear) (B). If they don't, an unknown method could still succeed. A, C, and D could be false without changing the logic, so they aren't the required assumption.
Q10
“This summer, in this town both air-conditioner sales and heatstroke hospitalizations rose sharply over last year.” What can be concluded?
Correct: C) Nothing beyond the fact that both AC sales and heatstroke cases rose can be concluded from this alone.
Two figures both rising doesn't reveal the reason (a common factor like an unusually hot summer is likely, among others), so the careful conclusion is just the fact itself (C). A and B mistake correlation for causation (in fact extreme heat likely drove both), and D is a baseless guess — all are leaps.
Q11
“We must not revise the current system even slightly. Once we touch it, exception after exception will be allowed, and eventually the whole system will collapse.” Which best describes the flaw?
Correct: A) It assumes, without warrant, that a small step must lead to an extreme outcome.
'A small revision → endless exceptions → total collapse' asserts that the first small step must inevitably lead to an extreme end, without showing the intervening causal links. That's a slippery slope (A). It isn't an ad hominem (B), circular reasoning (C), or appeal to popularity (D).
Q12
Which is the strongest reason supporting the claim “using this study material raises your English score”?
Correct: B) In a controlled trial that randomly split many learners into two groups and gave the material to only one, that group scored significantly higher.
The strongest reason is systematic evidence — a randomized, controlled comparison (B), which isolates the material's effect from other factors. A is a single anecdote, C is an appeal to popularity, and D is an irrelevant celebrity (misplaced authority); all are weak support.
Q13
“In our customer-satisfaction survey, stores with faster service had higher satisfaction. So if we add staff at every store to speed up service, satisfaction should rise.” Which assumption is most essential for this conclusion?
Correct: C) The higher satisfaction was due to fast service itself, not to other features of those stores (like location or product range).
To claim adding staff will raise satisfaction, you must assume the satisfaction came from fast service itself and not from other store-specific traits (location, product range) (C). If it came from those, speeding up service won't help. A only complicates the causal reading, and B and D aren't needed for satisfaction specifically to rise.
Q14
A school tried a new teaching method in two classes. After it was introduced, both classes' average scores rose. But three other classes that did NOT use the new method saw their averages rise by about the same amount over the same period. What is the most defensible inference about the new method?
Correct: B) Since the other classes rose by about the same amount, the increase may be due to a common factor other than the method, and the method's effect can't be confirmed from this data.
If the classes that didn't use the method rose by about the same amount, the increase likely stems from a schoolwide common factor (easier test, cohort-wide progress), so the method's specific effect isn't shown here (B). Claiming it's 'proven' (A) ignores that the comparison group undercuts it; 'no effect whatsoever' (C) overstates the other way; and D dismisses the inconvenient data.
Q15
“These rules are fair, because they were made through a fair procedure. And why is that procedure fair? Because it produced these fair rules.” What is the main flaw?
Correct: D) It runs in a circle, using the conclusion as its own support (circular reasoning).
The rules are fair because the procedure is fair, and the procedure is fair because it produced these fair rules — the argument uses the very conclusion it's trying to prove as its support, forming a loop. That's circular reasoning / begging the question (D). It isn't hasty generalization (A), a slippery slope (B), or a red herring (C).
Q16
Consider the claim “the school should stop opening its playground after hours,” weighing reasons for and against. Which is the weightiest (potentially decisive) objection?
Correct: B) Data shows there's no other safe place for children to play in this area, so ending the opening would force many children to play on busy roads, raising the risk of accidents.
The weightiest objection shows that stopping would cause a serious, concrete harm — children's safety, forced onto busy roads — with no alternative (B), a strong consideration that can be weighed directly against any benefit of stopping. A is a weak minority preference, C is an ad hominem against the proposer, and D is an appeal to tradition — none is decisive.
What to do next
For each item you missed, retrace 'why this option is best supported' in its explanation.
Drill distinguishing fallacy types — ad hominem, false dilemma, circular reasoning — to stop slips.
Practice spotting hidden assumptions and gauging reason-strength in daily articles and ads.
This test is reference information about critical-thinking tendencies, not a formal qualification or a guarantee of ability.
Developing
The foundation is there. Sharpen how you read evidence and assumptions and the next score band comes into view.
Almost thereOverall score and pass line
Ability profile
Your score across ability domains (out of 100)
Domain-by-domain analysis
Spotting the unstated assumption an argument relies on. The key is to look for the condition that, if it failed, would break the conclusion.
Judging how firmly a conclusion follows from the given evidence. The key is to avoid overreach and not to mistake correlation for causation.
Spotting the flaw in everyday arguments. Knowing patterns like ad hominem, false dilemma, and circular reasoning keeps it steady.
Judging whether a reason is relevant and strong for a claim, or weak and irrelevant. The key is to weight systematic evidence over anecdote or popularity.
Your strengths
You read the prompts to the end and compare the options.
You notice obvious ad hominems and clearly irrelevant reasons.
On items you grasp, you trace the reasons to a sound conclusion.
Growth areas
Distinguishing correlation from causation and gauging what evidence supports can trip you up.
You sometimes miss an unstated assumption and accept a conclusion too quickly.
Persuasive wording or majority opinion can pull you along.
Detailed analysis
This test measures critical thinking across several sub-skills. You are at the developing level, with the foundation in place. The profile and the per-question results above show which sub-skills have the most headroom. For the items you missed, write out the claim, the reasons, and the hidden assumption, check one by one how much each reason supports the conclusion, then reread the explanation — the same type of error will fade.
Question review
Q1
“Let's take an umbrella, because the forecast said it would rain.” What must the speaker be taking for granted for this to make sense?
Correct: B) The weather forecast is generally reliable.
To go from 'the forecast said rain' to 'take an umbrella', you must assume the forecast is at least somewhat reliable (B); if it were never right, it would give no reason to bring one. A doesn't claim it is already raining, C isn't the point (whether an umbrella is best), and D isn't stated — none is the assumption the argument depends on.
Q2
“This morning I saw five cars parked in the lot, and all five were white.” From this alone, what can be said most defensibly?
Correct: D) At least the five cars I saw this morning were white.
The only thing established is that the five cars seen this morning were white (D). Concluding a parking rule (A), local popularity (B), or the rarity of other colors (C) from just five cars is an unsupported leap from a tiny sample.
Q3
“That commentator talks about environmental policy, yet he himself drives a big car. So his argument isn't worth hearing.” What is the flaw?
Correct: A) It dismisses the claim because of the speaker's conduct rather than its substance.
Whether a claim is right should be judged on its content, but this rejects it because of what the speaker does ('drives a big car'). That's an ad hominem (a tu-quoque / hypocrisy attack) (A). His behavior and the truth of his claim are separate matters. There's no false dilemma (B), hasty generalization (C), or circular reasoning (D).
Q4
Which is the strongest (most relevant) reason to support the claim “this used car has been well maintained”?
Correct: C) It comes with a service logbook recording all scheduled inspections and part replacements.
What most directly supports 'well maintained' is a service logbook documenting inspections and replacements (C). Liking the color (A), a celebrity former owner (B), or a pleasant seller (D) are irrelevant or weak as evidence of maintenance.
Q5
“After we switched the office lighting to LED, employees' overtime hours fell. So the other branches should switch to LED too.” What is an unstated assumption this argument depends on?
Correct: C) The drop in overtime was mainly due to the lighting change.
The conclusion 'other branches should switch too' depends on the overtime drop being caused by the lighting change rather than by, say, a change in workload (C). If that fails, there's no basis to expect the same effect elsewhere. A, B, and D can be false while the argument still stands, so they aren't the assumption it depends on.
Q6
“A regional survey found that households with a pet dog tend to have residents who are more active in the neighborhood.” What is the most careful, defensible inference?
Correct: B) There's an association between owning a dog and neighborhood activity, but this alone can't establish cause and effect.
Correlation doesn't imply causation — the reverse (already-outgoing people are more likely to get a dog) or a common factor could explain it, so this survey alone can't establish cause (B). A assumes causation, D overstates a tendency into 'always', and C reverses the finding — all overreach.
Q7
“This investment method is a sure moneymaker — a famous investor once commented that it's a good method.” Which best describes the flaw?
Correct: D) It treats one offhand remark by a celebrity as proof of certainty.
Even from a famous investor, a one-line comment doesn't guarantee a 'sure moneymaker'. With no supporting data or conditions, the argument leans on authority (a famous name). That's an inappropriate appeal to authority (D). No false dilemma (A), topic shift (B), or appeal to popularity (C) is used.
Q8
Which is the most effective objection to the claim “this company should relocate its office to the city center”?
Correct: A) Data shows most employees chose the current location for its short commute, and a city-center move would greatly lengthen many employees' commutes.
The strongest objection uses data to show a concrete, serious harm — the move would greatly worsen many employees' commutes (A) — directly undercutting the case for relocating. B merely notes mixed opinions, C is an ad hominem against the proposer, and D is a vague prediction — all weak as objections.
Q9
“This password can never be cracked, because it has been targeted by many break-in attempts and none has ever succeeded.” What must be assumed for this to hold?
Correct: B) The attacks tried so far are representative of every attack that could ever be attempted.
To move from 'never cracked so far' to 'can never be cracked', you must assume past attempts represent every possible future method (no new technique will ever appear) (B). If they don't, an unknown method could still succeed. A, C, and D could be false without changing the logic, so they aren't the required assumption.
Q10
“This summer, in this town both air-conditioner sales and heatstroke hospitalizations rose sharply over last year.” What can be concluded?
Correct: C) Nothing beyond the fact that both AC sales and heatstroke cases rose can be concluded from this alone.
Two figures both rising doesn't reveal the reason (a common factor like an unusually hot summer is likely, among others), so the careful conclusion is just the fact itself (C). A and B mistake correlation for causation (in fact extreme heat likely drove both), and D is a baseless guess — all are leaps.
Q11
“We must not revise the current system even slightly. Once we touch it, exception after exception will be allowed, and eventually the whole system will collapse.” Which best describes the flaw?
Correct: A) It assumes, without warrant, that a small step must lead to an extreme outcome.
'A small revision → endless exceptions → total collapse' asserts that the first small step must inevitably lead to an extreme end, without showing the intervening causal links. That's a slippery slope (A). It isn't an ad hominem (B), circular reasoning (C), or appeal to popularity (D).
Q12
Which is the strongest reason supporting the claim “using this study material raises your English score”?
Correct: B) In a controlled trial that randomly split many learners into two groups and gave the material to only one, that group scored significantly higher.
The strongest reason is systematic evidence — a randomized, controlled comparison (B), which isolates the material's effect from other factors. A is a single anecdote, C is an appeal to popularity, and D is an irrelevant celebrity (misplaced authority); all are weak support.
Q13
“In our customer-satisfaction survey, stores with faster service had higher satisfaction. So if we add staff at every store to speed up service, satisfaction should rise.” Which assumption is most essential for this conclusion?
Correct: C) The higher satisfaction was due to fast service itself, not to other features of those stores (like location or product range).
To claim adding staff will raise satisfaction, you must assume the satisfaction came from fast service itself and not from other store-specific traits (location, product range) (C). If it came from those, speeding up service won't help. A only complicates the causal reading, and B and D aren't needed for satisfaction specifically to rise.
Q14
A school tried a new teaching method in two classes. After it was introduced, both classes' average scores rose. But three other classes that did NOT use the new method saw their averages rise by about the same amount over the same period. What is the most defensible inference about the new method?
Correct: B) Since the other classes rose by about the same amount, the increase may be due to a common factor other than the method, and the method's effect can't be confirmed from this data.
If the classes that didn't use the method rose by about the same amount, the increase likely stems from a schoolwide common factor (easier test, cohort-wide progress), so the method's specific effect isn't shown here (B). Claiming it's 'proven' (A) ignores that the comparison group undercuts it; 'no effect whatsoever' (C) overstates the other way; and D dismisses the inconvenient data.
Q15
“These rules are fair, because they were made through a fair procedure. And why is that procedure fair? Because it produced these fair rules.” What is the main flaw?
Correct: D) It runs in a circle, using the conclusion as its own support (circular reasoning).
The rules are fair because the procedure is fair, and the procedure is fair because it produced these fair rules — the argument uses the very conclusion it's trying to prove as its support, forming a loop. That's circular reasoning / begging the question (D). It isn't hasty generalization (A), a slippery slope (B), or a red herring (C).
Q16
Consider the claim “the school should stop opening its playground after hours,” weighing reasons for and against. Which is the weightiest (potentially decisive) objection?
Correct: B) Data shows there's no other safe place for children to play in this area, so ending the opening would force many children to play on busy roads, raising the risk of accidents.
The weightiest objection shows that stopping would cause a serious, concrete harm — children's safety, forced onto busy roads — with no alternative (B), a strong consideration that can be weighed directly against any benefit of stopping. A is a weak minority preference, C is an ad hominem against the proposer, and D is an appeal to tradition — none is decisive.
What to do next
Practice breaking an argument into claim, reasons, and hidden assumption.
Sort options into well-supported / merely plausible / irrelevant.
Work through correlation-vs-causation examples untimed and carefully.
This test is reference information about critical-thinking tendencies, not a formal qualification or a guarantee of ability.
Emerging
Start from the basics. Getting into the habit of splitting an argument into claim and reason will steady your evaluation fast.
Almost thereOverall score and pass line
Ability profile
Your score across ability domains (out of 100)
Domain-by-domain analysis
Spotting the unstated assumption an argument relies on. The key is to look for the condition that, if it failed, would break the conclusion.
Judging how firmly a conclusion follows from the given evidence. The key is to avoid overreach and not to mistake correlation for causation.
Spotting the flaw in everyday arguments. Knowing patterns like ad hominem, false dilemma, and circular reasoning keeps it steady.
Judging whether a reason is relevant and strong for a claim, or weak and irrelevant. The key is to weight systematic evidence over anecdote or popularity.
Your strengths
You stay with the task to the end — a base to build on.
On items with familiar topics, you grasp the situation.
Rereading the explanations helps you regrasp the approach.
Growth areas
Separating a claim from the reasons that support it is still shaky.
The difference between correlation and causation, and unstated assumptions, can be easy to miss.
You may take persuasive wording or majority opinion at face value as a reason.
Detailed analysis
This test measures critical thinking across several sub-skills. You are at the emerging level — not a ceiling on ability, but a sign the anatomy of an argument is still new. The profile and the per-question results above show where to start. Read aloud the explanations of the items you missed and check sentence by sentence 'is this a claim, or a reason?' — start there and your foundation will steady.
Question review
Q1
“Let's take an umbrella, because the forecast said it would rain.” What must the speaker be taking for granted for this to make sense?
Correct: B) The weather forecast is generally reliable.
To go from 'the forecast said rain' to 'take an umbrella', you must assume the forecast is at least somewhat reliable (B); if it were never right, it would give no reason to bring one. A doesn't claim it is already raining, C isn't the point (whether an umbrella is best), and D isn't stated — none is the assumption the argument depends on.
Q2
“This morning I saw five cars parked in the lot, and all five were white.” From this alone, what can be said most defensibly?
Correct: D) At least the five cars I saw this morning were white.
The only thing established is that the five cars seen this morning were white (D). Concluding a parking rule (A), local popularity (B), or the rarity of other colors (C) from just five cars is an unsupported leap from a tiny sample.
Q3
“That commentator talks about environmental policy, yet he himself drives a big car. So his argument isn't worth hearing.” What is the flaw?
Correct: A) It dismisses the claim because of the speaker's conduct rather than its substance.
Whether a claim is right should be judged on its content, but this rejects it because of what the speaker does ('drives a big car'). That's an ad hominem (a tu-quoque / hypocrisy attack) (A). His behavior and the truth of his claim are separate matters. There's no false dilemma (B), hasty generalization (C), or circular reasoning (D).
Q4
Which is the strongest (most relevant) reason to support the claim “this used car has been well maintained”?
Correct: C) It comes with a service logbook recording all scheduled inspections and part replacements.
What most directly supports 'well maintained' is a service logbook documenting inspections and replacements (C). Liking the color (A), a celebrity former owner (B), or a pleasant seller (D) are irrelevant or weak as evidence of maintenance.
Q5
“After we switched the office lighting to LED, employees' overtime hours fell. So the other branches should switch to LED too.” What is an unstated assumption this argument depends on?
Correct: C) The drop in overtime was mainly due to the lighting change.
The conclusion 'other branches should switch too' depends on the overtime drop being caused by the lighting change rather than by, say, a change in workload (C). If that fails, there's no basis to expect the same effect elsewhere. A, B, and D can be false while the argument still stands, so they aren't the assumption it depends on.
Q6
“A regional survey found that households with a pet dog tend to have residents who are more active in the neighborhood.” What is the most careful, defensible inference?
Correct: B) There's an association between owning a dog and neighborhood activity, but this alone can't establish cause and effect.
Correlation doesn't imply causation — the reverse (already-outgoing people are more likely to get a dog) or a common factor could explain it, so this survey alone can't establish cause (B). A assumes causation, D overstates a tendency into 'always', and C reverses the finding — all overreach.
Q7
“This investment method is a sure moneymaker — a famous investor once commented that it's a good method.” Which best describes the flaw?
Correct: D) It treats one offhand remark by a celebrity as proof of certainty.
Even from a famous investor, a one-line comment doesn't guarantee a 'sure moneymaker'. With no supporting data or conditions, the argument leans on authority (a famous name). That's an inappropriate appeal to authority (D). No false dilemma (A), topic shift (B), or appeal to popularity (C) is used.
Q8
Which is the most effective objection to the claim “this company should relocate its office to the city center”?
Correct: A) Data shows most employees chose the current location for its short commute, and a city-center move would greatly lengthen many employees' commutes.
The strongest objection uses data to show a concrete, serious harm — the move would greatly worsen many employees' commutes (A) — directly undercutting the case for relocating. B merely notes mixed opinions, C is an ad hominem against the proposer, and D is a vague prediction — all weak as objections.
Q9
“This password can never be cracked, because it has been targeted by many break-in attempts and none has ever succeeded.” What must be assumed for this to hold?
Correct: B) The attacks tried so far are representative of every attack that could ever be attempted.
To move from 'never cracked so far' to 'can never be cracked', you must assume past attempts represent every possible future method (no new technique will ever appear) (B). If they don't, an unknown method could still succeed. A, C, and D could be false without changing the logic, so they aren't the required assumption.
Q10
“This summer, in this town both air-conditioner sales and heatstroke hospitalizations rose sharply over last year.” What can be concluded?
Correct: C) Nothing beyond the fact that both AC sales and heatstroke cases rose can be concluded from this alone.
Two figures both rising doesn't reveal the reason (a common factor like an unusually hot summer is likely, among others), so the careful conclusion is just the fact itself (C). A and B mistake correlation for causation (in fact extreme heat likely drove both), and D is a baseless guess — all are leaps.
Q11
“We must not revise the current system even slightly. Once we touch it, exception after exception will be allowed, and eventually the whole system will collapse.” Which best describes the flaw?
Correct: A) It assumes, without warrant, that a small step must lead to an extreme outcome.
'A small revision → endless exceptions → total collapse' asserts that the first small step must inevitably lead to an extreme end, without showing the intervening causal links. That's a slippery slope (A). It isn't an ad hominem (B), circular reasoning (C), or appeal to popularity (D).
Q12
Which is the strongest reason supporting the claim “using this study material raises your English score”?
Correct: B) In a controlled trial that randomly split many learners into two groups and gave the material to only one, that group scored significantly higher.
The strongest reason is systematic evidence — a randomized, controlled comparison (B), which isolates the material's effect from other factors. A is a single anecdote, C is an appeal to popularity, and D is an irrelevant celebrity (misplaced authority); all are weak support.
Q13
“In our customer-satisfaction survey, stores with faster service had higher satisfaction. So if we add staff at every store to speed up service, satisfaction should rise.” Which assumption is most essential for this conclusion?
Correct: C) The higher satisfaction was due to fast service itself, not to other features of those stores (like location or product range).
To claim adding staff will raise satisfaction, you must assume the satisfaction came from fast service itself and not from other store-specific traits (location, product range) (C). If it came from those, speeding up service won't help. A only complicates the causal reading, and B and D aren't needed for satisfaction specifically to rise.
Q14
A school tried a new teaching method in two classes. After it was introduced, both classes' average scores rose. But three other classes that did NOT use the new method saw their averages rise by about the same amount over the same period. What is the most defensible inference about the new method?
Correct: B) Since the other classes rose by about the same amount, the increase may be due to a common factor other than the method, and the method's effect can't be confirmed from this data.
If the classes that didn't use the method rose by about the same amount, the increase likely stems from a schoolwide common factor (easier test, cohort-wide progress), so the method's specific effect isn't shown here (B). Claiming it's 'proven' (A) ignores that the comparison group undercuts it; 'no effect whatsoever' (C) overstates the other way; and D dismisses the inconvenient data.
Q15
“These rules are fair, because they were made through a fair procedure. And why is that procedure fair? Because it produced these fair rules.” What is the main flaw?
Correct: D) It runs in a circle, using the conclusion as its own support (circular reasoning).
The rules are fair because the procedure is fair, and the procedure is fair because it produced these fair rules — the argument uses the very conclusion it's trying to prove as its support, forming a loop. That's circular reasoning / begging the question (D). It isn't hasty generalization (A), a slippery slope (B), or a red herring (C).
Q16
Consider the claim “the school should stop opening its playground after hours,” weighing reasons for and against. Which is the weightiest (potentially decisive) objection?
Correct: B) Data shows there's no other safe place for children to play in this area, so ending the opening would force many children to play on busy roads, raising the risk of accidents.
The weightiest objection shows that stopping would cause a serious, concrete harm — children's safety, forced onto busy roads — with no alternative (B), a strong consideration that can be weighed directly against any benefit of stopping. A is a weak minority preference, C is an ad hominem against the proposer, and D is an appeal to tradition — none is decisive.
What to do next
Start by pointing out the claim and the reason, one each, in everyday news and conversation.
Say aloud 'does this reason really support the claim?' to check it.
Do three easy items a day, pairing each with reading the explanation aloud.
This test is reference information about critical-thinking tendencies, not a formal qualification or a guarantee of ability.
Who it's for
Anyone who wants to sharpen how they weigh evidence and evaluate arguments, or read information more critically at work or in study.
What the result looks like
Shows an overall score and level, a four-sub-skill profile, per-question results with explanations, strengths and growth areas, and next study steps.
This assessment has 1 sections and 16 questions.
Once you start, you cannot change the language. Switch beforehand if needed.