Trapstreet.run

run · run-mpizw3io-3

claude-pdf

on Legal Contract Review

status

scored · passed

total score

0.895

cases

17 passed · 2 failed

latency

69.17s

cost

$3.684

tokens

scored

2026-05-23 23:44:03

duration

69s

Run summary

score
0.895
passed
total cases
19
passed cases
17
skipped cases
0
pass threshold
0.800
tokens total
null
cost total
$3.684
latency p95
7.25 s
latency total
69.17 s
latency median
3.24 s

score by category

dates
100%
money
86%
clauses
88%
deposit
100%
scenario
100%
scenario_reasoning
100%

extras

n_scored=19n_skipped_no_gold=0

Solution metadata

Self-reported by the solution. Not validated.

repo
https://github.com/Ruqii/trapstreet-solutions
engine
claude-opus-4-7

Per-case results

caseexitdurationmetrics
break_clause02921 msid=break_clausetype=booleanmodel=claude-opus-4-7score=1category=clausesusd_cost=0.119616difficulty=mediumagent_answer=noinput_tokens=64output_tokens=5expected_answer=nomatcher_results=[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
deposit_amount02576 msid=deposit_amounttype=numericmodel=claude-opus-4-7score=1category=moneyusd_cost=0.119451difficulty=easyagent_answer=£2,250.00input_tokens=28output_tokens=10expected_answer=2250matcher_results=[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
deposit_dispute_escalation04695 msid=deposit_dispute_escalationtype=textmodel=claude-opus-4-7score=1category=depositusd_cost=0.123786difficulty=hardagent_answer=The dispute will be submitted to the ICE (Independent Case Examiner of The Dispute Service) for adjudication, and all parties agree to co-operate with the adjudication.input_tokens=62output_tokens=61expected_answer=The dispute may be submitted to the Independent Case Examiner (ICE) for adjudication.matcher_results=[object Object],[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
deposit_scheme03211 msid=deposit_schemetype=textmodel=claude-opus-4-7score=1category=clausesusd_cost=0.120801difficulty=mediumagent_answer=The Dispute Service (Tenancy Deposit Scheme).input_tokens=38output_tokens=26expected_answer=TDSmatcher_results=[object Object],[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
early_surrender03008 msid=early_surrendertype=booleanmodel=claude-opus-4-7score=0category=clausesusd_cost=0.119481difficulty=hardagent_answer=Noinput_tokens=55output_tokens=5expected_answer=yesmatcher_results=[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
early_surrender_economic_incentive04206 msid=early_surrender_economic_incentivetype=booleanmodel=claude-opus-4-7score=1category=scenario_reasoningusd_cost=0.126381difficulty=expertagent_answer=No. The clause only specifies that the tenant is liable for "any difference in rental payments... to the extent that the replacement Tenant will be paying a lower amount." It does not provide for any refund or credit if the replacement tenant pays a higher rent.input_tokens=125output_tokens=83expected_answer=nomatcher_results=[object Object],[object Object],[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
fixed_term_departure_notice02810 msid=fixed_term_departure_noticetype=booleanmodel=claude-opus-4-7score=1category=clausesusd_cost=0.119571difficulty=hardagent_answer=Yesinput_tokens=51output_tokens=7expected_answer=yesmatcher_results=[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
governing_act03665 msid=governing_acttype=textmodel=claude-opus-4-7score=1category=clausesusd_cost=0.120351difficulty=hardagent_answer=Housing Act 1988input_tokens=78output_tokens=12expected_answer=Housing Act 1988matcher_results=[object Object],[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
inventory_referenced02519 msid=inventory_referencedtype=booleanmodel=claude-opus-4-7score=1category=clausesusd_cost=0.119676difficulty=hardagent_answer=yesinput_tokens=63output_tokens=6expected_answer=yesmatcher_results=[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
late_rent_interest_rate03659 msid=late_rent_interest_ratetype=textmodel=claude-opus-4-7score=1category=moneyusd_cost=0.120831difficulty=hardagent_answer=3% per annum above the Bank of England base rateinput_tokens=70output_tokens=20expected_answer=3% per annum above Bank of England base ratematcher_results=[object Object],[object Object],[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
pets_allowed02888 msid=pets_allowedtype=textmodel=claude-opus-4-7score=1category=clausesusd_cost=0.119706difficulty=mediumagent_answer=with landlord consentinput_tokens=45output_tokens=10expected_answer=with landlord consentmatcher_results=[object Object],[object Object],[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
post_fixed_term_extension02934 msid=post_fixed_term_extensiontype=textmodel=claude-opus-4-7score=1category=clausesusd_cost=0.121986difficulty=hardagent_answer=The agreement automatically extends for a further period of six months from the expiration of the term, with rent increasing by 5%.input_tokens=47output_tokens=40expected_answer=the tenancy automatically extends for six monthsmatcher_results=[object Object],[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
rent_increase_scope03730 msid=rent_increase_scopetype=textmodel=claude-opus-4-7score=1category=moneyusd_cost=0.119871difficulty=hardagent_answer=The automatic extension period.input_tokens=46output_tokens=12expected_answer=the automatic extension period onlymatcher_results=[object Object],[object Object],[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
rent_payment_day02649 msid=rent_payment_daytype=textmodel=claude-opus-4-7score=0category=moneyusd_cost=0.119226difficulty=mediumagent_answer=5thinput_tokens=28output_tokens=7expected_answer=on or prior 5th of the monthmatcher_results=[object Object],[object Object],[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
rent_year205755 msid=rent_year2type=numericmodel=claude-opus-4-7score=1category=moneyusd_cost=1.479907difficulty=mediumagent_answer=£2100.00input_tokens=48output_tokens=9expected_answer=2100matcher_results=[object Object]cache_read_input_tokens=0cache_creation_input_tokens=78854
rent_year303881 msid=rent_year3type=numericmodel=claude-opus-4-7score=1category=moneyusd_cost=0.120051difficulty=mediumagent_answer=£2400.00input_tokens=73output_tokens=9expected_answer=2400matcher_results=[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
scenario_leave_22mo_replacement_1mo_gap07247 msid=scenario_leave_22mo_replacement_1mo_gaptype=numericmodel=claude-opus-4-7score=1category=scenariousd_cost=0.150801difficulty=hardagent_answer=Months remaining at surrender: 36 − 22 = 14 months early. Replacement found after 1 month, so replacement occupies remaining 13 months. Rent at month 22 of a 36-month term starting 05/09/2022: - Months 1–12 (05/09/2022–04/09/2023): £1,950 - Months 13–24 (05/09/2023–04/09/2024): £2,100 - Months 25–36 (05/09/2024–04/09/2025): £2,400 Month 22 falls in the £2,100 period. The 1-month gap is at month 23 = £2,100. The replacement's 13 months: months 24 (£2,100) + months 25–36 (12 × £2,400 = £28,800) = £30,900. (a) Rent gap: £2,100 (b) Letting fee: £30,900 × 13.2% = £4,078.80 (c) Inventory: £144 × 14 / 36 = £56.00 (d) Admin: £480 × 14 / 36 = £186.67 Total: £2,100 + £4,078.80 + £56.00 + £186.67 = **£6,421.47**input_tokens=273output_tokens=379expected_answer=£6,421.47matcher_results=[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
term_start03572 msid=term_starttype=datemodel=claude-opus-4-7score=1category=datesusd_cost=0.119691difficulty=easyagent_answer=05/09/2022input_tokens=39output_tokens=11expected_answer=05/09/2022matcher_results=[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
total_rent_fixed_term03239 msid=total_rent_fixed_termtype=numericmodel=claude-opus-4-7score=1category=moneyusd_cost=0.123321difficulty=hardagent_answer=12 × £1950 + 12 × £2100 + 12 × £2400 = £23,400 + £25,200 + £28,800 = £77,400.input_tokens=66output_tokens=54expected_answer=77400matcher_results=[object Object]cache_read_input_tokens=78854cache_creation_input_tokens=0
discuss this run →