🎯 Cross-Set Question Stability Checker

Test the same question across different sets (A, B, C, D, E) - Target: SSI ≥ 0.90

📋 How to use: Example formats (all work!):
Set A: Q1a: Mary likes parks | Q2a) The sky is blue
Set B: Q1b. Mary wants parks | Q2b The sky appears blue
Set C: Q1c) Mary visits parks | Q2c: Sky looks blue

⚠️ Backend Required: Python backend must be running on localhost:5000