Test answer stability across different sets (A, B, C, D, E) - Target: SSI ≥ 0.90
📋 How to use:
Set A contains A1a, A2a, A3a, A15a, A28a... (answers for questions 1, 2, 3, 15, 28...)
Set B contains A1b, A2b, A3b, A15b, A28b...
Works with ANY format: A1a: text | A1a) text | A1a. text | A1a text
The tool will group by answer number (A1, A2, A15, A28, etc.)
SSI is calculated for each answer across all sets
Answers with SSI < 0.90 are highlighted in RED for manual review
Example formats (all work!): Set A: A1a: The answer is 42 | A2a) Blue is the color Set B: A1b. Answer is forty-two | A2b The color is blue Set C: A1c) It's 42 | A2c: Blue
⚠️ Backend Required: Python backend must be running on localhost:5000