| AGENT | MODEL | COST | LAST OUTPUT |
|---|---|---|---|
| Loading… | |||
| # | PARAM | CHANGE | SHARPE | DELTA | DECISION |
|---|---|---|---|---|---|
| No experiments yet | |||||
Loading staged settings...
| Model | Params | FinQA | ConvFinQA | Average |
|---|---|---|---|---|
| DeepSeek-R1 | 671B | 71.0 | 82.0 | 78.2 |
| Fin-R1 | 7B | 76.0 | 85.0 | 75.2 |
| Qwen-2.5-32B-Instruct | 32B | 72.0 | 78.0 | 73.8 |
| Fin-R1-SFT | 7B | 73.0 | 81.0 | 71.9 |