29.
The new model called O1's ability in medical related and other tasks.
Nice study providing a comprehensive evaluation of OpenAI's o1-preview LLM.
Shows strong performance across many tasks:
- competitive programming
- generating coherent and accurate radiology reports
- high school-level mathematical reasoning tasks
- chip design tasks
- anthropology and geology
- quantitative investing
- social media analysis
... and many other domains and problems.
👇
https://t.me/Medical_journals_and_updates/86826