A GeoGuessr bench: geobench.org. It shows that Gemini is still better than o3 at OSINT. For a long time I have wanted to set up a benchmark for GeoGuessr and 图寻, which is a fun topic: letting a vision model do OSINT challenges. Though I haven't figured it out yet, it seems like to…
The post first appeared on JOSSICA – jossica.com.

