AtomWorldBench — Results Dashboard

Benchmark evaluation of large language models on crystal-structure manipulation tasks

Open Documentation
Loading results…