On line SMASH seminar: Advancing Scientific Research with AI Research Agents

On line seminar

The lecture will be given by Tatiana Shavrina, Phd, an experienced AI/ML Research Lead with 12 years of experience, passionate about NLP, AGI and open-source.

Abstract:

Large Language Model (LLM) agents are poised to transform the landscape of scientific research by automating complex, multi-stage workflows. To accelerate progress in this domain, AI Research Agents are applied to more and more modeling sciences, the first being AI Research itself.

The talk will cover recent works in the domain, including frontier scaffolds, LLMS, and benchmarks.We will look at how the current agentic tasks encompass a broad spectrum of scientific challenges, including language modeling, mathematics, bioinformatics, and time series forecasting. We will rigorously evaluate agentic capabilities across the entire research lifecycle—spanning ideation, experimental analysis, and iterative refinement—without providing baseline code.

Our empirical results reveal that while agents surpass human state-of-the-art (SOTA) performance in four tasks, they fall short in sixteen, and even the best-performing agents do not reach the theoretical task ceilings. These findings highlight that AIRS-Bench remains unsaturated, offering significant headroom for future advancements.

The link to the lecture will be provided shortly before the event date.