[2025 Week 25] MetaX Weekly AI Paper Review

MetaX Weekly AI Paper Review Week 25 2025 -- Active Research in Innovative Architecture for Context Expansion and Efficiency Improvement of Large Language Models, AI Performance Improvement Through Multilingual Multimodal Benchmark Development, Feedback Integration, and Test-Time Computation Optimization: MiniMax-M1: World first open-weight large-scale reasoning model supporting 1 million token context combining hybrid MoE architecture and lightning attention. MultiFinBen: First multilingual multimodal benchmark specialized in the financial domain evaluating real financial communication ability of LLMs. Scientists First Exam: Scientific MLLM benchmark evaluating scientific cognitive ability in three stages of signal recognition, attribute understanding, and comparative reasoning. DeepResearch Bench: Deep research agent benchmark consisting of 100 PhD-level research tasks evaluating web navigation, information retrieval, and synthesis capability. Scaling Test-time Compute for LLM Agents: Various test-time scaling strategies. Additional papers covered: context window extension techniques enabling models to process much longer inputs; efficiency improvements reducing memory and compute requirements for inference; multilingual capability evaluation ensuring AI systems work across diverse languages; and feedback integration methods enabling AI systems to improve based on human evaluation signals.

[2025 Week 25] MetaX Weekly AI Paper Review

Related Articles

The Privacy Paradox: Why We Worry Yet Share Our Data So Easi

[Paper Review] Generational Differences in Acceptance of AI

Are Large Language Models Truly Intelligent, or Just Sophist

Related Articles

논문리뷰
The Privacy Paradox: Why We Worry Yet Share Our Data So Easi
이든 기자 · 2026.06.05

논문리뷰
[Paper Review] Generational Differences in Acceptance of AI
류성훈 기자 · 2026.06.04

논문리뷰
Are Large Language Models Truly Intelligent, or Just Sophist
이든 기자 · 2026.06.04