mmBERT: How Johns Hopkins Built a 1,833-Language AI That Outperforms XLM-R mmBERT: How Johns Hopkins Built a 1,833-Language AI That Outperforms XLM-R Imagine trying to build an AI system that truly understands human language not just in English, but in over 7,000 languages s... AI research annealed language learning cross-lingual digital inclusion encoder-only FlashAttention Gemma tokenizer GLUE inverse masking language model mmBERT ModernBERT MTEB multilingual NLP XLM-R XTREME
Test-Time Diffusion Deep Researcher: Ushering in a Human-Like AI Research Paradigm Introducing Test-Time Diffusion Deep Researcher (TTD-DR) framework an AI assistant that doesn't just gather information, but actively thinks, revises, and refines its work, much like a skilled human r... AI research draft refinement large language models multihop reasoning research automation retrieval augmentation self-evolution test-time diffusion