0
1

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?

【2025年3月公開 Arxiv論文ランキング】2503.xxxxx

Posted at

AI論文解説 Youtubeチャンネル: AI時代の羅針盤

2025年3月頃に公開されたcsカテゴリの論文 (ID: 2503.xxxxx)を被引用数のデータを元にランキングしています。ランキングは随時更新します。
(2025月6月14日更新)

被引用数   タイトル 動画
arxiv225 DAPO: An Open-Source LLM Reinforcement Learning System at Scale
arxiv190 Gemma 3 Technical Report
arxiv142 Understanding R1-Zero-Like Training: A Critical Perspective
arxiv123 Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models not yet
arxiv120 Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models not yet
arxiv117 Visual-RFT: Visual Reinforcement Fine-Tuning not yet
arxiv116 Wan: Open and Advanced Large-Scale Video Generative Models not yet
arxiv113 Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models not yet
arxiv109 SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild not yet
arxiv107 Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning not yet
arxiv97 L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning not yet
arxiv91 MM-Eureka: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning not yet
arxiv89 Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
arxiv82 R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization not yet
arxiv77 Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model not yet
arxiv64 R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization not yet
arxiv64 LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL not yet
arxiv64 R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning not yet
arxiv61 Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond not yet
arxiv59 Qwen2.5-Omni Technical Report not yet
arxiv58 R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model not yet
arxiv57 Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs not yet
arxiv54 GR00T N1: An Open Foundation Model for Generalist Humanoid Robots not yet
arxiv49 DAST: Difficulty-Adaptive Slow-Thinking for Large Reasoning Models not yet
arxiv47 Video-R1: Reinforcing Video Reasoning in MLLMs not yet
arxiv45 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond not yet
arxiv43 ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning not yet
arxiv42 A Survey on Test-Time Scaling in Large Language Models: What, How, Where, and How Well? not yet
arxiv42 Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning not yet
arxiv38 OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement not yet
arxiv38 Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement not yet
arxiv36 Why Do Multi-Agent LLM Systems Fail?
arxiv36 VisualPRM: An Effective Process Reward Model for Multimodal Reasoning not yet
arxiv35 Gemini Robotics: Bringing AI into the Physical World not yet
arxiv32 AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems not yet
arxiv32 Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching not yet
arxiv32 An Empirical Study on Eliciting and Improving R1-like Reasoning Models not yet
arxiv30 Model Context Protocol (MCP): Landscape, Security Threats, and Future Research Directions not yet
arxiv30 Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning not yet
arxiv30 Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models not yet
arxiv29 UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning not yet
arxiv29 Large Language Model Agent: A Survey on Methodology, Applications and Challenges not yet
arxiv29 Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey not yet
arxiv29 How Well do LLMs Compress Their Own Chain-of-Thought? A Token Complexity Approach not yet
arxiv27 Large Language Models Post-training: Surveying Techniques from Alignment to Reasoning not yet
arxiv26 Monitoring Reasoning Models for Misbehavior and the Risks of Promoting Obfuscation
arxiv23 WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation not yet
arxiv22 ToRL: Scaling Tool-Integrated RL not yet
arxiv22 What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret not yet
arxiv21 Which Economic Tasks are Performed with AI? Evidence from Millions of Claude Conversations not yet
arxiv20 CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models not yet
arxiv20 Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad
arxiv20 A Linear Collider Vision for the Future of Particle Physics not yet
arxiv20 GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing not yet
arxiv20 VACE: All-in-One Video Creation and Editing not yet
arxiv20 Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning not yet
arxiv19 Crossing the Reward Bridge: Expanding RL with Verifiable Rewards Across Diverse Domains not yet
arxiv19 CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models
arxiv19 SoK: Security Analysis of Blockchain-based Cryptocurrency not yet
arxiv19 SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks not yet
arxiv19 RWKV-7 "Goose" with Expressive Dynamic State Evolution not yet
arxiv19 Efficient Test-Time Scaling via Self-Calibration not yet
arxiv18 Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't not yet
arxiv18 Measuring AI Ability to Complete Long Tasks not yet
arxiv17 Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models not yet
arxiv17 Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning not yet
arxiv17 ReCamMaster: Camera-Controlled Generative Rendering from A Single Video not yet
arxiv17 HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model not yet
arxiv17 All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning not yet
arxiv17 Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable not yet
arxiv16 Vision-R1: Evolving Human-Free Alignment in Large Vision-Language Models via Vision-Guided Reinforcement Learning not yet
arxiv16 What Makes a Reward Model a Good Teacher? An Optimization Perspective not yet
arxiv16 LLM Agents for Education: Advances and Applications not yet
arxiv16 EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer not yet
arxiv16 The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models not yet
arxiv16 Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens not yet
arxiv15 Efficient Inference for Large Reasoning Models: A Survey not yet
arxiv15 Transformers without Normalization
arxiv15 KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding not yet
arxiv15 DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning not yet
arxiv14 Open Deep Search: Democratizing Search with Open-source Reasoning Agents
arxiv14 A Comprehensive Survey on Long Context Language Modeling not yet
arxiv14 OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning not yet
arxiv14 Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning not yet
arxiv14 VGGT: Visual Geometry Grounded Transformer not yet
arxiv14 ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning not yet
arxiv14 Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model not yet
arxiv14 START: Self-taught Reasoner with Tools not yet
arxiv14 EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test not yet
arxiv13 Reasoning-SQL: Reinforcement Learning with SQL Tailored Partial Rewards for Reasoning-Enhanced Text-to-SQL not yet
arxiv13 Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking not yet
arxiv13 1.4 Million Open-Source Distilled Reasoning Dataset to Empower Large Language Model Training not yet
arxiv13 XAttention: Block Sparse Attention with Antidiagonal Scoring not yet
arxiv13 A Survey on Trustworthy LLM Agents: Threats and Countermeasures not yet
arxiv13 Open-Sora 2.0: Training a Commercial-Level Video Generation Model in $200k not yet
arxiv13 YuE: Scaling Open Foundation Models for Long-Form Music Generation not yet
arxiv13 InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models not yet
arxiv13 R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning not yet
arxiv13 Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities not yet
arxiv13 MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents not yet
arxiv12 VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness not yet
arxiv12 Reasoning to Learn from Latent Thoughts
arxiv12 Defeating Prompt Injections by Design not yet
arxiv12 MetaSpatial: Reinforcing 3D Spatial Reasoning in VLMs for the Metaverse not yet
arxiv12 FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient Training R1-like Reasoning Models not yet
arxiv12 SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability not yet
arxiv12 Unified Reward Model for Multimodal Understanding and Generation not yet
arxiv12 Personalized Generation In Large Model Era: A Survey not yet
arxiv12 Audio-Reasoner: Improving Reasoning Capability in Large Audio Language Models not yet
arxiv11 SCORE: Story Coherence and Retrieval Enhancement for AI Narratives not yet
arxiv11 Agentic Large Language Models, a survey not yet
arxiv11 GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving not yet
arxiv11 Learning Multi-Level Features with Matryoshka Sparse Autoencoders not yet
arxiv11 Survey on Evaluation of LLM-based Agents not yet
arxiv11 Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation not yet
arxiv11 Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning not yet
arxiv11 CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models not yet
arxiv11 Long Context Tuning for Video Generation not yet
arxiv11 Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
arxiv11 Agentic AI for Scientific Discovery: A Survey of Progress, Challenges, and Future Directions not yet
arxiv11 Chain-of-Thought Reasoning In The Wild Is Not Always Faithful not yet
arxiv11 AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning not yet
arxiv11 Remasking Discrete Diffusion Models with Inference-Time Scaling not yet
arxiv10 Effectively Controlling Reasoning Models through Thinking Intervention not yet
arxiv10 A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models not yet
arxiv10 Think Before Recommend: Unleashing the Latent Reasoning Power for Sequential Recommendation not yet
arxiv10 Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback
arxiv10 Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks not yet
arxiv10 Social Network User Profiling for Anomaly Detection Based on Graph Neural Networks not yet
arxiv10 AgentRxiv: Towards Collaborative Autonomous Research not yet
arxiv10 Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback not yet
arxiv10 Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings not yet
arxiv10 Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding not yet
arxiv10 MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation not yet
arxiv10 SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation not yet
arxiv10 MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning not yet
arxiv10 SafeArena: Evaluating the Safety of Autonomous Web Agents
arxiv10 OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction not yet
arxiv10 A New $\sim 5\sigma$ Tension at Characteristic Redshift from DESI-DR1 BAO and DES-SN5YR Observations not yet
arxiv9 XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery? not yet
arxiv9 Large Language Models Pass the Turing Test not yet
arxiv9 Q-Insight: Understanding Image Quality via Visual Reinforcement Learning not yet
arxiv9 Lumina-Image 2.0: A Unified and Efficient Image Generative Framework not yet
arxiv9 Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models not yet
arxiv9 ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition
arxiv9 Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging not yet
arxiv9 LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning? not yet
arxiv9 HALHF: a hybrid, asymmetric, linear Higgs factory using plasma- and RF-based acceleration not yet
arxiv9 HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation not yet

※ 被引用数は更新日における NASA ADSのデータを参照しています
https://ui.adsabs.harvard.edu/

0
1
0

Register as a new user and use Qiita more conveniently

  1. You get articles that match your needs
  2. You can efficiently read back useful information
  3. You can use dark theme
What you can do with signing up
0
1

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?