1
1

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?

【2025年4月公開 Arxiv論文ランキング】2504.xxxxx

Last updated at Posted at 2025-06-14

AI論文解説 Youtubeチャンネル: AI時代の羅針盤

2025年4月頃に公開されたcsカテゴリの論文 (ID: 2504.xxxxx)を被引用数のデータを元にランキングしています。ランキングは随時更新します。
(2025月8月2日更新)

被引用数   タイトル 動画
arxiv187 InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models not yet
arxiv151 Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
arxiv113 VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model not yet
arxiv69 VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks not yet
arxiv66 Kimi-VL Technical Report not yet
arxiv63 Reinforcement Learning for Reasoning in Large Language Models with One Training Example
arxiv61 $\pi_{0.5}$: a Vision-Language-Action Model with Open-World Generalization
arxiv57 Reasoning Models Can Be Effective Without Thinking
arxiv52 Inference-Time Scaling for Generalist Reward Modeling not yet
arxiv50 DeepResearcher: Scaling Deep Research via Reinforcement Learning in Real-world Environments not yet
arxiv49 ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning not yet
arxiv47 Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning not yet
arxiv46 VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning not yet
arxiv46 Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems not yet
arxiv44 TTRL: Test-Time Reinforcement Learning
arxiv40 RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning not yet
arxiv40 ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
arxiv38 DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition
arxiv38 SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models not yet
arxiv38 GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents not yet
arxiv36 VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning not yet
arxiv35 A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment not yet
arxiv35 ToolRL: Reward is All Tool Learning Needs not yet
arxiv34 Phi-4-reasoning Technical Report
arxiv33 The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
arxiv31 Step1X-Edit: A Practical Framework for General Image Editing not yet
arxiv30 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce not yet
arxiv28 Dynamic Early Exit in Reasoning Models not yet
arxiv28 Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought not yet
arxiv28 PaperBench: Evaluating AI's Ability to Replicate AI Research not yet
arxiv27 A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility not yet
arxiv27 Concise Reasoning via Reinforcement Learning not yet
arxiv27 GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation not yet
arxiv27 Command A: An Enterprise-Ready Large Language Model
arxiv26 Kimi-Audio Technical Report not yet
arxiv26 BrowseComp: A Simple Yet Challenging Benchmark for Browsing Agents not yet
arxiv25 DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning not yet
arxiv24 Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning not yet
arxiv24 A Survey of Frontiers in LLM Reasoning: Inference Scaling, Learning to Reason, and Agentic Systems not yet
arxiv24 Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model not yet
arxiv23 WebThinker: Empowering Large Reasoning Models with Deep Research Capability not yet
arxiv23 AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset not yet
arxiv23 Efficient Reasoning Models: A Survey not yet
arxiv23 Transfer between Modalities with MetaQueries not yet
arxiv21 DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning not yet
arxiv21 SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement not yet
arxiv21 Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining not yet
arxiv21 Rethinking Reflection in Pre-Training not yet
arxiv20 Learning to Reason under Off-Policy Guidance not yet
arxiv20 SRPO: A Cross-Domain Implementation of Large-Scale Reinforcement Learning on LLM not yet
arxiv20 SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL not yet
arxiv20 Perception-R1: Pioneering Perception Policy with Reinforcement Learning not yet
arxiv20 Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving not yet
arxiv20 LLM Social Simulations Are a Promising Research Method not yet
arxiv20 OpenCodeReasoning: Advancing Data Distillation for Competitive Coding not yet
arxiv20 Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents not yet
arxiv19 Acting Less is Reasoning More! Teaching Model to Act Efficiently
arxiv19 Right Question is Already Half the Answer: Fully Unsupervised LLM Reasoning Incentivization not yet
arxiv19 Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use not yet
arxiv18 ReasonIR: Training Retrievers for Reasoning Tasks not yet
arxiv18 Building A Secure Agentic AI Application Leveraging A2A Protocol
arxiv18 InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners not yet
arxiv18 NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation not yet
arxiv18 Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? not yet
arxiv18 Reasoning Models Know When They're Right: Probing Hidden States for Self-Verification not yet
arxiv18 SmartBugBert: BERT-Enhanced Vulnerability Detection for Smart Contract Bytecode not yet
arxiv18 GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning not yet
arxiv17 Safety in Large Reasoning Models: A Survey not yet
arxiv17 PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models not yet
arxiv17 Enterprise-Grade Security for the Model Context Protocol (MCP): Frameworks and Mitigation Strategies not yet
arxiv17 SmolVLM: Redefining small and efficient multimodal models not yet
arxiv17 Enhancing Smart Contract Vulnerability Detection in DApps Leveraging Fine-Tuned LLM not yet
arxiv17 Less-to-More Generalization: Unlocking More Controllability by In-Context Generation not yet
arxiv17 Z1: Efficient Test-time Scaling with Code not yet
arxiv16 The Leaderboard Illusion
arxiv16 A Survey of AI Agent Protocols not yet
arxiv16 Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning not yet
arxiv16 Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement Learning not yet
arxiv16 d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning not yet
arxiv16 Seedream 3.0 Technical Report not yet
arxiv16 GRPO-LEAD: A Difficulty-Aware Reinforcement Learning Approach for Concise Mathematical Reasoning in Language Models not yet
arxiv16 TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning not yet
arxiv16 SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning not yet
arxiv16 On The Landscape of Spoken Language Models: A Comprehensive Survey not yet
arxiv16 SEAL: Steerable Reasoning Calibration of Large Language Models for Free not yet
arxiv16 ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use not yet
arxiv16 Efficient Reinforcement Finetuning via Adaptive Curriculum Learning not yet
arxiv16 Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models not yet
arxiv16 An Approach to Technical AGI Safety and Security not yet
arxiv16 MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs not yet
arxiv16 m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models not yet
arxiv15 One-Minute Video Generation with Test-Time Training
arxiv15 UniToken: Harmonizing Multimodal Understanding and Generation through Unified Visual Encoding not yet
arxiv15 GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning not yet
arxiv14 In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer not yet
arxiv14 SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning not yet
arxiv14 Optimized Path Planning for Logistics Robots Using Ant Colony Algorithm under Multiple Constraints not yet
arxiv14 An Illusion of Progress? Assessing the Current State of Web Agents not yet
arxiv14 WorldScore: A Unified Evaluation Benchmark for World Generation not yet
arxiv14 JudgeLRM: Large Reasoning Models as a Judge not yet
arxiv13 Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory not yet
arxiv13 Fast-Slow Thinking for Large Vision-Language Model Reasoning not yet
arxiv13 VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models not yet
arxiv13 Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning not yet
arxiv13 Packing Input Frame Context in Next-Frame Prediction Models for Video Generation not yet
arxiv13 The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections not yet
arxiv13 MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft not yet
arxiv13 Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning not yet
arxiv13 STAR-1: Safer Alignment of Reasoning LLMs with 1K Data not yet
arxiv12 Ada-R1: Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization not yet
arxiv12 From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review not yet
arxiv12 Malicious Code Detection in Smart Contracts via Opcode Vectorization not yet
arxiv12 WORLDMEM: Long-term Consistent World Simulation with Memory not yet
arxiv12 MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning not yet
arxiv12 RealSafe-R1: Safety-Aligned DeepSeek-R1 without Compromising Reasoning Capability not yet
arxiv12 SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models not yet
arxiv12 TxGemma: Efficient and Agentic LLMs for Therapeutics not yet
arxiv12 Understanding Aha Moments: from External Observations to Internal Mechanisms not yet
arxiv12 Why do LLMs attend to the first token? not yet
arxiv12 SkyReels-A2: Compose Anything in Video Diffusion Transformers not yet
arxiv11 SWE-smith: Scaling Data for Software Engineering Agents
arxiv11 TesserAct: Learning 4D Embodied World Models not yet
arxiv11 Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods not yet
arxiv11 A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future not yet
arxiv11 TextArena not yet
arxiv11 VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge not yet
arxiv11 SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users not yet
arxiv11 Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning not yet
arxiv11 SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills not yet
arxiv11 Leanabell-Prover: Posttraining Scaling in Formal Reasoning not yet
arxiv11 Think When You Need: Self-Adaptive Chain-of-Thought Learning not yet
arxiv11 Improved Visual-Spatial Reasoning via R1-Zero-Like Training not yet
arxiv10 ShorterBetter: Guiding Reasoning Models to Find Optimal Inference Length for Efficient Reasoning not yet
arxiv10 WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks not yet
arxiv10 HalluLens: LLM Hallucination Benchmark not yet
arxiv10 DreamO: A Unified Framework for Image Customization not yet
arxiv10 Describe Anything: Detailed Localized Image and Video Captioning
arxiv10 SConU: Selective Conformal Uncertainty in Large Language Models not yet
arxiv10 IMAGGarment-1: Fine-Grained Garment Generation for Controllable Fashion Design not yet
arxiv10 SkyReels-V2: Infinite-length Film Generative Model not yet
arxiv10 Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time not yet
arxiv10 REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers not yet
arxiv10 Psychological Health Knowledge-Enhanced LLM-based Social Network Crisis Intervention Text Transfer Recognition Method not yet
arxiv10 Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models not yet
arxiv10 Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning not yet
arxiv10 APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay not yet
arxiv10 Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme not yet
arxiv10 Cognitive Memory in Large Language Models not yet
arxiv10 Inference-Time Scaling for Complex Tasks: Where We Stand and What Lies Ahead
arxiv9 Multidimensional precipitation index prediction based on CNN-LSTM hybrid framework not yet
arxiv9 Securing GenAI Multi-Agent Systems Against Tool Squatting: A Zero Trust Registry-Based Approach not yet
arxiv9 BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese not yet
arxiv9 Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks not yet
arxiv9 Process Reward Models That Think not yet
arxiv9 Tina: Tiny Reasoning Models via LoRA not yet

※ 被引用数は更新日における NASA ADSのデータを参照しています
https://ui.adsabs.harvard.edu/

1
1
0

Register as a new user and use Qiita more conveniently

  1. You get articles that match your needs
  2. You can efficiently read back useful information
  3. You can use dark theme
What you can do with signing up
1
1

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?