[arXiv Paper Ranking: Published May 2025] 2505.xxxxx

Posted at 2025-08-01

AI paper explainer YouTube channel: AI時代の羅針盤

This is a ranking of cs-category papers published around May 2025 (ID: 2505.xxxxx), ordered by citation count. The ranking is updated from time to time.
(Updated August 2, 2025)

| Citations | Title | Video |
|---|---|---|
| 295 | Qwen3 Technical Report | not yet |
| 44 | Seed1.5-VL Technical Report | not yet |
| 39 | Llama-Nemotron: Efficient Reasoning Models | not yet |
| 32 | Absolute Zero: Reinforced Self-play Reasoning with Zero Data | |
| 29 | Reasoning Models Don't Always Say What They Think | |
| 29 | RM-R1: Reward Modeling as Reasoning | not yet |
| 26 | T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT | not yet |
| 24 | Emerging Properties in Unified Multimodal Pretraining | not yet |
| 22 | LLMs Get Lost In Multi-Turn Conversation | not yet |
| 19 | BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset | not yet |
| 18 | Learning to Reason without External Rewards | not yet |
| 18 | Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models | not yet |
| 17 | AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges | not yet |
| 17 | ZeroSearch: Incentivize the Search Capability of LLMs without Searching | |
| 16 | OpenThinkIMG: Learning to Think with Images via Visual Tool Reinforcement Learning | not yet |
| 15 | Skywork Open Reasoner 1 Technical Report | not yet |
| 15 | Between Underthinking and Overthinking: An Empirical Study of Reasoning Length and correctness in LLMs | not yet |
| 14 | SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training | not yet |
| 14 | HealthBench: Evaluating Large Language Models Towards Improved Human Health | not yet |
| 14 | A survey of agent interoperability protocols: Model Context Protocol (MCP), Agent Communication Protocol (ACP), Agent-to-Agent Protocol (A2A), and Agent Network Protocol (ANP) | not yet |
| 14 | 100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models | not yet |
| 13 | ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models | |
| 13 | Future Circular Collider Feasibility Study Report: Volume 1, Physics, Experiments, Detectors | not yet |
| 12 | MMaDA: Multimodal Large Diffusion Language Models | |
| 12 | CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process | not yet |
| 12 | The Open Molecules 2025 (OMol25) Dataset, Evaluations, and Models | not yet |
| 11 | The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models | not yet |
| 11 | The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning | not yet |
| 11 | AdaptThink: Reasoning Models Can Learn When to Think | not yet |
| 11 | ARC-AGI-2: A New Challenge for Frontier AI Reasoning Systems | |
| 11 | Scalable Chain of Thoughts via Elastic Reasoning | not yet |
| 11 | HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation | not yet |
| 11 | Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models | not yet |
| 11 | Practical Efficiency of Muon for Pretraining | not yet |
| 11 | Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions | not yet |
| 10 | Avocado Price Prediction Using a Hybrid Deep Learning Model: TCN-MLP-Attention Architecture | not yet |
| 10 | MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining | not yet |
| 10 | X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains | not yet |
| 10 | Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning | not yet |
| 10 | Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions | not yet |
| 10 | Nemotron-Research-Tool-N1: Exploring Tool-Using Language Models with Reinforced Reasoning | not yet |
| 9 | WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning | not yet |
| 9 | Think Only When You Need with Large Hybrid-Reasoning Models | not yet |
| 9 | Aya Vision: Advancing the Frontier of Multilingual Multimodality | not yet |
| 9 | Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation | not yet |
| 9 | LlamaFirewall: An open source guardrail system for building secure AI agents | not yet |
| 9 | HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models | not yet |
| 9 | Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning | not yet |
| 8 | Evaluating Supervised Learning Models for Fraud Detection: A Comparative Study of Classical and Deep Architectures on Imbalanced Transaction Data | not yet |
| 8 | SETransformer: A Hybrid Attention-Based Architecture for Robust Human Activity Recognition | not yet |
| 8 | General-Reasoner: Advancing LLM Reasoning Across All Domains | not yet |
| 8 | Thinkless: LLM Learns When to Think | not yet |
| 8 | CTLformer: A Hybrid Denoising Model Combining Convolutional Layers and Self-Attention for Enhanced CT Image Reconstruction | not yet |
| 8 | Group-in-Group Policy Optimization for LLM Agent Training | not yet |
| 8 | DanceGRPO: Unleashing GRPO on Visual Generation | not yet |
| 8 | Crosslingual Reasoning through Test-Time Scaling | not yet |
| 8 | GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data | not yet |
| 8 | TWIST: Teleoperated Whole-Body Imitation System | not yet |
| 7 | One-shot Entropy Minimization | not yet |
| 7 | LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models | not yet |
| 7 | AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting | not yet |
| 7 | PhyX: Does Your Model Have the "Wits" for Physical Reasoning? | not yet |
| 7 | Reasoning Models Better Express Their Confidence | not yet |
| 7 | DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning | not yet |
| 7 | Robin: A multi-agent system for automating scientific discovery | not yet |
| 7 | AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning | not yet |
| 7 | Cloud-Based AI Systems: Leveraging Large Language Models for Intelligent Fault Detection and Autonomous Self-Healing | not yet |
| 7 | Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization | not yet |
| 7 | J1: Incentivizing Thinking in LLM-as-a-Judge via Reinforcement Learning | not yet |
| 7 | Generative AI for Autonomous Driving: Frontiers and Opportunities | not yet |
| 7 | User Behavior Analysis in Privacy Protection with Large Language Models: A Study on Privacy Preferences with Limited Data | not yet |
| 7 | UniVLA: Learning to Act Anywhere with Task-centric Latent Actions | not yet |
| 7 | Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging | not yet |
| 7 | Personalized Risks and Regulatory Strategies of Large Language Models in Digital Advertising | not yet |
| 7 | FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models | not yet |
| 7 | Sailing by the Stars: A Survey on Reward Models and Learning Strategies for Learning from Rewards | not yet |
| 7 | R-Bench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation | not yet |
| 7 | On the generalization of language models from in-context learning and finetuning: a controlled study | |
| 7 | PDCS: A Primal-Dual Large-Scale Conic Programming Solver with GPU Enhancements | not yet |
| 7 | Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems | not yet |
| 6 | MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration | not yet |
| 6 | Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents | |
| 6 | Can Large Reasoning Models Self-Train? | not yet |
| 6 | SageAttention2++: A More Efficient Implementation of SageAttention2 | not yet |
| 6 | Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution | not yet |
| 6 | AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning | not yet |
| 6 | MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO | not yet |
| 6 | ChartEdit: How Far Are MLLMs From Automating Chart Analysis? Evaluating MLLMs' Capability via Chart Editing | not yet |
| 6 | Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL | not yet |
| 6 | Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and Enhancement | not yet |
| 6 | S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models | not yet |
| 6 | Flow-GRPO: Training Flow Matching Models via Online RL | not yet |
| 6 | TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation | not yet |
| 6 | Vision-Language-Action Models: Concepts, Progress, Applications and Challenges | not yet |
| 6 | OpenHelix: A Short Survey, Empirical Analysis, and Open-Source Dual-System VLA Model for Robotic Manipulation | not yet |
| 6 | R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning | not yet |
| 6 | Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents | not yet |
| 6 | LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey | not yet |
| 6 | Sum Rate Maximization for NOMA-Assisted Uplink Pinching-Antenna Systems | not yet |
| 6 | T2VPhysBench: A First-Principles Benchmark for Physical Consistency in Text-to-Video Generation | not yet |
| 6 | Base Models Beat Aligned Models at Randomness and Creativity | not yet |
| 5 | AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning | |
| 5 | OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation | not yet |
| 5 | Large Language Models Often Know When They Are Being Evaluated | not yet |
| 5 | Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better | not yet |
| 5 | CoThink: Token-Efficient Reasoning via Instruct Models Guiding Reasoning Models | not yet |
| 5 | Dissipative Preparation of Many-Body Quantum States: Towards Practical Quantum Advantage | not yet |
| 5 | ImgEdit: A Unified Image Editing Dataset and Benchmark | not yet |
| 5 | Research on feature fusion and multimodal patent text based on graph attention network | not yet |
| 5 | HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters | not yet |
| 5 | Convergence Analysis of Adaptive Finite Element Algorithms for a Regularized Variational Model of Quasi-Static Brittle Fracture in "Strain-Limiting" Elastic Solids | not yet |
| 5 | SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond | not yet |
| 5 | Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation | not yet |
| 5 | Brownian Bridge Augmented Surrogate Simulation and Injection Planning for Geological CO$_2$ Storage | not yet |
| 5 | From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning | not yet |
| 5 | LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning | not yet |
| 5 | Pixel Reasoner: Incentivizing Pixel-Space Reasoning with Curiosity-Driven Reinforcement Learning | not yet |
| 5 | Towards Holistic Evaluation of Large Audio-Language Models: A Comprehensive Survey | not yet |
| 5 | Thought-Augmented Policy Optimization: Bridging External Guidance and Internal Capabilities | not yet |
| 5 | Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought | not yet |
| 5 | Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning | not yet |
| 5 | UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning | not yet |
| 5 | Beyond Semantics: The Unreasonable Effectiveness of Reasonless Intermediate Tokens | not yet |
| 5 | Mean Flows for One-step Generative Modeling | not yet |
| 5 | Optimizing Anytime Reasoning via Budget Relative Policy Optimization | not yet |
| 5 | Cross-Cloud Data Privacy Protection: Optimizing Collaborative Mechanisms of AI Systems by Integrating Federated Learning and LLMs | not yet |
| 5 | Harnessing the Universal Geometry of Embeddings | not yet |
| 5 | HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation | not yet |
| 5 | SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning | not yet |
| 5 | GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning | not yet |
| 5 | Agent Name Service (ANS): A Universal Directory for Secure AI Agent Discovery and Interoperability | not yet |
| 5 | Large Language Models Are More Persuasive Than Incentivized Human Persuaders | not yet |
| 5 | Energy-Efficient Resource Allocation for NOMA-Assisted Uplink Pinching-Antenna Systems | not yet |

* Citation counts are taken from NASA ADS data as of the update date.
https://ui.adsabs.harvard.edu/
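
For readers who want to build a similar list from their own citation data, here is a minimal Python sketch of ordering papers by citation count. It is not the script used for this article: the `Paper` type, the `rank_by_citations` helper, and the choice to give tied papers the same rank are all assumptions made for illustration. The sample rows are copied from the table above, and fetching counts from NASA ADS is left out.

```python
from typing import NamedTuple


class Paper(NamedTuple):
    # Hypothetical record type; the article does not describe its data format.
    title: str
    citations: int


def rank_by_citations(papers):
    """Return (rank, paper) pairs sorted by citation count, highest first.

    Tied papers share a rank (an assumption; the article does not say
    how ties are handled). sorted() is stable, so tied papers keep
    their input order.
    """
    ordered = sorted(papers, key=lambda p: p.citations, reverse=True)
    ranked = []
    for position, paper in enumerate(ordered, start=1):
        if ranked and ranked[-1][1].citations == paper.citations:
            rank = ranked[-1][0]  # same count as the previous paper
        else:
            rank = position
        ranked.append((rank, paper))
    return ranked


# Sample rows taken from the table above, deliberately out of order.
papers = [
    Paper("RM-R1: Reward Modeling as Reasoning", 29),
    Paper("Qwen3 Technical Report", 295),
    Paper("Absolute Zero: Reinforced Self-play Reasoning with Zero Data", 32),
    Paper("Seed1.5-VL Technical Report", 44),
    Paper("Reasoning Models Don't Always Say What They Think", 29),
    Paper("Llama-Nemotron: Efficient Reasoning Models", 39),
]

for rank, paper in rank_by_citations(papers):
    print(f"{rank:>2}  {paper.citations:>4}  {paper.title}")
```

Because citation counts change over time, rerunning a script like this on fresh data can reorder the list, which is why the ranking above carries an update date.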
