0
0

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?

【2025年6月公開 Arxiv論文ランキング】2506.xxxxx

Posted at

AI論文解説 Youtubeチャンネル: AI時代の羅針盤

2025年6月頃に公開されたcsカテゴリの論文 (ID: 2506.xxxxx)を被引用数のデータを元にランキングしています。ランキングは随時更新します。
(2025年9月18日更新)

被引用数   タイトル 動画
arxiv73 Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models not yet
arxiv59 Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
arxiv53 The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
arxiv44 AlphaEvolve: A coding agent for scientific and algorithmic discovery
arxiv39 FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space not yet
arxiv30 Reasoning with Exploration: An Entropy Perspective on Reinforcement Learning for LLMs not yet
arxiv27 MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention not yet
arxiv27 Spurious Rewards: Rethinking Training Signals in RLVR not yet
arxiv27 Your Brain on ChatGPT: Accumulation of Cognitive Debt when Using an AI Assistant for Essay Writing Task
arxiv27 OpenThoughts: Data Recipes for Reasoning Models not yet
arxiv25 Small Language Models are the Future of Agentic AI not yet
arxiv23 Architectural mechanisms of a universal fault-tolerant quantum computer not yet
arxiv23 OmniGen2: Exploration to Advanced Multimodal Generation not yet
arxiv23 V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
arxiv22 MiMo-VL Technical Report not yet
arxiv20 UMA: A Family of Universal Models for Atoms not yet
arxiv19 UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation not yet
arxiv18 Show-o2: Improved Native Unified Multimodal Models not yet
arxiv18 SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics not yet
arxiv17 Mercury: Ultra-Fast Language Models Based on Diffusion not yet
arxiv17 From Ground to Sky: Architectures, Applications, and Challenges Shaping Low-Altitude Wireless Networks not yet
arxiv17 dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching not yet
arxiv16 MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration not yet
arxiv16 Deep Research Agents: A Systematic Examination And Roadmap not yet
arxiv15 Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers not yet
arxiv15 OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling not yet
arxiv15 Magistral not yet
arxiv15 Constructive interference at the edge of quantum ergodic dynamics not yet
arxiv15 Seedance 1.0: Exploring the Boundaries of Video Generation Models
arxiv14 CRISP-SAM2: SAM2 with Cross-Modal Interaction and Semantic Prompting for Multi-Organ Segmentation not yet
arxiv13 MMSearch-R1: Incentivizing LMMs to Search not yet
arxiv13 DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation not yet
arxiv13 AceReason-Nemotron 1.1: Advancing Math and Code Reasoning through SFT and RL Synergy not yet
arxiv13 DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents not yet
arxiv13 The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning
arxiv12 Continuous operation of a coherent 3,000-qubit system not yet
arxiv12 Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs not yet
arxiv12 xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations not yet
arxiv12 Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions not yet
arxiv12 AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning not yet
arxiv11 Persona Features Control Emergent Misalignment not yet
arxiv11 SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning not yet
arxiv11 A Survey of LLM-Driven AI Agent Communication: Protocols, Security Risks, and Defense Countermeasures not yet
arxiv11 ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation not yet
arxiv11 Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective not yet
arxiv11 Leveraging erasure errors in logical qubits with metastable $^{171}$Yb atoms not yet
arxiv11 A Comprehensive Survey of Deep Research: Systems, Methodologies, and Applications not yet
arxiv11 Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency not yet
arxiv11 Follow-Your-Motion: Video Motion Transfer via Efficient Spatial-Temporal Decoupled Finetuning not yet
arxiv11 Follow-Your-Creation: Empowering 4D Creation through Video Inpainting not yet
arxiv11 GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents not yet
arxiv10 AgentOrchestra: A Hierarchical Multi-Agent Framework for General-Purpose Task Solving not yet
arxiv10 Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce not yet
arxiv10 Deep Research Bench: Evaluating AI Web Research Agents not yet
arxiv10 Seed-Coder: Let the Code Model Curate Data for Itself not yet
arxiv10 SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning not yet
arxiv10 MCP-Zero: Active Tool Discovery for Autonomous LLM Agents not yet
arxiv9 MEM1: Learning to Synergize Memory and Reasoning for Efficient Long-Horizon Agents not yet
arxiv9 ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies not yet
arxiv9 VGR: Visual Grounded Reasoning not yet
arxiv9 G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems not yet
arxiv9 RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics not yet
arxiv9 TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems not yet
arxiv9 Accelerating Diffusion LLMs via Adaptive Parallel Decoding not yet
arxiv8 Sequential Diagnosis with Language Models not yet
arxiv8 WorldVLA: Towards Autoregressive Action World Model
arxiv8 RoboTwin 2.0: A Scalable Data Generator and Benchmark with Strong Domain Randomization for Robust Bimanual Robotic Manipulation not yet
arxiv8 Towards AI Search Paradigm not yet
arxiv8 Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details not yet
arxiv8 Optimizing Length Compression in Large Reasoning Models not yet
arxiv8 High-fidelity entanglement and coherent multi-qubit mapping in an atom array not yet
arxiv8 Thought Crime: Backdoors and Emergent Misalignment in Reasoning Models not yet
arxiv8 Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs not yet
arxiv8 Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
arxiv8 Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning not yet
arxiv8 Curriculum Reinforcement Learning from Easy to Hard Tasks Improves LLM Reasoning not yet
arxiv8 Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library not yet
arxiv8 When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation not yet
arxiv8 SeedEdit 3.0: Fast and High-Quality Generative Image Editing not yet
arxiv8 MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark not yet
arxiv8 The Cost of Dynamic Reasoning: Demystifying AI Agents and Test-Time Scaling from an AI Infrastructure Perspective not yet
arxiv8 Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning not yet
arxiv8 Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning not yet
arxiv8 TIIF-Bench: How Does Your T2I Model Follow Your Instructions? not yet
arxiv8 A Graph Neural Network for the Era of Large Atomistic Models not yet
arxiv8 MMedAgent-RL: Optimizing Multi-Agent Collaboration for Multimodal Medical Reasoning not yet
arxiv7 SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning not yet
arxiv7 Pinching-Antenna Systems with In-Waveguide Attenuation: Performance Analysis and Algorithm Design not yet
arxiv7 Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge not yet
arxiv7 RLPR: Extrapolating RLVR to General Domains without Verifiers not yet
arxiv7 VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning not yet
arxiv7 OAgents: An Empirical Study of Building Effective Agents not yet
arxiv7 OneRec Technical Report not yet
arxiv7 We Should Identify and Mitigate Third-Party Safety Risks in MCP-Powered Agent Systems not yet
arxiv7 Model Context Protocol (MCP) at First Glance: Studying the Security and Maintainability of MCP Servers not yet
arxiv7 Continual Learning for Generative AI: From LLMs to MLLMs and Beyond not yet
arxiv7 SoundMind: RL-Incentivized Logic Reasoning for Audio-Language Models not yet
arxiv7 LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming? not yet
arxiv7 Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing not yet
arxiv7 Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity not yet
arxiv7 Diffuse and Disperse: Image Generation with Representation Regularization not yet
arxiv7 Design Patterns for Securing LLM Agents against Prompt Injections not yet
arxiv7 Reinforcement Pre-Training not yet
arxiv7 $\tau^2$-Bench: Evaluating Conversational Agents in a Dual-Control Environment not yet
arxiv7 WeThink: Toward General-purpose Vision-Language Reasoning via Reinforcement Learning not yet
arxiv7 Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models not yet
arxiv7 Research on E-Commerce Long-Tail Product Recommendation Mechanism Based on Large-Scale Language Models not yet
arxiv7 Research on Personalized Financial Product Recommendation by Integrating Large Language Models and Graph Neural Networks not yet
arxiv7 Log-Linear Attention not yet
arxiv7 OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models not yet
arxiv7 V2X-UniPool: Unifying Multimodal Perception and Knowledge Reasoning for Autonomous Driving not yet
arxiv7 EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language Navigation not yet
arxiv7 ACE-Step: A Step Towards Music Generation Foundation Model not yet
arxiv6 Hierarchical Reasoning Model
arxiv6 Potemkin Understanding in Large Language Models
arxiv6 FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language not yet
arxiv6 Mobile-R1: Towards Interactive Reinforcement Learning for VLM-Based Mobile Agent via Task-Level Rewards not yet
arxiv6 Unified Vision-Language-Action Model not yet
arxiv6 ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs not yet
arxiv6 OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation not yet
arxiv6 No Free Lunch: Rethinking Internal Feedback for LLM Reasoning not yet
arxiv6 TabArena: A Living Benchmark for Machine Learning on Tabular Data not yet
arxiv6 Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material not yet
arxiv6 GMT: General Motion Tracking for Humanoid Whole-Body Control not yet
arxiv6 ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM not yet
arxiv6 LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs not yet
arxiv6 Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning not yet
arxiv6 Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks not yet
arxiv6 Serving Large Language Models on Huawei CloudMatrix384 not yet
arxiv6 $\xi$-Based adaptive phase field model for quasi-static anti-plane fracture not yet
arxiv6 Model Organisms for Emergent Misalignment not yet
arxiv6 Self-Adapting Language Models
arxiv6 Fast on the Easy, Deep on the Hard: Efficient Reasoning via Powered Length Penalty not yet
arxiv6 TaskCraft: Automated Generation of Agentic Tasks not yet
arxiv6 Repeated ancilla reuse for logical computation on a neutral atom quantum computer not yet
arxiv6 e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs not yet
arxiv6 BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models not yet
arxiv6 MiniCPM4: Ultra-Efficient LLMs on End Devices not yet

※ 被引用数は更新日における NASA ADSのデータを参照しています
https://ui.adsabs.harvard.edu/

0
0
0

Register as a new user and use Qiita more conveniently

  1. You get articles that match your needs
  2. You can efficiently read back useful information
  3. You can use dark theme
What you can do with signing up
0
0

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?