[arXiv Paper Ranking: Published January 2025] 2501.xxxxx

AI paper explainer YouTube channel: AI時代の羅針盤

This ranking orders cs-category papers published around January 2025 (arXiv IDs 2501.xxxxx) by citation count. The ranking is updated from time to time.
(Updated April 4, 2025)

Citations   Title   Video
arxiv821 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning not yet
arxiv85 s1: Simple test-time scaling not yet
arxiv78 Kimi k1.5: Scaling Reinforcement Learning with LLMs not yet
arxiv67 rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
arxiv44 Cosmos World Foundation Model Platform for Physical AI not yet
arxiv39 2 OLMo 2 Furious not yet
arxiv36 The Lessons of Developing Process Reward Models in Mathematical Reasoning
arxiv34 Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models not yet
arxiv32 Humanity's Last Exam
arxiv31 Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling not yet
arxiv27 LTX-Video: Realtime Video Latent Diffusion not yet
arxiv26 MiniMax-01: Scaling Foundation Models with Lightning Attention not yet
arxiv25 Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought not yet
arxiv24 SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
arxiv22 VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction not yet
arxiv22 Titans: Learning to Memorize at Test Time
arxiv19 Search-o1: Agentic Search-Enhanced Large Reasoning Models
arxiv17 Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps
arxiv17 LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs not yet
arxiv17 REINFORCE++: An Efficient RLHF Algorithm with Robustness to Both Prompt and Reward Models not yet
arxiv16 On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis not yet
arxiv16 FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving not yet
arxiv14 Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs
arxiv14 A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models not yet
arxiv14 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding not yet
arxiv14 Evolving Deeper LLM Thinking
arxiv14 Open Problems in Machine Unlearning for AI Safety
arxiv14 Agent Laboratory: Using LLM Agents as Research Assistants
arxiv13 On the Computational Capability of Graph Neural Networks: A Circuit Complexity Bound Perspective not yet
arxiv12 Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling not yet
arxiv12 Neural Algorithmic Reasoning for Hypergraphs with Looped Transformers not yet
arxiv12 FAST: Efficient Action Tokenization for Vision-Language-Action Models not yet
arxiv12 Do generative video models understand physical principles?
arxiv12 LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token not yet
arxiv12 Virgo: A Preliminary Exploration on Reproducing o1-like MLLM not yet
arxiv12 Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models not yet
arxiv11 O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning not yet
arxiv11 PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models not yet
arxiv11 Training Medical Large Vision-Language Models with Abnormal-Aware Feedback not yet
arxiv10 Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step not yet
arxiv10 Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation not yet
arxiv10 Imagine while Reasoning in Space: Multimodal Visualization-of-Thought
arxiv10 A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges not yet
arxiv10 Humanoid Locomotion and Manipulation: Current Progress and Challenges in Control, Planning, and Learning not yet
arxiv10 VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling not yet
arxiv10 Retrieval-Augmented Generation with Graphs (GraphRAG) not yet
arxiv9 International AI Safety Report not yet
arxiv9 Open Problems in Mechanistic Interpretability not yet
arxiv9 Qwen2.5-1M Technical Report not yet
arxiv9 UI-TARS: Pioneering Automated GUI Interaction with Native Agents
arxiv9 RedStar: Does Scaling Long-CoT Data Unlock Better Slow-Reasoning Systems? not yet
arxiv9 Detection of AI Deepfake and Fraud in Online Payments Using GAN-Based Models not yet
arxiv9 Tensor Product Attention Is All You Need not yet
arxiv9 Multi-Agent Collaboration Mechanisms: A Survey of LLMs not yet
arxiv9 Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains not yet
arxiv9 Circuit Complexity Bounds for Visual Autoregressive Model not yet
arxiv8 o3-mini vs DeepSeek-R1: Which One is Safer? not yet
arxiv8 Learning to Plan & Reason for Evaluation with Thinking-LLM-as-a-Judge not yet
arxiv8 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling not yet
arxiv8 InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model not yet
arxiv8 Pinching Antennas: Principles, Applications and Challenges not yet
arxiv8 A General Framework for Inference-time Scaling and Steering of Diffusion Models not yet
arxiv8 MinMo: A Multimodal Large Language Model for Seamless Voice Interaction not yet
arxiv8 URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics not yet
arxiv8 Rotatable Antenna Enabled Wireless Communication: Modeling and Optimization not yet
arxiv8 Test-Time Compute: from System-1 Thinking to System-2 Thinking not yet
arxiv7 SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer not yet
arxiv7 Molecular-driven Foundation Model for Oncologic Pathology not yet
arxiv7 Baichuan-Omni-1.5 Technical Report not yet
arxiv7 Enhancing Intent Understanding for Ambiguous prompt: A Human-Machine Co-Adaption Strategy not yet
arxiv7 Reasoning Language Models: A Blueprint not yet
arxiv7 VideoWorld: Exploring Knowledge Learning from Unlabeled Videos not yet
arxiv7 O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning not yet
arxiv7 Enhancing Low-Cost Video Editing with Lightweight Adaptors and Temporal-Aware Inversion not yet
arxiv7 OpenOmni: Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis not yet
arxiv7 CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings not yet
arxiv6 SETS: Leveraging Self-Verification and Self-Correction for Improved Test-Time Scaling not yet
arxiv6 Multimodal Large Language Models for Image, Text, and Speech Data Augmentation: A Survey not yet
arxiv6 Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation
arxiv6 Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate not yet
arxiv6 SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model not yet
arxiv6 Improving Video Generation with Human Feedback not yet
arxiv6 Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks not yet
arxiv6 Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training not yet
arxiv6 RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation not yet
arxiv6 Inference-Time Alignment in Diffusion Models with Reward-Guided Generation: Tutorial and Review not yet
arxiv6 Diffusion Adversarial Post-Training for One-Step Video Generation not yet
arxiv6 Motion Tracks: A Unified Representation for Human-Robot Transfer in Few-Shot Imitation Learning not yet
arxiv6 InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection not yet
arxiv6 ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling not yet
arxiv5 Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming not yet
arxiv5 Probing topological matter and fermion dynamics on a neutral-atom quantum computer not yet
arxiv5 AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders not yet
arxiv5 How Linguistics Learned to Stop Worrying and Love the Language Models not yet
arxiv5 Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies not yet
arxiv5 Eagle 2: Building Post-Training Data Strategies from Scratch for Frontier Vision-Language Models not yet
arxiv5 Fanar: An Arabic-Centric Multimodal Generative AI Platform not yet
arxiv5 Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos not yet
arxiv5 Multi-Level Attention and Contrastive Learning for Enhanced Text Classification with an Optimized Transformer not yet
arxiv5 GAMED-Snake: Gradient-aware Adaptive Momentum Evolution Deep Snake Model for Multi-organ Segmentation not yet
arxiv5 Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
arxiv5 A Survey on Multi-Turn Interaction Capabilities of Large Language Models not yet
arxiv5 Quantum-Centric Algorithm for Sample-Based Krylov Diagonalization not yet
arxiv5 Vision-Language Models Do Not Understand Negation not yet
arxiv5 Agentic Retrieval-Augmented Generation: A Survey on Agentic RAG not yet
arxiv5 Enhancing Automated Interpretability with Output-Centric Feature Descriptions not yet
arxiv5 WebWalker: Benchmarking LLMs in Web Traversal not yet
arxiv5 Understanding and Benchmarking Artificial Intelligence: OpenAI's o3 Is Not AGI not yet
arxiv5 Multi-subject Open-set Personalization in Video Generation not yet
arxiv5 Enabling Scalable Oversight via Self-Evolving Critic not yet
arxiv5 Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchmark not yet
arxiv5 ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning not yet
arxiv5 LLM4SR: A Survey on Large Language Models for Scientific Research not yet
arxiv5 Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives not yet
arxiv5 Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control not yet
arxiv5 The FACTS Grounding Leaderboard: Benchmarking LLMs' Ability to Ground Responses to Long-Form Input not yet
arxiv5 EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation not yet
arxiv5 SaLoRA: Safety-Alignment Preserved Low-Rank Adaptation not yet
arxiv5 Object-level Visual Prompts for Compositional Image Generation not yet
arxiv5 Nested Attention: Semantic-aware Attention Values for Concept Personalization not yet
arxiv5 LEO-Split: A Semi-Supervised Split Learning Framework over LEO Satellite Networks not yet
arxiv5 CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries not yet
arxiv5 OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning not yet
arxiv5 Dual Diffusion for Unified Image Generation and Understanding not yet
arxiv4 Reward-Guided Speculative Decoding for Efficient LLM Reasoning not yet
arxiv4 Efficient Reasoning with Hidden Thinking not yet
arxiv4 Diffusion Autoencoders are Scalable Image Tokenizers not yet
arxiv4 GuardReasoner: Towards Reasoning-based LLM Safeguards not yet
arxiv4 MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding not yet
arxiv4 Sparse Autoencoders Can Interpret Randomly Initialized Transformers not yet
arxiv4 Large Language Models for Code Generation: The Practitioners Perspective not yet
arxiv4 Parameter-Efficient Fine-Tuning for Foundation Models not yet
arxiv4 UGMathBench: A Diverse and Dynamic Benchmark for Undergraduate-Level Mathematical Reasoning with Large Language Models not yet
arxiv4 Low-dimensional adaptation of diffusion models: Convergence in total variation not yet
arxiv4 Continuous 3D Perception Model with Persistent State not yet
arxiv4 MMVU: Measuring Expert-Level Multi-Discipline Video Understanding not yet
arxiv4 Poison-RAG: Adversarial Data Poisoning Attacks on Retrieval-Augmented Generation in Recommender Systems not yet
arxiv4 Tell me about yourself: LLMs are aware of their learned behaviors not yet
arxiv4 Generative Physical AI in Vision: A Survey not yet
arxiv4 Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments not yet
arxiv4 Infrastructure for AI Agents not yet
arxiv4 A Simple Aerial Detection Baseline of Multimodal Language Models not yet
arxiv4 Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians not yet
arxiv4 What Limits LLM-based Human Simulation: LLMs or Our Design? not yet
arxiv4 Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models not yet
arxiv4 GameFactory: Creating New Games with Generative Interactive Videos not yet
arxiv4 CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation not yet
arxiv4 Exploring and Mitigating Adversarial Manipulation of Voting-Based Leaderboards not yet
arxiv4 MiniRAG: Towards Extremely Simple Retrieval-Augmented Generation not yet

※ Citation counts are taken from NASA ADS data as of the update date.
https://ui.adsabs.harvard.edu/
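
For readers who want to re-sort or filter the list, here is a minimal Python sketch that parses rows in the layout used above and orders them by citation count. The row layout (`arxiv<count> <title>`, optionally followed by `not yet`) is an assumption inferred from the list as it appears here, and the names `Row`, `parse_row`, and `rank` are hypothetical helpers introduced only for illustration. It does not query NASA ADS.

```python
import re
from typing import Iterable, List, NamedTuple


class Row(NamedTuple):
    citations: int
    title: str
    video_pending: bool  # True when the row ends with "not yet"


# Assumed layout, inferred from the list above:
#   "arxiv<count> <title>[ not yet]"
ROW_PATTERN = re.compile(r"^arxiv(\d+)\s+(.*?)(\s+not yet)?$")


def parse_row(line: str) -> Row:
    """Split one ranking row into citation count, title, and video status."""
    match = ROW_PATTERN.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    count, title, not_yet = match.groups()
    return Row(int(count), title, not_yet is not None)


def rank(lines: Iterable[str]) -> List[Row]:
    """Return rows sorted by citation count, highest first.

    sorted() is stable, so rows with equal counts keep their input order.
    """
    rows = [parse_row(line) for line in lines]
    return sorted(rows, key=lambda row: row.citations, reverse=True)


if __name__ == "__main__":
    sample = [
        "arxiv32 Humanity's Last Exam",
        "arxiv821 DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning not yet",
        "arxiv39 2 OLMo 2 Furious not yet",
    ]
    for row in rank(sample):
        print(row.citations, row.title, "(video pending)" if row.video_pending else "")
```

Because the pattern strips only the leading count and a trailing `not yet`, titles that themselves begin with a digit (such as "2 OLMo 2 Furious") are kept intact.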
