[arXiv Paper Ranking: Published October 2024] 2410.xxxxx

Posted at 2024-12-11

AI paper explainer YouTube channel: AI時代の羅針盤 ("Compass for the AI Era")

This ranking orders cs-category papers published around October 2024 (ID: 2410.xxxxx) by citation count. The ranking is updated from time to time.
(Updated January 16, 2025)

Citations   Title   Video
arxiv117 GPT-4o System Card not yet
arxiv82 Movie Gen: A Cast of Media Foundation Models
arxiv52 GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
arxiv25 Pixtral 12B not yet
arxiv25 Video Instruction Tuning With Synthetic Data not yet
arxiv23 Depth Pro: Sharp Monocular Metric Depth in Less Than a Second
arxiv22 MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion not yet
arxiv20 Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation not yet
arxiv20 Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think not yet
arxiv19 Moshi: a speech-text foundation model for real-time dialogue not yet
arxiv17 Loong: Generating Minute-level Long Videos with Autoregressive Language Models not yet
arxiv16 Aria: An Open Multimodal Native Mixture-of-Experts Model not yet
arxiv16 Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge not yet
arxiv15 Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
arxiv15 HART: Efficient Visual Generation with Hybrid Autoregressive Transformer not yet
arxiv15 HSR-Enhanced Sparse Attention Acceleration not yet
arxiv15 Looped ReLU MLPs May Be All You Need as Practical Programmable Computers not yet
arxiv15 SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference not yet
arxiv15 LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning not yet
arxiv14 Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models not yet
arxiv14 O1 Replication Journey: A Strategic Progress Report -- Part 1 not yet
arxiv14 GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation not yet
arxiv14 Pyramidal Flow Matching for Efficient Video Generative Modeling
arxiv14 How to Train Long-Context Language Models (Effectively) not yet
arxiv13 No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images not yet
arxiv13 DepthSplat: Connecting Gaussian Splatting and Depth not yet
arxiv13 Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix not yet
arxiv13 Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities not yet
arxiv13 Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models not yet
arxiv13 Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents not yet
arxiv12 Self-Supervised Graph Neural Networks for Enhanced Feature Extraction in Heterogeneous Information Networks not yet
arxiv12 Efficient and Aesthetic UI Design with a Deep Learning-Based Interface Generation Tree Algorithm not yet
arxiv12 Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent not yet
arxiv12 Differential Transformer
arxiv12 A Survey on Diffusion Models for Inverse Problems not yet
arxiv11 LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
arxiv11 Allegro: Open the Black Box of Commercial-Level Video Generation Model
arxiv11 A Recommendation Model Utilizing Separation Embedding and Self-Attention for Feature Mining not yet
arxiv11 OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models not yet
arxiv11 Impurities and polarons in bosonic quantum gases: a review on recent progress not yet
arxiv11 Fine-grained Attention I/O Complexity: Comprehensive Analysis for Backward Passes not yet
arxiv11 RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation not yet
arxiv11 IC3M: In-Car Multimodal Multi-object Monitoring for Abnormal Status of Both Driver and Passengers not yet
arxiv11 ImageFolder: Autoregressive Image Generation with Folded Tokens not yet
arxiv10 $\pi_0$: A Vision-Language-Action Flow Model for General Robot Control not yet
arxiv10 YOLOv11: An Overview of the Key Architectural Enhancements not yet
arxiv10 Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation not yet
arxiv10 MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models not yet
arxiv10 Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm not yet
arxiv10 Agent-as-a-Judge: Evaluate Agents with Agents
arxiv10 How to Construct Random Unitaries not yet
arxiv10 Balancing Innovation and Privacy: Data Security Strategies in Natural Language Processing Applications not yet
arxiv10 Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis not yet
arxiv10 Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training not yet
arxiv10 Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning not yet
arxiv10 Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation not yet
arxiv10 F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
arxiv10 LLaVA-Critic: Learning to Evaluate Multimodal Models not yet
arxiv10 HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly not yet
arxiv10 OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data not yet
arxiv10 HelpSteer2-Preference: Complementing Ratings with Preferences not yet
arxiv9 Data Scaling Laws in Imitation Learning for Robotic Manipulation not yet
arxiv9 Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs not yet
arxiv9 PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction not yet
arxiv9 Optimizing Retrieval-Augmented Generation with Elasticsearch for Enhanced Question-Answering Systems not yet
arxiv9 SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers not yet
arxiv9 DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation not yet
arxiv9 MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
arxiv9 CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL not yet
arxiv8 Orb: A Fast, Scalable Neural Network Potential not yet
arxiv8 CaloChallenge 2022: A Community Challenge for Fast Calorimeter Simulation not yet
arxiv8 Deep Learning for Medical Text Processing: BERT Model Fine-Tuning and Comparative Study not yet
arxiv8 Predicting Liquidity Coverage Ratio with Gated Recurrent Units: A Deep Learning Model for Risk Management not yet
arxiv8 Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms not yet
arxiv8 The XLZD Design Book: Towards the Next-Generation Liquid Xenon Observatory for Dark Matter and Neutrino Physics not yet
arxiv8 Jailbreaking and Mitigation of Vulnerabilities in Large Language Models not yet
arxiv8 Blockchain-Based Trust and Transparency in Airline Reservation Systems using Microservices Architecture not yet
arxiv8 Automated Genre-Aware Article Scoring and Feedback Using Large Language Models not yet
arxiv8 Jailbreaking LLM-Controlled Robots not yet
arxiv8 DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation not yet
arxiv8 Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies not yet
arxiv8 Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations not yet
arxiv8 Generative AI and Its Impact on Personalized Intelligent Tutoring Systems not yet
arxiv8 SpeGCL: Self-supervised Graph Spectrum Contrastive Learning without Positive Samples not yet
arxiv8 Baichuan-Omni Technical Report not yet
arxiv8 TextHawk2: A Large Vision-Language Model Excels in Bilingual OCR and Grounding with 16x Fewer Tokens not yet
arxiv8 Applying Hybrid Graph Neural Networks to Strengthen Credit Risk Analysis not yet
arxiv8 Were RNNs All We Needed?
arxiv7 MarDini: Masked Autoregressive Diffusion for Video Generation at Scale not yet
arxiv7 Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data not yet
arxiv7 Optimizing Travel Itineraries with AI Algorithms in a Microservices Architecture: Balancing Cost, Time, Preferences, and Sustainability not yet
arxiv7 One-Step Diffusion Distillation through Score Implicit Matching not yet
arxiv7 Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance not yet
arxiv7 Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages not yet
arxiv7 Beamforming Optimization for Continuous Aperture Array (CAPA)-based Communications not yet
arxiv7 Open Materials 2024 (OMat24) Inorganic Materials Dataset and Models not yet
arxiv7 CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos not yet
arxiv7 Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models
arxiv7 Liger Kernel: Efficient Triton Kernels for LLM Training not yet
arxiv7 TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models not yet
arxiv7 VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents not yet
arxiv7 The Future of Learning in the Age of Generative AI: Automated Question Generation and Assessment with Large Language Models not yet
arxiv7 AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents not yet
arxiv7 Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow not yet
arxiv7 Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models
arxiv7 Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation not yet
arxiv7 ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection not yet
arxiv7 AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark not yet
arxiv7 RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning
arxiv7 Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding not yet
arxiv7 Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown not yet
arxiv6 Deep Learning with HM-VGG: AI Strategies for Multi-modal Image Analysis not yet
arxiv6 In-Context LoRA for Diffusion Transformers not yet
arxiv6 EMMA: End-to-End Multimodal Model for Autonomous Driving not yet
arxiv6 Human-Centric eXplainable AI in Education not yet
arxiv6 MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark not yet
arxiv6 Graph Contrastive Learning via Cluster-refined Negative Sampling for Semi-supervised Text Classification not yet
arxiv6 Improve Vision Language Model Chain-of-thought Reasoning not yet
arxiv6 MCSFF: Multi-modal Consistency and Specificity Fusion Framework for Entity Alignment not yet
arxiv6 A Comparative Study on Reasoning Patterns of OpenAI's o1 Model not yet
arxiv6 ALOHA Unleashed: A Simple Recipe for Robot Dexterity not yet
arxiv6 MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generation not yet
arxiv6 Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws not yet
arxiv6 Latent Action Pretraining from Videos
arxiv6 When Attention Sink Emerges in Language Models: An Empirical View not yet
arxiv6 Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models not yet
arxiv6 Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models not yet
arxiv6 Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling not yet
arxiv6 MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models not yet
arxiv6 Progressive Autoregressive Video Diffusion Models not yet
arxiv6 Efficient Quantum Pseudorandomness from Hamiltonian Phase States not yet
arxiv6 Towards Interpreting Visual Information Processing in Vision-Language Models not yet
arxiv6 ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery not yet
arxiv6 Dynamic Diffusion Transformer not yet
arxiv6 Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise not yet
arxiv6 Iterated Radical Expansions and Convergence not yet
arxiv6 ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI not yet
arxiv5 Exposing Cross-Platform Coordinated Inauthentic Activity in the Run-Up to the 2024 U.S. Election not yet
arxiv5 A Systematic Assessment of OpenAI o1-Preview for Higher Order Thinking in Education not yet
arxiv5 Enhancing Resilience and Scalability in Travel Booking Systems: A Microservices Approach to Fault Tolerance, Load Balancing, and Service Discovery not yet
arxiv5 MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision not yet
arxiv5 FreeVS: Generative View Synthesis on Free Driving Trajectory not yet
arxiv5 OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation not yet
arxiv5 LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias not yet
arxiv5 Performance of the CMS high-level trigger during LHC Run 2 not yet
arxiv5 xGen-MM-Vid (BLIP-3-Video): You Only Need 32 Tokens to Represent a Video Even in VLMs not yet
arxiv5 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors not yet
arxiv5 A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications not yet
arxiv5 A Survey of Conversational Search not yet
arxiv5 Group Diffusion Transformers are Unsupervised Multitask Learners not yet
arxiv5 Iterative Methods via Locally Evolving Set Process not yet
arxiv5 NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples not yet
arxiv5 DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control not yet
arxiv5 Generative Reward Models not yet
arxiv5 JudgeBench: A Benchmark for Evaluating LLM-based Judges not yet
arxiv5 Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats not yet
arxiv5 Preference Optimization with Multi-Sample Comparisons not yet
arxiv5 MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark not yet
arxiv5 Boosting Camera Motion Control for Video Diffusion Transformers not yet
arxiv5 FunnelRAG: A Coarse-to-Fine Progressive Retrieval Paradigm for RAG not yet
arxiv5 The Ingredients for Robotic Diffusion Transformers not yet
arxiv5 Improved List Size for Folded Reed-Solomon Codes not yet
arxiv5 ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification not yet
arxiv5 ARCap: Collecting High-quality Human Demonstrations for Robot Learning with Augmented Reality Feedback not yet
arxiv5 Automated Creation of Digital Cousins for Robust Policy Learning not yet
arxiv5 Dynamic metastability in the self-attention model not yet
arxiv5 Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG not yet
arxiv5 Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion not yet
arxiv5 T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design not yet
arxiv5 Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification not yet
arxiv5 VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks not yet
arxiv5 Strong Model Collapse
arxiv5 Learning How Hard to Think: Input-Adaptive Allocation of LM Computation not yet
arxiv5 CAR: Controllable Autoregressive Modeling for Visual Generation not yet
arxiv5 Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution not yet
arxiv5 Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models not yet
arxiv5 Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models not yet
arxiv5 Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents not yet
arxiv5 MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences not yet
arxiv5 AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models not yet
arxiv5 ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning not yet
arxiv5 Deep Learning Alternatives of the Kolmogorov Superposition Theorem not yet
arxiv5 On the expressiveness and spectral bias of KANs not yet
arxiv5 softmax is not enough (for sharp out-of-distribution) not yet
arxiv5 MERIT: Multimodal Wearable Vital Sign Waveform Monitoring not yet
arxiv4 Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning not yet
arxiv4 Unearthing a Billion Telegram Posts about the 2024 U.S. Presidential Election: Development of a Public Dataset not yet
arxiv4 Social Science Meets LLMs: How Reliable Are Large Language Models in Social Simulations? not yet
arxiv4 OS-ATLAS: A Foundation Action Model for Generalist GUI Agents not yet
arxiv4 Safety cases for frontier AI not yet
arxiv4 Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse
arxiv4 One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation not yet
arxiv4 LoRA vs Full Fine-tuning: An Illusion of Equivalence
arxiv4 ElectionSim: Massive Population Election Simulation Powered by Large Language Model Driven Agents not yet
arxiv4 Kernel Approximation of Fisher-Rao Gradient Flows not yet
arxiv4 AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions not yet
arxiv4 Fast Best-of-N Decoding via Speculative Rejection not yet
arxiv4 A Survey of Small Language Models
arxiv4 A distributional simplicity bias in the learning dynamics of transformers not yet
arxiv4 Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback not yet
arxiv4 MotionCLR: Motion Generation and Training-free Editing via Understanding Attention Mechanisms not yet
arxiv4 AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant not yet
arxiv4 LLM-Slice: Dedicated Wireless Network Slicing for Large Language Models not yet
arxiv4 Large Language Models Reflect the Ideology of their Creators not yet
arxiv4 Quantum linear system algorithm with optimal queries to initial state preparation not yet
arxiv4 WorldSimBench: Towards Video Generation Models as World Simulators not yet
arxiv4 VoiceBench: Benchmarking LLM-Based Voice Assistants not yet
arxiv4 Collapse or Thrive? Perils and Promises of Synthetic Data in a Self-Generating World not yet
arxiv4 Beyond Browsing: API-Based Web Agents not yet
arxiv4 RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style not yet
arxiv4 Reducing Hallucinations in Vision-Language Models via Latent Space Steering not yet
arxiv4 LTPNet Integration of Deep Learning and Environmental Decision Support Systems for Renewable Energy Demand Forecasting not yet
arxiv4 Deep Learning for Weather Forecasting: A CNN-LSTM Hybrid Model for Predicting Historical Temperature Data not yet
arxiv4 Transversal non-Clifford gates for quantum LDPC codes on sheaves not yet
arxiv4 REEF: Representation Encoding Fingerprints for Large Language Models not yet
arxiv4 Nova: An Iterative Planning and Search Approach to Enhance Novelty and Diversity of LLM Generated Ideas not yet
arxiv4 From PINNs to PIKANs: Recent Advances in Physics-Informed Machine Learning not yet
arxiv4 SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation not yet
arxiv4 FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression not yet
arxiv4 WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines not yet
arxiv4 DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception not yet
arxiv4 Expanding Chatbot Knowledge in Customer Service: Context-Aware Similar Question Generation Using Large Language Models not yet
arxiv4 Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies not yet
arxiv4 OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation not yet
arxiv4 DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
arxiv4 Locking Down the Finetuned LLMs Safety not yet
arxiv4 Animate-X: Universal Character Image Animation with Enhanced Motion Representation not yet
arxiv4 Safety-Aware Fine-Tuning of Large Language Models not yet
arxiv4 Targeted Vaccine: Safety Alignment for Large Language Models against Harmful Fine-Tuning via Layer-wise Perturbation not yet
arxiv4 Taming Overconfidence in LLMs: Reward Calibration in RLHF not yet
arxiv4 Toward General Instruction-Following Alignment for Retrieval-Augmented Generation not yet
arxiv4 Two Heads Are Better Than One: A Multi-Agent System Has the Potential to Improve Scientific Idea Generation not yet
arxiv4 AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation not yet
arxiv4 VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model not yet
arxiv4 Losing dimensions: Geometric memorization in generative diffusion not yet
arxiv4 Language model developers should report train-test overlap not yet
arxiv4 Scaling Laws For Diffusion Transformers not yet
arxiv4 Human and LLM Biases in Hate Speech Annotations: A Socio-Demographic Analysis of Annotators and Targets not yet
arxiv4 MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting not yet
arxiv4 SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection not yet
arxiv4 Toward hybrid quantum simulations with qubits and qumodes on trapped-ion platforms not yet
arxiv4 IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation not yet
arxiv4 Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making not yet
arxiv4 Personalized Visual Instruction Tuning not yet
arxiv4 Degree Distribution based Spiking Graph Networks for Domain Adaptation not yet
arxiv4 TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training not yet
arxiv4 Restructuring Vector Quantization with the Rotation Trick not yet
arxiv4 HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction not yet
arxiv4 EVOLvE: Evaluating and Optimizing LLMs For Exploration not yet
arxiv4 Round and Round We Go! What makes Rotary Positional Encodings useful? not yet
arxiv4 MDAP: A Multi-view Disentangled and Adaptive Preference Learning Framework for Cross-Domain Recommendation not yet
arxiv4 TRACE: Temporal Grounding Video LLM via Causal Event Modeling not yet
arxiv4 Falcon Mamba: The First Competitive Attention-free 7B Language Model not yet
arxiv4 AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs not yet
arxiv4 MIBench: A Comprehensive Benchmark for Model Inversion Attack and Defense not yet
arxiv4 Large Language Model Based Multi-Objective Optimization for Integrated Sensing and Communications in UAV Networks not yet
arxiv4 Gibbs state preparation for commuting Hamiltonian: Mapping to classical Gibbs sampling not yet
arxiv4 Stochastic Runge-Kutta Methods: Provable Acceleration of Diffusion Models not yet
arxiv4 LRQ-Fact: LLM-Generated Relevant Questions for Multimodal Fact-Checking not yet
arxiv4 Hammer: Robust Function-Calling for On-Device Language Models via Function Masking not yet
arxiv4 Towards Secure Tuning: Mitigating Security Risks Arising from Benign Instruction Fine-Tuning not yet
arxiv4 Recent Advances in Speech Language Models: A Survey not yet
arxiv4 What Matters for Model Merging at Scale?
arxiv4 Autoregressive Large Language Models are Computationally Universal
arxiv4 AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML not yet
arxiv4 Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations not yet
arxiv4 Contrastive Localized Language-Image Pre-Training not yet
arxiv4 SteerDiff: Steering towards Safe Text-to-Image Diffusion Models not yet
arxiv4 Selective Attention Improves Transformer not yet
arxiv4 CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs not yet
arxiv4 LEGO: Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion not yet
arxiv4 The Patterns of Life Human Mobility Simulation not yet
arxiv3 EgoMimic: Scaling Imitation Learning via Egocentric Video not yet
arxiv3 DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion not yet
arxiv3 Natural gradient and parameter estimation for quantum Boltzmann machines not yet
arxiv3 Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model not yet
arxiv3 Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs
arxiv3 Mitigating Challenges in Ethereum's Proof-of-Stake Consensus: Evaluating the Impact of EigenLayer and Lido not yet
arxiv3 Emergence of meta-stable clustering in mean-field transformer models not yet
arxiv3 YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems not yet
arxiv3 Effective and Efficient Adversarial Detection for Vision-Language Models via A Single Vector not yet
arxiv3 Optimizing Posterior Samples for Bayesian Optimization via Rootfinding not yet
arxiv3 A note on polynomial-time tolerant testing stabilizer states not yet
arxiv3 PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting not yet
arxiv3 From Explicit Rules to Implicit Reasoning in an Interpretable Violence Monitoring System not yet
arxiv3 MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding not yet
arxiv3 ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference not yet
arxiv3 OpenCity: A Scalable Platform to Simulate Urban Activities with Massive LLM Agents not yet
arxiv3 AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? not yet
arxiv3 HOVER: Versatile Neural Whole-Body Controller for Humanoid Robots not yet
arxiv3 SoS Certifiability of Subgaussian Distributions and its Algorithmic Applications not yet
arxiv3 Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction not yet
arxiv3 SEG:Seeds-Enhanced Iterative Refinement Graph Neural Network for Entity Alignment not yet
arxiv3 What Factors Affect Multi-Modal In-Context Learning? An In-Depth Exploration not yet
arxiv3 Centaur: a foundation model of human cognition
arxiv3 SCube: Instant Large-Scale Scene Reconstruction using VoxSplats not yet
arxiv3 YOLO11 and Vision Transformers based 3D Pose Estimation of Immature Green Fruits in Commercial Apple Orchards for Robotic Thinning not yet
arxiv3 OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization not yet
arxiv3 FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality not yet
arxiv3 VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks not yet
arxiv3 From a Tiny Slip to a Giant Leap: An LLM-Based Simulation for Fake News Evolution not yet
arxiv3 Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences not yet
arxiv3 Conceptual Design of the Muonium-to-Antimuonium Conversion Experiment (MACE) not yet
arxiv3 Denoising diffusion probabilistic models are optimally adaptive to unknown low dimensionality not yet
arxiv3 Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances not yet
arxiv3 Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from Scratch not yet
arxiv3 Optimal Equivariant Architectures from the Symmetries of Matrix-Element Likelihoods not yet
arxiv3 Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks not yet
arxiv3 Improving Model Factuality with Fine-grained Critique-based Evaluator not yet
arxiv3 CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation not yet
arxiv3 Stochastic gradient descent in high dimensions for multi-spiked tensor PCA not yet
arxiv3 Using Platt's scaling for calibration after undersampling -- limitations and how to address them not yet
arxiv3 Analyzing Nobel Prize Literature with Large Language Models not yet
arxiv3 Advanced simulations with PLUMED: OPES and Machine Learning Collective Variables not yet
arxiv3 Scalable Ranked Preference Optimization for Text-to-Image Generation not yet
arxiv3 MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models not yet
arxiv3 GDDA: Semantic OOD Detection on Graphs under Covariate Shift via Score-Based Diffusion Models not yet

Note: Citation counts are taken from NASA ADS data as of the update date.
https://ui.adsabs.harvard.edu/
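
The ordering used in the table above can be illustrated with a minimal sketch. It assumes the citation counts have already been collected (retrieval from NASA ADS is not shown), and the names `Paper` and `rank_by_citations` are hypothetical, introduced only for this illustration. The sample values are the top rows of the table as of the January 16, 2025 update.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Paper:
    """One row of the ranking: a paper title and its citation count."""
    title: str
    citations: int


def rank_by_citations(papers):
    """Return papers sorted by citation count, highest first.

    sorted() is stable even with reverse=True, so papers with equal
    counts keep their relative input order.
    """
    return sorted(papers, key=lambda p: p.citations, reverse=True)


# Sample rows copied from the table above, deliberately shuffled.
sample = [
    Paper("GSM-Symbolic: Understanding the Limitations of Mathematical "
          "Reasoning in Large Language Models", 52),
    Paper("Pixtral 12B", 25),
    Paper("GPT-4o System Card", 117),
    Paper("Video Instruction Tuning With Synthetic Data", 25),
    Paper("Movie Gen: A Cast of Media Foundation Models", 82),
]

ranked = rank_by_citations(sample)
for paper in ranked:
    print(f"{paper.citations:>4}  {paper.title}")
```

Because the counts change over time, re-running the sort on fresh data is all that a periodic update of the ranking would need under these assumptions.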
