0
1

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?

【2024年7月公開 Arxiv論文ランキング】2407.xxxxx

Last updated at Posted at 2024-10-01

AI論文解説 Youtubeチャンネル: AI時代の羅針盤

2024年7月頃に公開されたcsカテゴリの論文 (ID: 2407.xxxxx)を被引用数のデータを元にランキングしています。ランキングは随時更新します。
(2024年12月9日更新)

被引用数   タイトル 動画
arxiv1751 The Llama 3 Herd of Models not yet
arxiv558 Qwen2 Technical Report
arxiv65 PaliGemma: A versatile 3B VLM for transfer not yet
arxiv64 LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models not yet
arxiv57 Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
arxiv50 Qwen2-Audio Technical Report not yet
arxiv46 InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output not yet
arxiv38 Random unitaries in extremely low depth not yet
arxiv37 FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precision
arxiv35 OpenHands: An Open Platform for AI Software Developers as Generalist Agents
arxiv35 Learning to (Learn at Test Time): RNNs with Expressive Hidden States not yet
arxiv34 Agentless: Demystifying LLM-based Software Engineering Agents not yet
arxiv31 CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens not yet
arxiv30 Gymnasium: A Standard Interface for Reinforcement Learning Environments not yet
arxiv28 LLM Critics Help Catch LLM Bugs
arxiv27 A Survey on Mixture of Experts not yet
arxiv25 Apple Intelligence Foundation Language Models
arxiv25 KAN or MLP: A Fairer Comparison not yet
arxiv25 OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation not yet
arxiv25 Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving not yet
arxiv24 MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention
arxiv24 Open-TeleVision: Teleoperation with Immersive Active Visual Feedback not yet
arxiv23 Compact Language Models via Pruning and Knowledge Distillation
arxiv23 Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders not yet
arxiv22 MUSE: Machine Unlearning Six-Way Evaluation for Language Models not yet
arxiv21 Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
arxiv21 VLMEvalKit: An Open-Source Toolkit for Evaluating Large Multi-Modality Models not yet
arxiv20 Discrete Flow Matching not yet
arxiv20 LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models not yet
arxiv20 Jailbreak Attacks and Defenses Against Large Language Models: A Survey not yet
arxiv20 TokenPacker: Efficient Visual Projector for Multimodal LLM not yet
arxiv20 AI Agents That Matter not yet
arxiv19 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies not yet
arxiv19 MambaVision: A Hybrid Mamba-Transformer Vision Backbone
arxiv19 Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence not yet
arxiv19 TrialBench: Multi-Modal Artificial Intelligence-Ready Clinical Trial Datasets not yet
arxiv18 Stable Audio Open
arxiv18 SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning not yet
arxiv18 VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control not yet
arxiv18 EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditions not yet
arxiv18 RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs not yet
arxiv17 Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process not yet
arxiv17 A polynomial-time classical algorithm for noisy quantum circuits not yet
arxiv17 MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine not yet
arxiv17 JailbreakZoo: Survey, Landscapes, and Horizons in Jailbreaking Large Language and Vision-Language Models not yet
arxiv17 Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
arxiv16 Tora: Trajectory-oriented Diffusion Transformer for Video Generation not yet
arxiv16 LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding not yet
arxiv16 Deep Time Series Models: A Comprehensive Survey and Benchmark not yet
arxiv16 Vision language models are blind not yet
arxiv16 LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages not yet
arxiv16 LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control not yet
arxiv16 A Review of Large Language Models and Autonomous Agents in Chemistry not yet
arxiv16 Tree Search for Language Model Agents not yet
arxiv15 Adaptive Training of Grid-Dependent Physics-Informed Kolmogorov-Arnold Networks not yet
arxiv15 SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models not yet
arxiv15 Open Problems in Technical AI Governance not yet
arxiv15 Shape of Motion: 4D Reconstruction from a Single Video not yet
arxiv15 DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving not yet
arxiv15 Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation not yet
arxiv15 MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis not yet
arxiv15 FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds not yet
arxiv15 Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion not yet
arxiv15 SplitLoRA: A Split Parameter-Efficient Fine-Tuning Framework for Large Language Models not yet
arxiv15 Tarsier: Recipes for Training and Evaluating Large Video Description Models not yet
arxiv14 Benchmarking and fidelity response theory of high-fidelity Rydberg entangling gates not yet
arxiv14 mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval not yet
arxiv14 Mobile Edge Intelligence for Large Language Models: A Contemporary Survey not yet
arxiv14 A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More not yet
arxiv14 Consent in Crisis: The Rapid Decline of the AI Data Commons not yet
arxiv14 NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window? not yet
arxiv14 Distilling System 2 into System 1
arxiv14 7th ABAW Competition: Multi-Task Learning and Compound Expression Recognition not yet
arxiv13 Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs not yet
arxiv13 Internal Consistency and Self-Feedback in Large Language Models: A Survey not yet
arxiv13 A Survey of Prompt Engineering Methods in Large Language Models for Different NLP Tasks
arxiv13 SEED-Story: Multimodal Long Story Generation with Large Language Model
arxiv13 Scalable, high-fidelity all-electronic control of trapped-ion qubits not yet
arxiv13 Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI not yet
arxiv13 OffsetBias: Leveraging Debiased Data for Tuning Evaluators not yet
arxiv13 RegMix: Data Mixture as Regression for Language Model Pre-training not yet
arxiv12 ShieldGemma: Generative AI Content Moderation Based on Gemma not yet
arxiv12 AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases not yet
arxiv12 The Art of Saying No: Contextual Noncompliance in Language Models not yet
arxiv12 EfficientQAT: Efficient Quantization-Aware Training for Large Language Models not yet
arxiv12 A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends not yet
arxiv12 Reuse, Don't Retrain: A Recipe for Continued Pretraining of Language Models not yet
arxiv12 AdaPI: Facilitating DNN Model Adaptivity for Efficient Private Inference in Edge Computing not yet
arxiv12 Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition not yet
arxiv12 Bunny-VisionPro: Real-Time Bimanual Dexterous Teleoperation for Imitation Learning not yet
arxiv12 A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models not yet
arxiv12 Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models not yet
arxiv11 Can Editing LLMs Inject Harm? not yet
arxiv11 Preliminary WMT24 Ranking of General MT Systems and LLMs not yet
arxiv11 SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
arxiv11 VILA$^2$: VILA Augmented VILA not yet
arxiv11 Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach not yet
arxiv11 Advanced AI Framework for Enhanced Detection and Assessment of Abdominal Trauma: Integrating 3D Segmentation with 2D CNN and RNN Models not yet
arxiv11 Movable Antenna-Enhanced Wireless Communications: General Architectures and Implementation Methods not yet
arxiv11 Interim report for the International Muon Collider Collaboration (IMCC) not yet
arxiv11 Does Refusal Training in LLMs Generalize to the Past Tense? not yet
arxiv11 The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism not yet
arxiv11 Deconstructing What Makes a Good Optimizer for Language Models not yet
arxiv11 Controlling Space and Time with Diffusion Models not yet
arxiv11 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation not yet
arxiv11 MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? not yet
arxiv11 Mixture of A Million Experts
arxiv11 FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs not yet
arxiv11 Quantum coarsening and collective dynamics on a programmable quantum simulator not yet
arxiv11 RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs not yet
arxiv11 CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents not yet
arxiv10 Recursive Introspection: Teaching Language Model Agents How to Self-Improve not yet
arxiv10 Demystifying Verbatim Memorization in Large Language Models not yet
arxiv10 NV-Retriever: Improving text embedding models with effective hard-negative mining not yet
arxiv10 BOND: Aligning LLMs with Best-of-N Distillation not yet
arxiv10 Prover-Verifier Games improve legibility of LLM outputs not yet
arxiv10 A Comprehensive Survey on Kolmogorov Arnold Networks (KAN) not yet
arxiv10 Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? not yet
arxiv10 Benchmarking Vision Language Models for Cultural Understanding not yet
arxiv10 Robotic Control via Embodied Chain-of-Thought Reasoning not yet
arxiv10 Autoregressive Speech Synthesis without Vector Quantization not yet
arxiv10 Induction Heads as an Essential Mechanism for Pattern Matching in In-context Learning not yet
arxiv10 Entropy Law: The Story Behind Data Compression and LLM Performance not yet
arxiv10 MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions not yet
arxiv10 RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models not yet
arxiv10 Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs
arxiv10 Benchmarking Complex Instruction-Following with Multiple Constraints Composition not yet
arxiv10 MMedAgent: Learning to Use Medical Tools with Multi-modal Agent not yet
arxiv10 Learning tensor networks with tensor cross interpolation: new algorithms and libraries not yet
arxiv10 We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning? not yet
arxiv10 Searching for Best Practices in Retrieval-Augmented Generation not yet
arxiv10 UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI not yet
arxiv9 Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs not yet
arxiv9 MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts
arxiv9 Cross-modality Information Check for Detecting Jailbreaking in Multimodal Large Language Models not yet
arxiv9 Machine Unlearning in Generative AI: A Survey not yet
arxiv9 MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
arxiv9 RLCoder: Reinforcement Learning for Repository-Level Code Completion not yet
arxiv9 Inferring turbulent velocity and temperature fields and their statistics from Lagrangian velocity measurements using physics-informed Kolmogorov-Arnold Networks not yet
arxiv9 When Can Transformers Count to n? not yet
arxiv9 Reduced Effectiveness of Kolmogorov-Arnold Networks on Functions with Noise not yet
arxiv9 Differential Privacy of Cross-Attention with Provable Guarantee not yet
arxiv9 T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation not yet
arxiv9 Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review not yet
arxiv9 Semantic Operators: A Declarative Model for Rich, AI-based Analytics Over Text Data not yet
arxiv9 LAB-Bench: Measuring Capabilities of Language Models for Biology Research not yet
arxiv9 Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training not yet
arxiv9 $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ not yet
arxiv9 Rectifier: Code Translation with Corrector via LLMs not yet
arxiv9 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes not yet
arxiv9 Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps not yet
arxiv9 UltraEdit: Instruction-based Fine-Grained Image Editing at Scale not yet
arxiv9 LoRA-GA: Low-Rank Adaptation with Gradient Approximation not yet
arxiv9 On scalable oversight with weak LLMs judging strong LLMs not yet
arxiv9 Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry, Texture, and PBR Materials not yet
arxiv9 MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations not yet
arxiv9 Efficient Long-distance Latent Relation-aware Graph Neural Network for Multi-modal Emotion Recognition in Conversations not yet
arxiv8 FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention not yet
arxiv8 The Emerged Security and Privacy of LLM Agent: A Survey with Case Studies not yet
arxiv8 Know Your Limits: A Survey of Abstention in Large Language Models not yet
arxiv8 Exploring the Limitations of Kolmogorov-Arnold Networks in Classification: Insights to Software Training and Hardware Implementation not yet
arxiv8 Adaptive Robot Detumbling of a Non-Rigid Satellite not yet
arxiv8 AMEX: Android Multi-annotation Expo Dataset for Mobile GUI Agents not yet
arxiv8 Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning not yet
arxiv8 When Do Universal Image Jailbreaks Transfer Between Vision-Language Models? not yet
arxiv8 Flow as the Cross-Domain Manipulation Interface not yet
arxiv8 Knowledge Mechanisms in Large Language Models: A Survey and Perspective not yet
arxiv8 Falcon2-11B Technical Report not yet
arxiv8 Weak-to-Strong Reasoning not yet
arxiv8 Differential Privacy Mechanisms in Neural Tangent Kernel Regression not yet
arxiv8 Surgical Robot Transformer (SRT): Imitation Learning for Surgical Tasks not yet
arxiv8 BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval not yet
arxiv8 AutoFlow: Automated Workflow Generation for Large Language Model Agents not yet
arxiv8 LLM Inference Serving: Survey of Recent Advances and Opportunities not yet
arxiv8 LongLaMP: A Benchmark for Personalized Long-form Text Generation not yet
arxiv8 UMI on Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers not yet
arxiv8 Lean-STaR: Learning to Interleave Thinking and Proving
arxiv8 The Heterophilic Graph Learning Handbook: Benchmarks, Models, Theoretical Analysis, Applications and Challenges not yet
arxiv8 A Neural Matrix Decomposition Recommender System Model based on the Multimodal Large Language Model not yet
arxiv8 The Solar and Geomagnetic Storms in May 2024: A Flash Data Report not yet
arxiv8 On Leakage of Code Generation Evaluation Datasets not yet
arxiv8 Advanced Financial Fraud Detection Using GNN-CL Model not yet
arxiv8 What's Wrong with Your Code Generated by Large Language Models? An Extensive Study not yet
arxiv8 Empowering 1000 tokens/second on-device LLM prefilling with mllm-NPU not yet
arxiv8 Logical Operators and Fold-Transversal Gates of Bivariate Bicycle Codes not yet
arxiv8 Stacked Intelligent Metasurfaces for Wireless Sensing and Communication: Applications and Challenges not yet
arxiv8 Large-scale quantum reservoir learning with an analog quantum computer not yet
arxiv8 Research on Autonomous Robots Navigation based on Reinforcement Learning not yet
arxiv8 CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models not yet
arxiv8 FineCLIPER: Multi-modal Fine-grained CLIP for Dynamic Facial Expression Recognition with AdaptERs not yet
arxiv8 EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning not yet
arxiv8 On Statistical Rates and Provably Efficient Criteria of Latent Diffusion Transformers (DiTs) not yet
arxiv8 Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles not yet
arxiv8 Diffusion Models and Representation Learning: A Survey not yet
arxiv8 Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP not yet
arxiv8 Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks not yet
arxiv7 Enhanced Self-Checkout System for Retail Based on Improved YOLOv10 not yet
arxiv7 Direct Unlearning Optimization for Robust and Safe Text-to-Image Models not yet
arxiv7 COEFF-KANs: A Paradigm to Address the Electrolyte Field with KANs not yet
arxiv7 Towards Effective and Efficient Continual Pre-training of Large Language Models not yet
arxiv7 PersonaGym: Evaluating Persona Agents and LLMs not yet
arxiv7 Physics Informed Kolmogorov-Arnold Neural Networks for Dynamical Analysis via Efficent-KAN and WAV-KAN not yet
arxiv7 AI Safety in Generative AI Large Language Models: A Survey not yet
arxiv7 Keep the Cost Down: A Review on Methods to Optimize LLM' s KV-Cache Consumption not yet
arxiv7 Financial Statement Analysis with Large Language Models not yet
arxiv7 HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation not yet
arxiv7 PyBench: Evaluating LLM Agent on various real-world coding tasks not yet
arxiv7 MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence not yet
arxiv7 RazorAttention: Efficient KV Cache Compression Through Retrieval Heads not yet
arxiv7 Data driven weather forecasts trained and initialised directly from observations not yet
arxiv7 BIGbench: A Unified Benchmark for Social Bias in Text-to-Image Generative Models Based on Multi-modal LLM not yet
arxiv7 Not All Noises Are Created Equally:Diffusion Noise Selection and Optimization not yet
arxiv7 SciCode: A Research Coding Benchmark Curated by Scientists not yet
arxiv7 DropKAN: Regularizing KANs by masking post-activations not yet
arxiv7 Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems not yet
arxiv7 Reasoning with Large Language Models, a Survey
arxiv7 Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models not yet
arxiv7 Tailoring Solution Accuracy for Fast Whole-body Model Predictive Control of Legged Robots not yet
arxiv7 AccDiffusion: An Accurate Method for Higher-Resolution Image Generation not yet
arxiv7 Self-Consuming Generative Models with Curated Data Provably Optimize Human Preferences not yet
arxiv7 Video Diffusion Alignment via Reward Gradients not yet
arxiv7 Lynx: An Open Source Hallucination Evaluation Model not yet
arxiv7 Model Tells You Where to Merge: Adaptive KV Cache Merging for LLMs on Long-Context Tasks not yet
arxiv7 AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models not yet
arxiv7 Decentralized Adaptive Aerospace Transportation of Unknown Loads Using A Team of Robots not yet
arxiv7 Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs not yet
arxiv7 Diffusion Model-Based Video Editing: A Survey not yet
arxiv7 CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation not yet
arxiv7 Enabling 6G Performance in the Upper Mid-Band by Transitioning From Massive to Gigantic MIMO not yet
arxiv7 Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation not yet
arxiv7 LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts not yet
arxiv7 RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation not yet
arxiv7 Trustworthy Classification through Rank-Based Conformal Prediction Sets not yet
arxiv7 ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild not yet
arxiv7 MiniGPT-Med: Large Language Model as a General Interface for Radiology Diagnosis not yet
arxiv7 BM25S: Orders of magnitude faster lexical search via eager sparse scoring not yet
arxiv7 TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts not yet
arxiv7 Effect of a Process Mining based Pre-processing Step in Prediction of the Critical Health Outcomes not yet
arxiv7 Meta Large Language Model Compiler: Foundation Models of Compiler Optimization not yet
arxiv7 Meta 3D TextureGen: Fast and Consistent Texture Generation for 3D Objects not yet
arxiv7 DrugCLIP: Contrastive Drug-Disease Interaction For Drug Repurposing not yet
arxiv7 PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning not yet
arxiv7 UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks not yet
arxiv7 DiscoveryBench: Towards Data-Driven Discovery with Large Language Models not yet
arxiv7 MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs not yet
arxiv7 Tracking the 2024 US Presidential Election Chatter on Tiktok: A Public Multimodal Dataset not yet
arxiv7 Non-Hermitian skin effect in arbitrary dimensions: non-Bloch band theory and classification not yet
arxiv7 MIRAI: Evaluating LLM Agents for Event Forecasting not yet
arxiv7 Kolmogorov-Arnold Convolutions: Design Principles and Empirical Studies not yet
arxiv7 Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning not yet
arxiv7 Quantum Circuit Synthesis and Compilation Optimization: Overview and Prospects not yet
arxiv7 Large-scale, Independent and Comprehensive study of the power of LLMs for test case generation not yet
arxiv6 Berkeley Humanoid: A Research Platform for Learning-based Control not yet
arxiv6 DKL-KAN: Scalable Deep Kernel Learning using Kolmogorov-Arnold Networks not yet
arxiv6 ThinK: Thinner Key Cache by Query-Driven Pruning not yet
arxiv6 Diffusion Feedback Helps CLIP See Better not yet
arxiv6 Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle not yet
arxiv6 Autonomous Navigation of Unmanned Vehicle Through Deep Reinforcement Learning not yet
arxiv6 LoRA-Pro: Are Low-Rank Adapters Properly Optimized? not yet
arxiv6 LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN prover not yet
arxiv6 From Text to Insight: Large Language Models for Materials Science Data Extraction not yet
arxiv6 Separable DeepONet: Breaking the Curse of Dimensionality in Physics-Informed Machine Learning not yet
arxiv6 Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data
arxiv6 NNsight and NDIF: Democratizing Access to Foundation Model Internals not yet
arxiv6 EVLM: An Efficient Vision-Language Model for Visual Understanding not yet
arxiv6 LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference not yet
arxiv6 Towards Trustworthy AI: A Review of Ethical and Robust Large Language Models not yet
arxiv6 GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model not yet
arxiv6 Non-Fermi liquid and antiferromagnetic correlations with hole doping in the bilayer two-orbital Hubbard model of La$_3$Ni$_2$O$_7$ at zero temperature not yet
arxiv6 Retrieval-Augmented Generation for Natural Language Processing: A Survey not yet
arxiv6 R+X: Retrieval and Execution from Everyday Human Videos not yet
arxiv6 The Foundation Model Transparency Index v1.1: May 2024 not yet
arxiv6 Scaling Retrieval-Based Language Models with a Trillion-Token Datastore not yet
arxiv6 PQCache: Product Quantization-based KVCache for Long Context LLM Inference not yet
arxiv6 IMAGDressing-v1: Customizable Virtual Dressing not yet
arxiv6 The Better Angels of Machine Personality: How Personality Relates to LLM Safety not yet
arxiv6 GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression not yet
arxiv6 Scaling Diffusion Transformers to 16 Billion Parameters not yet
arxiv6 Trust No Bot: Discovering Personal Disclosures in Human-LLM Conversations in the Wild not yet
arxiv6 Integrating Amortized Inference with Diffusion Models for Learning Clean Distribution from Corrupted Images not yet
arxiv6 Back to Newton's Laws: Learning Vision-based Agile Flight via Differentiable Physics not yet
arxiv6 Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena not yet
arxiv6 What Makes and Breaks Safety Fine-tuning? A Mechanistic Study not yet
arxiv6 InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation not yet
arxiv6 Real-time gravitational-wave inference for binary neutron stars using machine learning not yet
arxiv6 Human-like Episodic Memory for Infinite Context LLMs not yet
arxiv6 Evaluating AI Evaluation: Perils and Prospects not yet
arxiv6 Benchmarking quantum computers not yet
arxiv6 Still-Moving: Customized Video Generation without Customized Video Data not yet
arxiv6 WildGaussians: 3D Gaussian Splatting in the Wild not yet
arxiv6 Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting not yet
arxiv6 Source Code Summarization in the Era of Large Language Models
arxiv6 Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities not yet
arxiv6 Fine-Tuning Large Language Models with User-Level Differential Privacy not yet
arxiv6 Video-to-Audio Generation with Hidden Alignment not yet
arxiv6 From Principles to Rules: A Regulatory Approach for Frontier AI not yet
arxiv6 Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models not yet
arxiv6 RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models not yet
arxiv6 Variational Best-of-N Alignment not yet
arxiv6 On the Limitations of Compute Thresholds as a Governance Strategy not yet
arxiv6 Language Representations Can be What Recommenders Need: Findings and Potentials not yet
arxiv6 Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models not yet
arxiv6 Configurable DOA Estimation using Incremental Learning not yet
arxiv6 AgentInstruct: Toward Generative Teaching with Agentic Flows not yet
arxiv6 Solving Motion Planning Tasks with a Scalable Generative Model not yet
arxiv6 LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning not yet
arxiv6 Consistency Flow Matching: Defining Straight Flows with Velocity Consistency not yet
arxiv6 LDP: A Local Diffusion Planner for Efficient Robot Navigation and Collision Avoidance not yet
arxiv6 Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application not yet
arxiv6 Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing not yet
arxiv6 ColPali: Efficient Document Retrieval with Vision Language Models not yet
arxiv6 $\text{Memory}^3$: Language Modeling with Explicit Memory not yet
arxiv6 Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents not yet
arxiv6 Posterior Sampling with Denoising Oracles via Tilted Transport not yet
arxiv6 LiteSearch: Efficacious Tree Search for LLM not yet
arxiv5 Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress? not yet
arxiv5 ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models not yet
arxiv5 Model Attribution in LLM-Generated Disinformation: A Domain Generalization Approach with Supervised Contrastive Learning not yet
arxiv5 Modular RAG: Transforming RAG Systems into LEGO-like Reconfigurable Frameworks not yet
arxiv5 AI-Assisted Generation of Difficult Math Questions not yet
arxiv5 Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification not yet
arxiv5 Rethinking the Function of Neurons in KANs not yet
arxiv5 Theia: Distilling Diverse Vision Foundation Models for Robot Learning not yet
arxiv5 F-KANs: Federated Kolmogorov-Arnold Networks not yet
arxiv5 Mixture of Nested Experts: Adaptive Processing of Visual Tokens not yet
arxiv5 OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation not yet
arxiv5 Hybrid summary statistics: neural weak lensing inference beyond the power spectrum not yet
arxiv5 Effects of Scale on Language Model Robustness
arxiv5 Transformers on Markov Data: Constant Depth Suffices not yet
arxiv5 WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries not yet
arxiv5 u-$\mu$P: The Unit-Scaled Maximal Update Parametrization not yet
arxiv5 AIR-Bench 2024: A Safety Benchmark Based on Risk Categories from Regulations and Policies not yet
arxiv5 A Basis-Free Phase Space Electronic Hamiltonian That Recovers Beyond Born-Oppenheimer Electronic Momentum and Current Density not yet
arxiv5 Data-driven Koopman operator predictions of turbulent dynamics in models of shear flows not yet
arxiv5 Predicting Stock Prices with FinBERT-LSTM: Integrating News Sentiment Analysis not yet
arxiv5 Implications of the laser excitation of the Th-229 nucleus for dark matter searches not yet
arxiv5 MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity not yet
arxiv5 AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? not yet
arxiv5 Imposter.AI: Adversarial Attacks with Hidden Intentions towards Aligned Large Language Models not yet
arxiv5 A Multimodal Knowledge-enhanced Whole-slide Pathology Foundation Model not yet
arxiv5 XAI meets LLMs: A Survey of the Relation between Explainable AI and Large Language Models not yet
arxiv5 Deep State Space Recurrent Neural Networks for Time Series Forecasting not yet
arxiv5 Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning not yet
arxiv5 FMamba: Mamba based on Fast-attention for Multivariate Time-series Forecasting not yet
arxiv5 ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities not yet
arxiv5 NeuroBind: Towards Unified Multimodal Representations for Neural Signals not yet
arxiv5 Understanding Reference Policies in Direct Preference Optimization not yet
arxiv5 Beyond Dropout: Robust Convolutional Neural Networks Based on Local Feature Masking not yet
arxiv5 Beyond Augmentation: Empowering Model Robustness under Extreme Capture Environments not yet
arxiv5 CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis not yet
arxiv5 Are Large Language Models Capable of Generating Human-Level Narratives? not yet
arxiv5 Research on Image Super-Resolution Reconstruction Mechanism based on Convolutional Neural Network not yet

※ 被引用数は更新日における NASA ADSのデータを参照しています
https://ui.adsabs.harvard.edu/

0
1
0

Register as a new user and use Qiita more conveniently

  1. You get articles that match your needs
  2. You can efficiently read back useful information
  3. You can use dark theme
What you can do with signing up
0
1

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?