0
1

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?

【2024年6月公開 Arxiv論文ランキング】2406.xxxxx

Last updated at Posted at 2024-10-01

AI論文解説 Youtubeチャンネル: AI時代の羅針盤

2024年6月頃に公開されたcsカテゴリの論文 (ID: 2406.xxxxx)を被引用数のデータを元にランキングしています。ランキングは随時更新します。
(2024年12月9日更新)

被引用数   タイトル 動画
arxiv146 ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools not yet
arxiv106 OpenVLA: An Open-Source Vision-Language-Action Model not yet
arxiv93 Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs not yet
arxiv87 DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence not yet
arxiv79 Depth Anything V2 not yet
arxiv77 The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale not yet
arxiv76 Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation not yet
arxiv74 MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark not yet
arxiv71 VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs not yet
arxiv61 A Survey on Large Language Models for Code Generation
arxiv59 Nemotron-4 340B Technical Report not yet
arxiv58 From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline not yet
arxiv57 Scaling and evaluating sparse autoencoders not yet
arxiv52 ShareGPT4Video: Improving Video Understanding and Generation with Better Captions not yet
arxiv51 Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts not yet
arxiv49 U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation not yet
arxiv47 Convolutional Kolmogorov-Arnold Networks not yet
arxiv46 Autoregressive Image Generation without Vector Quantization not yet
arxiv45 The Prompt Report: A Systematic Survey of Prompting Techniques
arxiv44 Refusal in Language Models Is Mediated by a Single Direction not yet
arxiv41 Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing not yet
arxiv40 DataComp-LM: In search of the next generation of training sets for language models not yet
arxiv39 Long Context Transfer from Language to Vision not yet
arxiv39 BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions not yet
arxiv38 CodeGemma: Open Code Models Based on Gemma not yet
arxiv38 Improving Alignment and Robustness with Circuit Breakers not yet
arxiv36 Scaling Synthetic Data Creation with 1,000,000,000 Personas not yet
arxiv36 HelpSteer2: Open-source dataset for training top-performing reward models not yet
arxiv34 Improve Mathematical Reasoning in Language Models by Automated Process Supervision not yet
arxiv33 An Empirical Study of Mamba-based Language Models not yet
arxiv33 Kolmogorov-Arnold Networks for Time Series: Bridging Predictive Power and Interpretability not yet
arxiv33 AIFS -- ECMWF's data-driven forecasting system not yet
arxiv32 Time Series Modeling for Heart Rate Prediction: From ARIMA to Transformers not yet
arxiv31 Mixture-of-Agents Enhances Large Language Model Capabilities
arxiv30 fKAN: Fractional Kolmogorov-Arnold Networks with trainable Jacobi basis functions not yet
arxiv30 Seed-TTS: A Family of High-Quality Versatile Speech Generation Models not yet
arxiv29 GKAN: Graph Kolmogorov-Arnold Networks not yet
arxiv29 A Temporal Kolmogorov-Arnold Transformer for Time Series Forecasting not yet
arxiv29 FourierKAN-GCF: Fourier Kolmogorov-Arnold Network -- An Effective and Efficient Feature Transformation for Graph Collaborative Filtering not yet
arxiv28 On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey not yet
arxiv27 LiveBench: A Challenging, Contamination-Free LLM Benchmark not yet
arxiv27 Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation not yet
arxiv26 A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges not yet
arxiv26 An Image is Worth 32 Tokens for Reconstruction and Generation not yet
arxiv26 PGSR: Planar-based Gaussian Splatting for Efficient and High-Fidelity Surface Reconstruction not yet
arxiv26 Safety Alignment Should Be Made More Than Just a Few Tokens Deep not yet
arxiv26 VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers not yet
arxiv26 WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild not yet
arxiv26 MLVU: A Comprehensive Benchmark for Multi-Task Long Video Understanding not yet
arxiv26 ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search not yet
arxiv26 Kolmogorov-Arnold Network for Satellite Image Classification in Remote Sensing not yet
arxiv25 CU-Net: a U-Net architecture for efficient brain-tumor segmentation on BraTS 2019 dataset not yet
arxiv25 Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling not yet
arxiv24 APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking not yet
arxiv24 Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
arxiv23 Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs not yet
arxiv23 KAGNNs: Kolmogorov-Arnold Networks meet Graph Learning not yet
arxiv23 LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training not yet
arxiv23 MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding not yet
arxiv23 Streamlining and standardizing software citations with The Software Citation Station not yet
arxiv23 CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation not yet
arxiv22 BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack not yet
arxiv22 Simple and Effective Masked Diffusion Language Models not yet
arxiv22 CRAG -- Comprehensive RAG Benchmark not yet
arxiv21 MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance not yet
arxiv21 LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks not yet
arxiv21 SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors not yet
arxiv21 Suitability of KANs for Computer Vision: A preliminary investigation not yet
arxiv21 TextGrad: Automatic "Differentiation" via Text not yet
arxiv21 XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model not yet
arxiv21 Simplified and Generalized Masked Diffusion for Discrete Data not yet
arxiv21 Credit Card Fraud Detection Using Advanced Transformer Model not yet
arxiv21 Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms not yet
arxiv21 To Believe or Not to Believe Your LLM
arxiv21 PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling not yet
arxiv21 Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization not yet
arxiv20 Simulating Classroom Education with LLM-Empowered Agents not yet
arxiv20 WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs not yet
arxiv20 Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More? not yet
arxiv20 NATURAL PLAN: Benchmarking LLMs on Natural Language Planning not yet
arxiv20 Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation not yet
arxiv20 CodeR: Issue Resolving with Multi-Agent and Task Graphs not yet
arxiv20 BadRAG: Identifying Vulnerabilities in Retrieval Augmented Generation of Large Language Models not yet
arxiv19 Demonstrating the Efficacy of Kolmogorov-Arnold Networks in Vision Tasks not yet
arxiv19 RL on Incorrect Synthetic Data Scales the Efficiency of LLM Math Reasoning by Eight-Fold not yet
arxiv19 HumanPlus: Humanoid Shadowing and Imitation from Humans not yet
arxiv19 Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs not yet
arxiv19 Grounding Image Matching in 3D with MASt3R not yet
arxiv19 MAIRA-2: Grounded Radiology Report Generation not yet
arxiv19 Are We Done with MMLU? not yet
arxiv19 V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation not yet
arxiv18 RouteLLM: Learning to Route LLMs with Preference Data
arxiv18 LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs not yet
arxiv18 rKAN: Rational Kolmogorov-Arnold Networks not yet
arxiv18 BSRBF-KAN: A combination of B-splines and Radial Basis Functions in Kolmogorov-Arnold Networks not yet
arxiv18 Incompressibility and spectral gaps of random circuits not yet
arxiv18 Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B not yet
arxiv17 Finite basis Kolmogorov-Arnold networks: domain decomposition for data-driven and physics-informed problems not yet
arxiv17 Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT not yet
arxiv17 Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models not yet
arxiv17 EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees not yet
arxiv17 Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning not yet
arxiv17 Towards Infinite-Long Prefix in Transformer not yet
arxiv17 Vision-LSTM: xLSTM as Generic Vision Backbone not yet
arxiv17 SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models not yet
arxiv17 RaDe-GS: Rasterizing Depth in Gaussian Splatting not yet
arxiv16 Symbolic Learning Enables Self-Evolving Agents not yet
arxiv16 CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs not yet
arxiv16 Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA not yet
arxiv16 GraphKAN: Enhancing Feature Extraction with Graph Kolmogorov Arnold Networks not yet
arxiv16 Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models not yet
arxiv16 Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges not yet
arxiv16 DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning not yet
arxiv16 Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models not yet
arxiv16 Scaling Large-Language-Model-based Multi-Agent Collaboration not yet
arxiv16 MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures not yet
arxiv16 Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models not yet
arxiv16 PowerInfer-2: Fast Large Language Model Inference on a Smartphone not yet
arxiv16 MotionClone: Training-Free Motion Cloning for Controllable Video Generation not yet
arxiv16 Benchmark Data Contamination of Large Language Models: A Survey not yet
arxiv16 Neural network learns low-dimensional polynomials with SGD near the information-theoretic limit not yet
arxiv16 Spectroscopy and modeling of $^{171}$Yb Rydberg states for high-fidelity two-qubit gates not yet
arxiv16 How to Understand Whole Software Repository? not yet
arxiv16 D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models not yet
arxiv15 HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale not yet
arxiv15 The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm not yet
arxiv15 Are Language Models Actually Useful for Time Series Forecasting? not yet
arxiv15 One Thousand and One Pairs: A "novel" challenge for long-context language models not yet
arxiv15 Blind Baselines Beat Membership Inference Attacks for Foundation Models not yet
arxiv15 PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preference not yet
arxiv15 Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging not yet
arxiv15 Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models not yet
arxiv15 Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference not yet
arxiv15 Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL not yet
arxiv15 VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks not yet
arxiv15 Unveiling the Power of Wavelets: A Wavelet-based Kolmogorov-Arnold Network for Hyperspectral Image Classification not yet
arxiv15 CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark not yet
arxiv15 Advanced Payment Security System:XGBoost, LightGBM and SMOTE Integrated not yet
arxiv15 Towards Scalable Automated Alignment of LLMs: A Survey not yet
arxiv14 APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets
arxiv14 WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models not yet
arxiv14 MMBench-Video: A Long-Form Multi-Shot Benchmark for Holistic Video Understanding not yet
arxiv14 RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models not yet
arxiv14 GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents not yet
arxiv14 Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback not yet
arxiv14 Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions not yet
arxiv14 MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance not yet
arxiv14 CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models not yet
arxiv14 On the Effects of Data Scale on UI Control Agents not yet
arxiv14 Guiding a Diffusion Model with a Bad Version of Itself not yet
arxiv14 ReLU-KAN: New Kolmogorov-Arnold Networks that Only Need Matrix Addition, Dot Multiplication, and ReLU not yet
arxiv14 Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration not yet
arxiv14 BoNBoN Alignment for Large Language Models and the Sweetness of Best-of-n Sampling not yet
arxiv13 Changing Answer Order Can Decrease MMLU Accuracy not yet
arxiv13 Kolmogorov-Arnold Graph Neural Networks not yet
arxiv13 CodeRAG-Bench: Can Retrieval Augment Code Generation? not yet
arxiv13 Instruction Pre-Training: Language Models are Supervised Multitask Learners not yet
arxiv13 WebCanvas: Benchmarking Web Agents in Online Environments not yet
arxiv13 $\tau$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains not yet
arxiv13 GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities not yet
arxiv13 STAR: Scale-wise Text-to-image generation via Auto-Regressive representations not yet
arxiv13 MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers not yet
arxiv13 SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding not yet
arxiv13 OmniH2O: Universal and Dexterous Human-to-Humanoid Whole-Body Teleoperation and Learning not yet
arxiv13 What If We Recaption Billions of Web Images with LLaMA-3? not yet
arxiv13 Large Language Model Unlearning via Embedding-Corrupted Prompts not yet
arxiv13 The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models not yet
arxiv13 Towards Semantic Equivalence of Tokenization in Multimodal LLM not yet
arxiv13 Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image not yet
arxiv13 RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation not yet
arxiv13 Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models not yet
arxiv13 Transformers need glasses! Information over-squashing in language tasks not yet
arxiv13 Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt not yet
arxiv13 Scalable MatMul-free Language Modeling not yet
arxiv13 iKAN: Global Incremental Learning with KAN for Human Activity Recognition Across Heterogeneous Datasets not yet
arxiv13 Unlocking Guidance for Discrete State-Space Diffusion and Flow Models not yet
arxiv12 LLaRA: Supercharging Robot Learning Data for Vision-Language Policy not yet
arxiv12 Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation not yet
arxiv12 Understanding and Mitigating Language Confusion in LLMs not yet
arxiv12 ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation not yet
arxiv12 From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models not yet
arxiv12 Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration not yet
arxiv12 NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking not yet
arxiv12 Consistency Models Made Easy not yet
arxiv12 A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular Data not yet
arxiv12 Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data not yet
arxiv12 Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% not yet
arxiv12 GUICourse: From General Vision Language Models to Versatile GUI Agents not yet
arxiv12 From Pixels to Prose: A Large Dataset of Dense Image Captions not yet
arxiv12 Training-free Camera Control for Video Generation not yet
arxiv12 Pandora: Towards General World Model with Natural Language Actions and Video States not yet
arxiv12 Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models not yet
arxiv12 GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices not yet
arxiv12 One-Step Effective Diffusion Network for Real-World Image Super-Resolution not yet
arxiv12 Delving into ChatGPT usage in academic writing through excess vocabulary not yet
arxiv12 Parallelizing Linear Transformers with the Delta Rule over Sequence Length not yet
arxiv12 Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters not yet
arxiv12 How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States not yet
arxiv12 LawGPT: A Chinese Legal Knowledge-Enhanced Large Language Model not yet
arxiv12 Computational Limits of Low-Rank Adaptation (LoRA) for Transformer-Based Models not yet
arxiv12 Enhance Image-to-Image Generation with LLaVA-generated Prompts not yet
arxiv12 Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching not yet
arxiv12 When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs not yet
arxiv12 Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses not yet
arxiv12 Dimba: Transformer-Mamba Diffusion Models not yet
arxiv12 $\Delta$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers not yet
arxiv11 The Remarkable Robustness of LLMs: Stages of Inference? not yet
arxiv11 Resolving Discrepancies in Compute-Optimal Scaling of Language Models not yet
arxiv11 Manipulate-Anything: Automating Real-World Robots using Vision-Language Models not yet
arxiv11 Application of Multimodal Fusion Deep Learning Model in Disease Recognition not yet
arxiv11 LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference not yet
arxiv11 Image anomaly detection and prediction scheme based on SSA optimized ResNet50-BiGRU model not yet
arxiv11 SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model not yet
arxiv11 Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces not yet
arxiv11 MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens not yet
arxiv11 Vul-RAG: Enhancing LLM-based Vulnerability Detection via Knowledge-level RAG not yet
arxiv11 Diffusion Models in Low-Level Vision: A Survey not yet
arxiv11 WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences not yet
arxiv11 A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners not yet
arxiv11 LVBench: An Extreme Long Video Understanding Benchmark not yet
arxiv11 McEval: Massively Multilingual Code Evaluation not yet
arxiv11 MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models not yet
arxiv11 LLM Dataset Inference: Did you train on my dataset? not yet
arxiv11 Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization not yet
arxiv11 RATT: A Thought Structure for Coherent and Correct LLM Reasoning not yet
arxiv11 Safeguarding Large Language Models: A Survey not yet
arxiv11 Teams of LLM Agents can Exploit Zero-Day Vulnerabilities not yet
arxiv11 The Dawn of Natural Language to SQL: Are We Fully Ready? not yet
arxiv10 OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding not yet
arxiv10 Revisiting Backdoor Attacks against Large Vision-Language Models not yet
arxiv10 Navigating LLM Ethics: Advancements, Challenges, and Future Directions not yet
arxiv10 Localized statistics decoding: A parallel decoding algorithm for quantum low-density parity-check codes not yet
arxiv10 Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers not yet
arxiv10 Preference Tuning For Toxicity Mitigation Generalizes Across Languages not yet
arxiv10 AudioBench: A Universal Benchmark for Audio Large Language Models not yet
arxiv10 Adversarial Attacks on Multimodal Agents not yet
arxiv10 MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs not yet
arxiv10 DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer not yet
arxiv10 Avoiding Copyright Infringement via Large Language Model Unlearning not yet
arxiv10 L4GM: Large 4D Gaussian Reconstruction Model not yet
arxiv10 ControlVAR: Exploring Controllable Visual Autoregressive Modeling not yet
arxiv10 VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding not yet
arxiv10 Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning? not yet
arxiv10 RGFN: Synthesizable Molecular Generation Using GFlowNets not yet
arxiv10 Scaling Laws in Linear Regression: Compute, Parameters, and Data not yet
arxiv10 CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models not yet
arxiv10 Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena not yet
arxiv10 BAKU: An Efficient Transformer for Multi-Task Policy Learning not yet
arxiv10 Beyond Model Collapse: Scaling Up with Synthesized Data Requires Verification not yet
arxiv10 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models not yet
arxiv10 MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models not yet
arxiv10 Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion not yet
arxiv10 Multistep Distillation of Diffusion Models via Moment Matching not yet
arxiv10 Does your data spark joy? Performance gains from domain upsampling at the end of training not yet
arxiv10 The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches not yet
arxiv10 Demystifying the Compression of Mixture-of-Experts Through a Unified Framework not yet
arxiv10 DrEureka: Language Model Guided Sim-To-Real Transfer not yet
arxiv10 An Enhanced Encoder-Decoder Network Architecture for Reducing Information Loss in Image Semantic Segmentation not yet
arxiv10 The Geometry of Categorical and Hierarchical Concepts in Large Language Models not yet
arxiv10 Learning Temporally Consistent Video Depth from Video Diffusion Priors not yet
arxiv10 UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation not yet
arxiv10 Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback not yet
arxiv9 Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs? not yet
arxiv9 VERISCORE: Evaluating the factuality of verifiable claims in long-form text generation not yet
arxiv9 Exploration of Multi-Scale Image Fusion Systems in Intelligent Medical Image Analysis not yet
arxiv9 E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS not yet
arxiv9 Following Length Constraints in Instructions not yet
arxiv9 BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models not yet
arxiv9 Steering Without Side Effects: Improving Post-Deployment Control of Language Models not yet
arxiv9 Application of Computer Deep Learning Model in Diagnosis of Pulmonary Nodules not yet
arxiv9 Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level not yet
arxiv9 How Do Large Language Models Acquire Factual Knowledge During Pretraining? not yet
arxiv9 Task Me Anything not yet
arxiv9 Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs not yet
arxiv9 MASAI: Modular Architecture for Software-engineering AI Agents not yet
arxiv9 DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling not yet
arxiv9 Step-level Value Preference Optimization for Mathematical Reasoning not yet
arxiv9 GenQA: Generating Millions of Instructions from a Handful of Prompts not yet
arxiv9 Quantifying Variance in Evaluation Benchmarks not yet
arxiv9 BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages not yet
arxiv9 Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models not yet
arxiv9 OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation not yet
arxiv9 Coupled Ocean-Atmosphere Dynamics in a Machine Learning Earth System Model not yet
arxiv9 Judging the Judges: A Systematic Investigation of Position Bias in Pairwise Comparative Assessments by LLMs not yet
arxiv9 Zero-shot Image Editing with Reference Imitation not yet
arxiv9 AI Sandbagging: Language Models can Strategically Underperform on Evaluations not yet
arxiv9 Needle In A Multimodal Haystack not yet
arxiv9 A Survey of Backdoor Attacks and Defenses on Large Language Models: Implications for Security Measures not yet
arxiv9 WoCoCo: Learning Whole-Body Humanoid Control with Sequential Contacts not yet
arxiv9 Hello Again! LLM-powered Personalized Agent for Long-term Dialogue not yet
arxiv9 Mamba YOLO: SSMs-Based YOLO For Object Detection not yet
arxiv9 Deep Learning Powered Estimate of The Extrinsic Parameters on Unmanned Surface Vehicles not yet
arxiv9 Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive? not yet
arxiv9 AgentGym: Evolving Large Language Model-based Agents across Diverse Environments not yet
arxiv9 Bench2Drive: Towards Multi-Ability Benchmarking of Closed-Loop End-To-End Autonomous Driving not yet
arxiv9 Event3DGS: Event-Based 3D Gaussian Splatting for High-Speed Robot Egomotion not yet
arxiv9 Exploring the Potential of Polynomial Basis Functions in Kolmogorov-Arnold Networks: A Comparative Study of Different Groups of Polynomials not yet
arxiv9 Cross-Modal Safety Alignment: Is textual unlearning all you need? not yet
arxiv9 Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation not yet
arxiv9 DreamPhysics: Learning Physical Properties of Dynamic 3D Gaussians with Video Diffusion Priors not yet
arxiv9 Are you still on track!? Catching LLM Task Drift with Activations not yet
arxiv9 Automatic Instruction Evolving for Large Language Models not yet
arxiv9 Towards Rationality in Language and Multimodal Agents: A Survey not yet
arxiv9 Exploration of Attention Mechanism-Enhanced Deep Learning Models in the Mining of Medical Textual Data not yet
arxiv8 Decoding-Time Language Model Alignment with Multiple Objectives not yet
arxiv8 AI Risk Categorization Decoded (AIR 2024): From Government Regulations to Corporate Policies not yet
arxiv8 MotionBooth: Motion-Aware Customized Text-to-Video Generation not yet
arxiv8 FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models not yet
arxiv8 VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models not yet
arxiv8 Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibration not yet
arxiv8 VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation not yet
arxiv8 Continuous Aperture Array (CAPA)-Based Wireless Communications: Capacity Characterization not yet
arxiv8 Risk thresholds for frontier AI not yet
arxiv8 Fantastic Copyrighted Beasts and How (Not) to Generate Them not yet
arxiv8 Transferable Boltzmann Generators not yet
arxiv8 DASB - Discrete Audio and Speech Benchmark not yet
arxiv8 CityGPT: Empowering Urban Spatial Cognition of Large Language Models not yet
arxiv8 SpatialBot: Precise Spatial Understanding with Vision Language Models not yet
arxiv8 DF40: Toward Next-Generation Deepfake Detection not yet
arxiv8 VRSBench: A Versatile Vision-Language Benchmark Dataset for Remote Sensing Image Understanding not yet
arxiv8 Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models not yet
arxiv8 WPO: Enhancing RLHF with Weighted Preference Optimization not yet
arxiv8 Transcendence: Generative Models Can Outperform The Experts That Train Them not yet
arxiv8 Can LLM be a Personalized Judge? not yet
arxiv8 DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models not yet
arxiv8 A Survey on Human Preference Learning for Large Language Models not yet
arxiv8 Predict Click-Through Rates with Deep Interest Network Model in E-commerce Advertising not yet
arxiv8 Detecting and Evaluating Medical Hallucinations in Large Vision Language Models not yet
arxiv8 STALL+: Boosting LLM-based Repository-level Code Completion with Static Analysis not yet
arxiv8 LRM-Zero: Training Large Reconstruction Models with Synthesized Data not yet
arxiv8 Understanding Hallucinations in Diffusion Models through Mode Interpolation not yet
arxiv8 Understanding Jailbreak Success: A Study of Latent Space Dynamics in Large Language Models not yet
arxiv8 Multi-Agent Software Development through Cross-Team Collaboration not yet
arxiv8 COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing not yet
arxiv8 MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs not yet
arxiv8 RVT-2: Learning Precise Manipulation from Few Demonstrations not yet
arxiv8 Discovering Preference Optimization Algorithms with and for Large Language Models not yet
arxiv8 Large Language Models Must Be Taught to Know What They Don't Know not yet
arxiv8 Designing a Dashboard for Transparency and Control of Conversational AI not yet
arxiv8 Trim 3D Gaussian Splatting for Accurate Geometry Representation not yet
arxiv8 Effectively Compress KV Heads for LLM not yet
arxiv8 Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models not yet
arxiv8 Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation not yet
arxiv8 RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection not yet
arxiv8 6DMA Enhanced Wireless Network with Flexible Antenna Position and Rotation: Opportunities and Challenges not yet
arxiv8 Machine Against the RAG: Jamming Retrieval-Augmented Generation with Blocker Documents not yet
arxiv8 Deconstructing The Ethics of Large Language Models from Long-standing Issues to New-emerging Dilemmas: A Survey not yet
arxiv8 UltraMedical: Building Specialized Generalists in Biomedicine not yet
arxiv8 Lean Workbook: A large-scale Lean problem set formalized from natural language math problems not yet
arxiv8 Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data not yet
arxiv8 Understanding the Impact of Negative Prompts: When and How Do They Take Effect? not yet
arxiv8 HYDRA: Model Factorization Framework for Black-Box LLM Personalization not yet
arxiv8 Leveraging KANs For Enhanced Deep Koopman Operator Discovery not yet
arxiv8 RKLD: Reverse KL-Divergence-based Knowledge Distillation for Unlearning Personal Information in Large Language Models not yet
arxiv8 Process-Driven Autoformalization in Lean 4 not yet
arxiv8 DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs not yet
arxiv8 Unleashing Generalization of End-to-End Autonomous Driving with Controllable Long Video Generation not yet
arxiv8 Show, Don't Tell: Aligning Language Models with Demonstrated Feedback not yet
arxiv8 LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models not yet
arxiv7 SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting not yet
arxiv7 The SIFo Benchmark: Investigating the Sequential Instruction Following Ability of Large Language Models not yet
arxiv7 Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model not yet
arxiv7 UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models not yet
arxiv7 Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation not yet
arxiv7 Evaluating Copyright Takedown Methods for Language Models not yet
arxiv7 Online Learning of Multiple Tasks and Their Relationships : Testing on Spam Email Data and EEG Signals Recorded in Construction Fields not yet
arxiv7 On the Evaluation of Large Language Models in Unit Test Generation not yet
arxiv7 Point-SAM: Promptable 3D Segmentation Model for Point Clouds not yet
arxiv7 Quantifying AI Psychology: A Psychometrics Benchmark for Large Language Models not yet
arxiv7 MemServe: Context Caching for Disaggregated LLM Serving with Elastic Memory Pool not yet
arxiv7 A Complete Survey on LLM-based AI Chatbots not yet
arxiv7 Dreamitate: Real-World Visuomotor Policy Learning via Video Generation not yet
arxiv7 DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation not yet
arxiv7 Adam-mini: Use Fewer Learning Rates To Gain More not yet
arxiv7 WARP: On the Benefits of Weight Averaged Rewarded Policies not yet
arxiv7 Trace is the Next AutoDiff: Generative Optimization with Rich Feedback, Execution Traces, and LLMs not yet
arxiv7 LGS: A Light-weight 4D Gaussian Splatting for Efficient Surgical Scene Reconstruction not yet
arxiv7 Can LLM Graph Reasoning Generalize beyond Pattern Memorization? not yet
arxiv7 Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs not yet
arxiv7 SampleAttention: Near-Lossless Acceleration of Long Context LLM Inference with Adaptive Structured Sparse Attention not yet
arxiv7 Image Conductor: Precision Control for Interactive Video Synthesis not yet
arxiv7 GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation not yet
arxiv7 MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression not yet
arxiv7 A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models not yet
arxiv7 Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data not yet
arxiv7 Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs not yet
arxiv7 Timo: Towards Better Temporal Reasoning for Language Models not yet
arxiv7 CityBench: Evaluating the Capabilities of Large Language Model as World Model not yet
arxiv7 CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets not yet
arxiv7 GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation not yet
arxiv7 Nicer Than Humans: How do Large Language Models Behave in the Prisoner's Dilemma? not yet
arxiv7 Coding Speech through Vocal Tract Kinematics not yet
arxiv7 SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents not yet
arxiv7 OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI not yet
arxiv7 TSI-Bench: Benchmarking Time Series Imputation not yet
arxiv7 AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention not yet
arxiv7 AgentReview: Exploring Peer Review Dynamics with LLM Agents not yet
arxiv7 VoCo-LLaMA: Towards Vision Compression with Large Language Models not yet

※ 被引用数は更新日における NASA ADSのデータを参照しています
https://ui.adsabs.harvard.edu/

0
1
0

Register as a new user and use Qiita more conveniently

  1. You get articles that match your needs
  2. You can efficiently read back useful information
  3. You can use dark theme
What you can do with signing up
0
1

Delete article

Deleted articles cannot be recovered.

Draft of this article would be also deleted.

Are you sure you want to delete this article?