[arXiv Paper Ranking: Papers Published in October 2024] 2410.xxxxx

AI paper explainer YouTube channel: AI時代の羅針盤

This page ranks papers in the cs category published around October 2024 (IDs: 2410.xxxxx) by citation count. The ranking is updated from time to time.
(Updated December 11, 2024)
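
The ordering described above can be sketched in a few lines of Python. This is a minimal illustration, not the author's actual script: the `Paper` type and `rank_by_citations` function are hypothetical names, the sample rows are copied from the table below, and letting papers with equal counts share a rank is an assumption, since the article does not say how ties are handled.

```python
from typing import NamedTuple


class Paper(NamedTuple):
    # Hypothetical record type used only for this illustration.
    title: str
    citations: int


def rank_by_citations(papers):
    """Sort by citation count, highest first; equal counts share a rank (assumption)."""
    # sorted() is stable, so tied papers keep their input order.
    ordered = sorted(papers, key=lambda p: p.citations, reverse=True)
    ranked = []
    for position, paper in enumerate(ordered, start=1):
        if ranked and ranked[-1][1].citations == paper.citations:
            rank = ranked[-1][0]  # same count as the previous row: reuse its rank
        else:
            rank = position
        ranked.append((rank, paper))
    return ranked


# A few rows taken from the table in this article, deliberately out of order.
sample = [
    Paper("Depth Pro: Sharp Monocular Metric Depth in Less Than a Second", 14),
    Paper("Movie Gen: A Cast of Media Foundation Models", 40),
    Paper("Pixtral 12B", 11),
    Paper("GPT-4o System Card", 36),
    Paper("HART: Efficient Visual Generation with Hybrid Autoregressive Transformer", 11),
]

for rank, paper in rank_by_citations(sample):
    print(f"{rank:>2}  {paper.citations:>3}  {paper.title}")
```

Because the sort is stable, tied entries stay in whatever order the data source returned them, which may or may not match the order used in the table.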

| Citations | Title | Video |
|---|---|---|
| 40 | Movie Gen: A Cast of Media Foundation Models | |
| 36 | GPT-4o System Card | not yet |
| 24 | GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models | |
| 14 | Depth Pro: Sharp Monocular Metric Depth in Less Than a Second | |
| 11 | HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | not yet |
| 11 | Pixtral 12B | not yet |
| 11 | Moshi: a speech-text foundation model for real-time dialogue | not yet |
| 10 | Self-Supervised Graph Neural Networks for Enhanced Feature Extraction in Heterogeneous Information Networks | not yet |
| 10 | A Recommendation Model Utilizing Separation Embedding and Self-Attention for Feature Mining | not yet |
| 10 | Adversarial Neural Networks in Medical Imaging Advancements and Challenges in Semantic Segmentation | not yet |
| 10 | A Survey on Diffusion Models for Inverse Problems | not yet |
| 9 | Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens | |
| 9 | Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation | not yet |
| 9 | Balancing Innovation and Privacy: Data Security Strategies in Natural Language Processing Applications | not yet |
| 9 | Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation | not yet |
| 9 | SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference | not yet |
| 9 | MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion | not yet |
| 9 | Loong: Generating Minute-level Long Videos with Autoregressive Language Models | not yet |
| 8 | Predicting Liquidity Coverage Ratio with Gated Recurrent Units: A Deep Learning Model for Risk Management | not yet |
| 8 | Efficient and Aesthetic UI Design with a Deep Learning-Based Interface Generation Tree Algorithm | not yet |
| 8 | Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | not yet |
| 8 | GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation | not yet |
| 8 | Applying Hybrid Graph Neural Networks to Strengthen Credit Risk Analysis | not yet |
| 8 | Video Instruction Tuning With Synthetic Data | not yet |
| 7 | O1 Replication Journey: A Strategic Progress Report -- Part 1 | not yet |
| 7 | Optimizing Retrieval-Augmented Generation with Elasticsearch for Enhanced Question-Answering Systems | not yet |
| 7 | Automated Genre-Aware Article Scoring and Feedback Using Large Language Models | not yet |
| 7 | HSR-Enhanced Sparse Attention Acceleration | not yet |
| 7 | Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think | not yet |
| 7 | Pyramidal Flow Matching for Efficient Video Generative Modeling | |
| 7 | Differential Transformer | |
| 7 | Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents | not yet |
| 7 | Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding | not yet |
| 6 | Data Scaling Laws in Imitation Learning for Robotic Manipulation | not yet |
| 6 | Allegro: Open the Black Box of Commercial-Level Video Generation Model | |
| 6 | Jailbreaking and Mitigation of Vulnerabilities in Large Language Models | not yet |
| 6 | Blockchain-Based Trust and Transparency in Airline Reservation Systems using Microservices Architecture | not yet |
| 6 | Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations | not yet |
| 6 | Generative AI and Its Impact on Personalized Intelligent Tutoring Systems | not yet |
| 6 | SpeGCL: Self-supervised Graph Spectrum Contrastive Learning without Positive Samples | not yet |
| 6 | Looped ReLU MLPs May Be All You Need as Practical Programmable Computers | not yet |
| 6 | Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow | not yet |
| 6 | MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering | |
| 6 | Justice or Prejudice? Quantifying Biases in LLM-as-a-Judge | not yet |
| 6 | IC3M: In-Car Multimodal Multi-object Monitoring for Abnormal Status of Both Driver and Passengers | not yet |
| 5 | Exposing Cross-Platform Coordinated Inauthentic Activity in the Run-Up to the 2024 U.S. Election | not yet |
| 5 | Deep Learning for Medical Text Processing: BERT Model Fine-Tuning and Comparative Study | not yet |
| 5 | Optimizing Travel Itineraries with AI Algorithms in a Microservices Architecture: Balancing Cost, Time, Preferences, and Sustainability | not yet |
| 5 | PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction | not yet |
| 5 | One-Step Diffusion Distillation through Score Implicit Matching | not yet |
| 5 | Jailbreaking LLM-Controlled Robots | not yet |
| 5 | Adaptive Data Optimization: Dynamic Sample Selection with Scaling Laws | not yet |
| 5 | Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent | not yet |
| 5 | Beyond Linear Approximations: A Novel Pruning Approach for Attention Matrix | not yet |
| 5 | Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies | not yet |
| 5 | SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers | not yet |
| 5 | How to Construct Random Unitaries | not yet |
| 5 | The Future of Learning in the Age of Generative AI: Automated Question Generation and Assessment with Large Language Models | not yet |
| 5 | AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents | not yet |
| 5 | Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis | not yet |
| 5 | Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training | not yet |
| 5 | DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation | not yet |
| 5 | Aria: An Open Multimodal Native Mixture-of-Experts Model | not yet |
| 5 | Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation | not yet |
| 5 | ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery | not yet |
| 5 | LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning | not yet |
| 5 | HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly | not yet |
| 5 | How to Train Long-Context Language Models (Effectively) | not yet |
| 5 | ImageFolder: Autoregressive Image Generation with Folded Tokens | not yet |
| 4 | No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images | not yet |
| 4 | $\pi_0$: A Vision-Language-Action Flow Model for General Robot Control | not yet |
| 4 | Deep Learning with HM-VGG: AI Strategies for Multi-modal Image Analysis | not yet |
| 4 | Unearthing a Billion Telegram Posts about the 2024 U.S. Presidential Election: Development of a Public Dataset | not yet |
| 4 | Human-Centric eXplainable AI in Education | not yet |
| 4 | LLM-Slice: Dedicated Wireless Network Slicing for Large Language Models | not yet |
| 4 | Quantum linear system algorithm with optimal queries to initial state preparation | not yet |
| 4 | Graph Contrastive Learning via Cluster-refined Negative Sampling for Semi-supervised Text Classification | not yet |
| 4 | YOLOv11: An Overview of the Key Architectural Enhancements | not yet |
| 4 | LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding | |
| 4 | The XLZD Design Book: Towards the Next-Generation Liquid Xenon Observatory for Dark Matter and Neutrino Physics | not yet |
| 4 | Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages | not yet |
| 4 | Iterative Methods via Locally Evolving Set Process | not yet |
| 4 | MCSFF: Multi-modal Consistency and Specificity Fusion Framework for Entity Alignment | not yet |
| 4 | Beamforming Optimization for Continuous Aperture Array (CAPA)-based Communications | not yet |
| 4 | DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation | not yet |
| 4 | ALOHA Unleashed: A Simple Recipe for Robot Dexterity | not yet |
| 4 | MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models | not yet |
| 4 | MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generation | not yet |
| 4 | CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos | not yet |
| 4 | Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities | not yet |
| 4 | TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models | not yet |
| 4 | When Attention Sink Emerges in Language Models: An Empirical View | not yet |
| 4 | Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models | not yet |
| 4 | Locking Down the Finetuned LLMs Safety | not yet |
| 4 | The Ingredients for Robotic Diffusion Transformers | not yet |
| 4 | Targeted Vaccine: Safety Alignment for Large Language Models against Harmful Fine-Tuning via Layer-wise Perturbation | not yet |
| 4 | Impurities and polarons in bosonic quantum gases: a review on recent progress | not yet |
| 4 | Fine-grained Attention I/O Complexity: Comprehensive Analysis for Backward Passes | not yet |
| 4 | Improved List Size for Folded Reed-Solomon Codes | not yet |
| 4 | Ocean-omni: To Understand the World with Omni-modality | not yet |
| 4 | RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation | not yet |
| 4 | SEAL: Safety-enhanced Aligned LLM Fine-tuning via Bilevel Data Selection | not yet |
| 4 | Automated Creation of Digital Cousins for Robust Policy Learning | not yet |
| 4 | Toward hybrid quantum simulations with qubits and qumodes on trapped-ion platforms | not yet |
| 4 | IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation | not yet |
| 4 | F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching | |
| 4 | Dynamic metastability in the self-attention model | not yet |
| 4 | Manifolds, Random Matrices and Spectral Gaps: The geometric phases of generative diffusion | not yet |
| 4 | MDAP: A Multi-view Disentangled and Adaptive Preference Learning Framework for Cross-Domain Recommendation | not yet |
| 4 | Strong Model Collapse | |
| 4 | Stochastic Runge-Kutta Methods: Provable Acceleration of Diffusion Models | not yet |
| 4 | CAR: Controllable Autoregressive Modeling for Visual Generation | not yet |
| 4 | Towards Secure Tuning: Mitigating Security Risks Arising from Benign Instruction Fine-Tuning | not yet |
| 4 | Dynamic Diffusion Transformer | not yet |
| 4 | AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark | not yet |
| 4 | MetaMetrics: Calibrating Metrics For Generation Tasks Using Human Preferences | not yet |
| 4 | AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models | not yet |
| 4 | Iterated Radical Expansions and Convergence | not yet |
| 4 | Deep Learning Alternatives of the Kolmogorov Superposition Theorem | not yet |
| 4 | CHASE-SQL: Multi-Path Reasoning and Preference Optimized Candidate Selection in Text-to-SQL | not yet |
| 4 | Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown | not yet |
| 4 | The Patterns of Life Human Mobility Simulation | not yet |
| 3 | Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning | not yet |
| 3 | Breaking Determinism: Fuzzy Modeling of Sequential Recommendation Using Discrete State Space Diffusion Model | not yet |
| 3 | A note on polynomial-time tolerant testing stabilizer states | not yet |
| 3 | MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding | not yet |
| 3 | Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse | |
| 3 | Rethinking Visual Dependency in Long-Context Reasoning for Large Vision-Language Models | not yet |
| 3 | Enhancing Resilience and Scalability in Travel Booking Systems: A Microservices Approach to Fault Tolerance, Load Balancing, and Service Discovery | not yet |
| 3 | Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms | not yet |
| 3 | Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences | not yet |
| 3 | Denoising diffusion probabilistic models are optimally adaptive to unknown low dimensionality | not yet |
| 3 | Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs | not yet |
| 3 | CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation | not yet |
| 3 | AdvWeb: Controllable Black-box Attacks on VLM-powered Web Agents | not yet |
| 3 | 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | not yet |
| 3 | A Survey of Conversational Search | not yet |
| 3 | Pruning Foundation Models for High Accuracy without Retraining | not yet |
| 3 | Transversal non-Clifford gates for quantum LDPC codes on sheaves | not yet |
| 3 | DepthSplat: Connecting Gaussian Splatting and Depth | not yet |
| 3 | DreamVideo-2: Zero-Shot Subject-Driven Video Customization with Precise Motion Control | not yet |
| 3 | DN-4DGS: Denoised Deformable Network with Temporal-Spatial Aggregation for Dynamic Scene Rendering | not yet |
| 3 | L3DG: Latent 3D Gaussian Diffusion | not yet |
| 3 | aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Completion | not yet |
| 3 | Generative Reward Models | not yet |
| 3 | FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression | not yet |
| 3 | WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines | not yet |
| 3 | Latent Action Pretraining from Videos | |
| 3 | Agent-as-a-Judge: Evaluate Agents with Agents | |
| 3 | DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads | |
| 3 | Boosting Camera Motion Control for Video Diffusion Transformers | not yet |
| 3 | MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling | not yet |
| 3 | Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling | not yet |
| 3 | Animate-X: Universal Character Image Animation with Enhanced Motion Representation | not yet |
| 3 | MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models | not yet |
| 3 | Safety-Aware Fine-Tuning of Large Language Models | not yet |
| 3 | Lessons Learned: A Smart Campus Environment Using LoRaWAN | not yet |
| 3 | OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models | not yet |
| 3 | VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model | not yet |
| 3 | ARCap: Collecting High-quality Human Demonstrations for Robot Learning with Augmented Reality Feedback | not yet |
| 3 | Language model developers should report train-test overlap | not yet |
| 3 | Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning | not yet |
| 3 | Efficient Quantum Pseudorandomness from Hamiltonian Phase States | not yet |
| 3 | Hybrid Summary Statistics | not yet |
| 3 | ReinDiffuse: Crafting Physically Plausible Motions with Reinforced Diffusion Model | not yet |
| 3 | Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis | not yet |
| 3 | Personalized Visual Instruction Tuning | not yet |
| 3 | AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs | not yet |
| 3 | MIBench: A Comprehensive Benchmark for Model Inversion Attack and Defense | not yet |
| 3 | Gibbs state preparation for commuting Hamiltonian: Mapping to classical Gibbs sampling | not yet |
| 3 | Hammer: Robust Function-Calling for On-Device Language Models via Function Masking | not yet |
| 3 | Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution | not yet |
| 3 | Equivariant Neural Functional Networks for Transformers | not yet |
| 3 | Sinc Kolmogorov-Arnold Network and Its Applications on Physics-informed Neural Networks | not yet |
| 3 | A survey of Zarankiewicz problem in geometry | not yet |
| 3 | GenSim2: Scaling Robot Data Generation with Multi-modal and Reasoning LLMs | not yet |
| 3 | LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding | not yet |
| 3 | Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations | not yet |
| 3 | SteerDiff: Steering towards Safe Text-to-Image Diffusion Models | not yet |
| 3 | LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations | |
| 3 | ControlAR: Controllable Image Generation with Autoregressive Models | not yet |
| 3 | Analyzing black-hole ringdowns II: data conditioning | not yet |
| 3 | Learning classical density functionals for ionic fluids | not yet |
| 3 | RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning | |
| 3 | Urban Anomalies: A Simulated Human Mobility Dataset with Injected Anomalies | not yet |
| 3 | A Little Goes a Long Way: Efficient Long Context Training and Inference with Partial Contexts | not yet |
| 3 | softmax is not enough (for sharp out-of-distribution) | not yet |
| 3 | MERIT: Multimodal Wearable Vital Sign Waveform Monitoring | not yet |
| 3 | Enhanced Credit Score Prediction Using Ensemble Deep Learning Model | not yet |
| 3 | Transferable Unsupervised Outlier Detection Framework for Human Semantic Trajectories | not yet |

Note: Citation counts are taken from NASA ADS data as of the update date.
https://ui.adsabs.harvard.edu/
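
The article does not say how the counts were retrieved from NASA ADS. One possible route is the ADS search API, sketched below under several assumptions: the helper names (`build_request`, `extract_counts`, `fetch_counts`) are hypothetical, a personal ADS API token is required, and the search string is left to the caller because the article does not give the query it used.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# ADS search endpoint; requests must carry a personal API token.
ADS_SEARCH_URL = "https://api.adsabs.harvard.edu/v1/search/query"


def build_request(query, token, rows=200):
    """Build (but do not send) a search request sorted by citation count."""
    params = urlencode({
        "q": query,                     # caller-supplied ADS search string
        "fl": "title,citation_count",   # fields to return
        "sort": "citation_count desc",  # most cited first
        "rows": rows,
    })
    return Request(
        f"{ADS_SEARCH_URL}?{params}",
        headers={"Authorization": f"Bearer {token}"},
    )


def extract_counts(payload):
    """Turn a decoded JSON response into (citation_count, title) pairs."""
    docs = payload["response"]["docs"]
    # ADS returns the title as a list of strings; a missing count is treated as 0 here.
    return [(doc.get("citation_count", 0), doc["title"][0]) for doc in docs]


def fetch_counts(query, token):
    """Send the request and parse the result (needs network access and a valid token)."""
    with urlopen(build_request(query, token)) as response:
        return extract_counts(json.load(response))
```

Keeping request construction and response parsing separate from the network call makes both pieces easy to check without a token.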
