Vincent's Arxiv FrontPage


Generated on 2025-04-07.


This frontpage is built by scraping arXiv and running a sentence model that detects whether an abstract describes a paper on a topic of interest. One cool feature: almost all of it runs via GitHub Actions.
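
As a rough illustration of the idea, here is a minimal sketch of how abstracts could be sorted into topic sections. It is not the code behind this page: the `Paper` class, the `TOPIC_CUES` table, and every function name are hypothetical, and a toy cue-phrase scorer stands in for the trained sentence model.

```python
import re
from dataclasses import dataclass


@dataclass
class Paper:
    title: str
    date: str  # ISO format (YYYY-MM-DD), so string order equals date order
    abstract: str


# Hypothetical stand-in for a trained sentence model: cue phrases per topic.
TOPIC_CUES = {
    "New Datasets": ("new dataset", "we release", "we introduce a dataset"),
    "Benchmarks": ("benchmark", "state-of-the-art"),
}


def split_sentences(text):
    """Naive sentence splitter: break after ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def sentence_score(sentence, topic):
    """Toy scorer: 1.0 if any cue phrase for the topic appears, else 0.0.
    A real sentence model would return a learned probability here."""
    lowered = sentence.lower()
    return 1.0 if any(cue in lowered for cue in TOPIC_CUES[topic]) else 0.0


def topics_for(paper, threshold=0.5):
    """A paper gets a topic if at least one sentence scores above the threshold."""
    sentences = split_sentences(paper.abstract)
    return [
        topic
        for topic in TOPIC_CUES
        if any(sentence_score(s, topic) >= threshold for s in sentences)
    ]


def build_frontpage(papers):
    """Group (date, title) pairs by topic, newest first within each topic."""
    sections = {topic: [] for topic in TOPIC_CUES}
    for paper in sorted(papers, key=lambda p: p.date, reverse=True):
        for topic in topics_for(paper):
            sections[topic].append((paper.date, paper.title))
    return sections
```

Because a paper is tagged once per matching topic, the same title can show up in more than one section, which is why some entries below repeat across headings.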


New Datasets

2025-04-03

Limitations of Religious Data and the Importance of the Target Domain: Towards Machine Translation for Guinea-Bissau Creole

2025-04-03

EvoChain: A Framework for Tracking and Visualizing Smart Contract Evolution

2025-04-03

Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets

2025-04-03

MegaMath: Pushing the Limits of Open Math Corpora

2025-04-03

Concept Lancet: Image Editing with Compositional Representation Transplant

2025-04-02

SOLAQUA: SINTEF Ocean Large Aquaculture Robotics Dataset

2025-04-02

DISINFOX: an open-source threat exchange platform serving intelligence on disinformation and influence operations

2025-04-02

YourBench: Easy Custom Evaluation Sets for Everyone

2025-04-02

Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks

2025-04-02

Extending MovieLens-32M to Provide New Evaluation Objectives

2025-04-02

STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

2025-04-02

OpenCodeReasoning: Advancing Data Distillation for Competitive Coding

2025-04-02

Image Difference Grounding with Natural Language

2025-04-02

Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities

2025-03-31

FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics

2025-03-31

Visual Acoustic Fields

2025-03-31

Point Tracking in Surgery--The 2024 Surgical Tattoos in Infrared (STIR) Challenge

2025-03-31

Faster Releases, Fewer Risks: A Study on Maven Artifact Vulnerabilities and Lifecycle Management

2025-03-31

InstructRestore: Region-Customized Image Restoration with Human Instructions

2025-03-31

Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation

2025-03-27

Dataset and Analysis of Long-Term Skill Acquisition in Robot-Assisted Minimally Invasive Surgery

2025-03-27

UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning

2025-03-27

The MVTec AD 2 Dataset: Advanced Scenarios for Unsupervised Anomaly Detection

2025-03-27

COMI-LINGUA: Expert Annotated Large-Scale Dataset for Multitask NLP in Hindi-English Code-Mixing

2025-03-27

JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models' Detection of Human Self-Destructive Behavior Content in Jirai Community

2025-03-27

CMED: A Child Micro-Expression Dataset

2025-03-27

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

2025-03-27

Video-R1: Reinforcing Video Reasoning in MLLMs

2025-03-26

Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions

2025-03-26

AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports

2025-03-26

ARMO: Autoregressive Rigging for Multi-Category Objects

2025-03-26

Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy

2025-03-26

MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams

2025-03-26

BASKET: A Large-Scale Video Dataset for Fine-Grained Skill Estimation

2025-03-25

Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings

2025-03-25

BiPrompt-SAM: Enhancing Image Segmentation via Explicit Selection between Point and Text Prompts

2025-03-25

Outsourcing an Information Operation: A Complete Dataset of Tenet Media's Podcasts on Rumble

2025-03-25

LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset

2025-03-25

Scaling Down Text Encoders of Text-to-Image Diffusion Models

2025-03-24

SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction

2025-03-24

SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding

Data Quality

2025-04-02

Buggin: Automatic intrinsic bugs classification model using NLP and ML

2025-03-25

RCC-PFL: Robust Client Clustering under Noisy Labels in Personalized Federated Learning

2025-03-13

Learning Disease State from Noisy Ordinal Disease Progression Labels

2025-03-13

More Than Just Warnings: Exploring the Ways of Communicating Credibility Assessment on Social Media

2025-03-13

Unlock the Power of Unlabeled Data in Language Driving Model

Benchmarks

2025-04-03

BECAME: BayEsian Continual Learning with Adaptive Model MErging

2025-04-03

SCMPPI: Supervised Contrastive Multimodal Framework for Predicting Protein-Protein Interactions

2025-04-03

Pushing the Limit of PPG Sensing in Sedentary Conditions by Addressing Poor Skin-sensor Contact

2025-04-03

Curbing the Ramifications of Authorship Abuse in Science

2025-04-03

A Survey of Large Language Models in Mental Health Disorder Detection on Social Media

2025-04-02

Budget-Feasible Contracts

2025-04-02

Prompting Medical Vision-Language Models to Mitigate Diagnosis Bias by Generating Realistic Dermoscopic Images

2025-04-02

LARGE: Legal Retrieval Augmented Generation Evaluation Tool

2025-04-02

Enhanced Diffusion Sampling via Extrapolation with Multiple ODE Solutions

2025-04-02

Buggin: Automatic intrinsic bugs classification model using NLP and ML

2025-04-02

A Diffusion-Based Framework for Occluded Object Movement

2025-04-02

TransientTables: Evaluating LLMs' Reasoning on Temporally Evolving Semi-structured Tables

2025-04-02

Multi-fidelity Parameter Estimation Using Conditional Diffusion Models

2025-04-02

Build Code Needs Maintenance Too: A Study on Refactoring and Technical Debt in Build Systems

2025-04-02

Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation Framework

2025-04-02

Client Selection in Federated Learning with Data Heterogeneity and Network Latencies

2025-04-02

Review, Refine, Repeat: Understanding Iterative Decoding of AI Agents with Dynamic Evaluation and Selection

2025-04-02

Diffusion-Guided Gaussian Splatting for Large-Scale Unconstrained 3D Reconstruction and Novel View Synthesis

2025-04-02

Learning from Streaming Video with Orthogonal Gradients

2025-03-31

Spatio-temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications in Ridesharing

2025-03-31

Advanced Quantum Annealing Approach to Vehicle Routing Problems with Time Windows

2025-03-31

Fair Dynamic Spectrum Access via Fully Decentralized Multi-Agent Reinforcement Learning

2025-03-31

Point Tracking in Surgery--The 2024 Surgical Tattoos in Infrared (STIR) Challenge

2025-03-31

BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models

2025-03-31

Sample-Optimal Private Regression in Polynomial Time

2025-03-31

Contextual Preference Collaborative Measure Framework Based on Belief System

2025-03-31

Accelerated Approximate Optimization of Multi-Commodity Flows on Directed Graphs

2025-03-31

Easi3R: Estimating Disentangled Motion from DUSt3R Without Training

2025-03-27

Prompt, Divide, and Conquer: Bypassing Large Language Model Safety Filters via Segmented and Distributed Prompt Processing

2025-03-27

Audio-driven Gesture Generation via Deviation Feature in the Latent Space

2025-03-27

The MVTec AD 2 Dataset: Advanced Scenarios for Unsupervised Anomaly Detection

2025-03-27

ClusterSC: Advancing Synthetic Control with Donor Selection

2025-03-27

A Bespoke Design Approach to Low-Power Printed Microprocessors for Machine Learning Applications

2025-03-27

AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation

2025-03-27

Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance

2025-03-27

GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics

2025-03-27

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

2025-03-27

A Unified Framework for Diffusion Bridge Problems: Flow Matching and Schrödinger Matching into One

2025-03-27

Semantic Consistent Language Gaussian Splatting for Point-Level Open-vocabulary Querying

2025-03-27

Optimal Stepsize for Diffusion Sampling

2025-03-27

X²-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction

2025-03-26

Benchmarking and optimizing organism wide single-cell RNA alignment methods

2025-03-26

SChanger: Change Detection from a Semantic Change and Spatial Consistency Perspective

2025-03-26

Optimal Scaling Laws for Efficiency Gains in a Theoretical Transformer-Augmented Sectional MoE Framework

2025-03-26

ASGO: Adaptive Structured Gradient Optimization

2025-03-26

Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising

LLMs

2025-04-03

Affordable AI Assistants with Knowledge Graph of Thoughts

2025-04-03

LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems

2025-04-03

The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context

2025-04-03

TeleMoM: Consensus-Driven Telecom Intelligence via Mixture of Models

2025-04-03

Computing High-dimensional Confidence Sets for Arbitrary Distributions

2025-04-03

ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization

2025-04-03

Why do LLMs attend to the first token?

2025-04-03

Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study

2025-04-03

How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?

2025-04-03

MultiBLiMP 1.0: A Massively Multilingual Benchmark of Linguistic Minimal Pairs

2025-04-03

BT-ACTION: A Test-Driven Approach for Modular Understanding of User Instruction Leveraging Behaviour Trees and LLMs

2025-04-03

From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks

2025-04-03

A Framework for Robust Cognitive Evaluation of LLMs

2025-04-03

A Survey of Large Language Models in Mental Health Disorder Detection on Social Media

2025-04-03

MegaMath: Pushing the Limits of Open Math Corpora

2025-04-03

Generative Evaluation of Complex Reasoning in Large Language Models

2025-04-03

Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models

Developer Research

2025-04-03

The Myth of Immutability: A Multivocal Review on Smart Contract Upgradeability

2025-04-02

The Factors Influencing Well-Being in Software Engineers: A Cross-Country Mixed-Method Study

2025-04-02

Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks

2025-04-02

Build Code Needs Maintenance Too: A Study on Refactoring and Technical Debt in Build Systems

2025-03-31

MaintainCoder: Maintainable Code Generation Under Dynamic Requirements

2025-03-27

Enhancing Repository-Level Software Repair via Repository-Aware Knowledge Graphs

2025-03-25

SLA-Awareness for AI-assisted coding

2025-03-18

MANTRA: Enhancing Automated Method-Level Refactoring with Contextual RAG and Multi-Agent LLM Collaboration

Data Annotation Techniques