2024-12-16

Title: Blockchain Data Analysis in the Era of Large-Language Models

Title: Machine Learning Driven Smishing Detection Framework for Mobile Security

Title: Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Title: From Noise to Nuance: Advances in Deep Generative Image Models

Title: SEGT: A General Spatial Expansion Group Transformer for nuScenes Lidar-based Object Detection Task

Title: Vision-Language Models Represent Darker-Skinned Black Individuals as More Homogeneous than Lighter-Skinned Black Individuals

Title: The Cost of Replicability in Active Learning

Title: DQA: An Efficient Method for Deep Quantization of Deep Neural Network Activations

Title: TOAP: Towards Better Robustness in Universal Transferable Anti-Facial Retrieval

Title: Omni-ID: Holistic Identity Representation Designed for Generative Tasks

Title: Soybean Maturity Prediction using 2D Contour Plots from Drone based Time Series Imagery

Title: Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation

Title: GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

Title: The Unreasonable Effectiveness of Gaussian Score Approximation for Diffusion Models and its Applications

Title: Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models

Title: Bad Crypto: Chessography and Weak Randomness of Chess Games

Title: ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation

Title: Private Synthetic Data Generation in Small Memory

Title: L-WISE: Boosting Human Image Category Learning Through Model-Based Image Selection And Enhancement

Title: A Differentiable Wave Optics Model for End-to-End Computational Imaging System Optimization

Title: Is it the model or the metric -- On robustness measures of deeplearning models

Title: AutoPatent: A Multi-Agent Framework for Automatic Patent Generation

Title: deepNoC: A deep learning system to assign the number of contributors to a short tandem repeat DNA profile

Title: LLM Distillation for Efficient Few-Shot Multiple Choice Question Answering

Title: ScaleOT: Privacy-utility-scalable Offsite-tuning with Dynamic LayerReplace and Selective Rank Compression

Title: Temporal Causal Discovery in Dynamic Bayesian Networks Using Federated Learning

Title: Enhancing Multimodal Large Language Models Complex Reason via Similarity Computation

Title: MERaLiON-AudioLLM: Technical Report

Title: FDM-Bench: A Comprehensive Benchmark for Evaluating Large Language Models in Additive Manufacturing Tasks

Title: Empowering Patients for Disease Diagnosis and Clinical Treatment: A Smart Contract-Enabled Informed Consent Strategy

Title: Dynamic Try-On: Taming Video Virtual Try-on with Dynamic Attention Mechanism

Title: Low-Rank Adaptation with Task-Relevant Feature Enhancement for Fine-tuning Language Models

Title: MSC: Multi-Scale Spatio-Temporal Causal Attention for Autoregressive Video Diffusion

Title: Leveraging Programmatically Generated Synthetic Data for Differentially Private Diffusion Training

Title: Learning Structural Causal Models from Ordering: Identifiable Flow Models

Title: Real-time Identity Defenses against Malicious Personalization of Diffusion Models

Title: LinGen: Towards High-Resolution Minute-Length Text-to-Video Generation with Linear Computational Complexity

Title: Dynamic Cross-Modal Alignment for Robust Semantic Location Prediction

Title: Byte Latent Transformer: Patches Scale Better Than Tokens

Title: Selective State Space Memory for Large Vision-Language Models

Title: On the Limit of Language Models as Planning Formalizers

Title: Sharpening Your Density Fields: Spiking Neuron Aided Fast Geometry Learning

Title: Benchmarking Table Comprehension In The Wild

Title: T-GMSI: A transformer-based generative model for spatial interpolation under sparse measurements

Title: Analyzing Fairness of Classification Machine Learning Model with Structured Dataset

Title: Analyzing Fairness of Computer Vision and Natural Language Processing Models

Title: Enhancing the Reasoning Capabilities of Small Language Models via Solution Guidance Fine-Tuning

Title: IQViC: In-context, Question Adaptive Vision Compressor for Long-term Video Understanding LMMs

Title: Prompt2Perturb (P2P): Text-Guided Diffusion-Based Adversarial Attacks on Breast Ultrasound Images

Title: All-in-One: Transferring Vision Foundation Models into Stereo Matching

Title: B-VLLM: A Vision Large Language Model with Balanced Spatio-Temporal Tokens

Title: FaceShield: Defending Facial Image against Deepfake Threats

Title: Simulating Hard Attention Using Soft Attention

Title: SCRUBD: Smart Contracts Reentrancy and Unhandled Exceptions Vulnerability Dataset

Title: CaLoRAify: Calorie Estimation with Visual-Text Pairing and LoRA-Driven Visual Language Models

Title: Enhancing Nursing and Elderly Care with Large Language Models: An AI-Driven Framework

Title: Towards Fair Graph Neural Networks via Graph Counterfactual without Sensitive Attributes

Title: Llama 3 Meets MoE: Efficient Upcycling

Title: $\textrm{A}^{\textrm{2}}$RNet: Adversarial Attack Resilient Network for Robust Infrared and Visible Image Fusion

Title: Efficient Dataset Distillation via Diffusion-Driven Patch Selection for Improved Generalization

Title: END$^2$: Robust Dual-Decoder Watermarking Framework Against Non-Differentiable Distortions

Title: EP-CFG: Energy-Preserving Classifier-Free Guidance

Title: Efficient Large-Scale Traffic Forecasting with Transformers: A Spatial Data Management Perspective

Title: SUMI-IFL: An Information-Theoretic Framework for Image Forgery Localization with Sufficiency and Minimality Constraints

Title: SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video

Title: Small Language Model as Data Prospector for Large Language Model

Title: A Comparative Study of LLMs, NMT Models, and Their Combination in Persian-English Idiom Translation

Title: Mr. DETR: Instructive Multi-Route Training for Detection Transformers

Title: Object-Focused Data Selection for Dense Prediction Tasks

Title: Timealign: A multi-modal object detection method for time misalignment fusing in autonomous driving

Title: SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution

Title: TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views

Title: Unsupervised Named Entity Disambiguation for Low Resource Domains

Title: GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs?

Title: Quaffure: Real-Time Quasi-Static Neural Hair Simulation

Title: Text2Cypher: Bridging Natural Language and Graph Databases

Title: Lost in the Middle, and In-Between: Enhancing Language Models' Ability to Reason Over Long Contexts in Multi-Hop QA

Title: RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector

Title: The Art of Deception: Color Visual Illusions and Diffusion Models

Title: Feature Selection for Latent Factor Models

Title: ASLoRA: Adaptive Sharing Low-Rank Adaptation Across Layers

Title: Can LLMs Convert Graphs to Text-Attributed Graphs?

Title: ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL

Title: UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection

Title: SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models

Title: Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer

Title: BiCert: A Bilinear Mixed Integer Programming Formulation for Precise Certified Bounds Against Data Poisoning Attacks

Title: Simple Guidance Mechanisms for Discrete Diffusion Models

Title: From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection

Title: Integrative Analysis of Financial Market Sentiment Using CNN and GRU for Risk Prediction and Alert Systems

Title: Retrieval-Augmented Semantic Parsing: Using Large Language Models to Improve Generalization

Title: Efficient Generative Modeling with Residual Vector Quantization-Based Tokens

Title: GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion

Title: Learning Complex Non-Rigid Image Edits from Multimodal Conditioning

Title: SPT: Sequence Prompt Transformer for Interactive Image Segmentation

Title: SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians

Title: Efficient Continual Pre-training of LLMs for Low-resource Languages

Title: Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts

Title: Targeted Angular Reversal of Weights (TARS) for Knowledge Removal in Large Language Models

Title: MVQ: Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization

Title: Adversarial Robustness of Bottleneck Injected Deep Neural Networks for Task-Oriented Communication

Title: Reasoner Outperforms: Generative Stance Detection with Rationalization for Social Media

Title: Benchmarking Linguistic Diversity of Large Language Models

Title: Probabilistic Inverse Cameras: Image to 3D via Multiview Geometry

Title: TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation

Title: One world, one opinion? The superstar effect in LLM responses

Title: Still "Talking About Large Language Models": Some Clarifications

Title: Prompt-Guided Mask Proposal for Two-Stage Open-Vocabulary Segmentation

Title: Coherent 3D Scene Diffusion From a Single RGB Image

Title: BrushEdit: All-In-One Image Inpainting and Editing

Title: SCBench: A KV Cache-Centric Analysis of Long-Context Methods

Title: AdvPrefix: An Objective for Nuanced LLM Jailbreaks

Title: Generative AI in Medicine

Title: XYScanNet: An Interpretable State Space Model for Perceptual Image Deblurring

Title: A Universal Degradation-based Bridging Technique for Domain Adaptive Semantic Segmentation

Title: Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining

Title: VibrantVS: A high-resolution multi-task transformer for forest canopy height estimation

Title: Robust image classification with multi-modal large language models

Title: UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities