GitHub - WailordHe/cv-arxiv-daily-wailord: 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)

[![Contributors][contributors-shield]][contributors-url] [![Forks][forks-shield]][forks-url] [![Stargazers][stars-shield]][stars-url] [![Issues][issues-shield]][issues-url]

Updated on 2024.11.17

Usage instructions: here

Table of Contents

Computer Vision
LLM

Computer Vision

Publish Date	Title	Authors	PDF	Code
2024-11-14	Masked Image Modeling Boosting Semi-Supervised Semantic Segmentation	Yangyang Li et.al.	2411.08756	null
2024-11-06	Retentive Neural Quantum States: Efficient Ansätze for Ab Initio Quantum Chemistry	Oliver Knitter et.al.	2411.03900	null
2024-10-31	Domain-Adaptive Pre-training of Self-Supervised Foundation Models for Medical Image Classification in Gastrointestinal Endoscopy	Marcel Roth et.al.	2410.21302	null
2024-10-10	C^2DA: Contrastive and Context-aware Domain Adaptive Semantic Segmentation	Md. Al-Masrur Khan et.al.	2410.19748	link
2024-10-22	Towards Real Zero-Shot Camouflaged Object Segmentation without Camouflaged Annotations	Cheng Lei et.al.	2410.16953	null
2024-10-21	TIPS: Text-Image Pretraining with Spatial Awareness	Kevis-Kokitsi Maninis et.al.	2410.16512	null
2024-10-31	Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective	Yongxin Zhu et.al.	2410.12490	link
2024-10-14	Enhancing JEPAs with Spatial Conditioning: Robust and Efficient Representation Learning	Etai Littwin et.al.	2410.10773	null
2024-10-14	LADMIM: Logical Anomaly Detection with Masked Image Modeling in Discrete Latent Space	Shunsuke Sakai et.al.	2410.10234	null
2024-10-10	Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis	Jinbin Bai et.al.	2410.08261	link
2024-10-25	OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling	Linhui Xiao et.al.	2410.08021	link
2024-10-11	Self-Supervised Learning for Real-World Object Detection: a Survey	Alina Ciocarlan et.al.	2410.07442	null
2024-10-09	Robust infrared small target detection using self-supervised and a contrario paradigms	Alina Ciocarlan et.al.	2410.07437	null
2024-10-09	Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers	Stephen Hausler et.al.	2410.06614	null
2024-10-05	RetCompletion:High-Speed Inference Image Completion with Retentive Network	Yueyang Cang et.al.	2410.04056	null
2024-10-02	Denoising with a Joint-Embedding Predictive Architecture	Dengsheng Chen et.al.	2410.03755	null
2024-10-02	Performant, Memory Efficient and Scalable Multi-Agent Reinforcement Learning	Omayma Mahjoub et.al.	2410.01706	null
2024-09-30	MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation	Wenchao Chen et.al.	2409.19937	null
2024-09-28	Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration	Chu-Jie Qin et.al.	2409.19403	link
2024-09-30	UniEmoX: Cross-modal Semantic-Guided Large-Scale Pretraining for Universal Scene Emotion Perception	Chuang Chen et.al.	2409.18877	link
2024-09-26	Self-supervised Pretraining for Cardiovascular Magnetic Resonance Cine Segmentation	Rob A. J. de Mooij et.al.	2409.18100	link
2024-09-20	Leveraging Text Localization for Scene Text Removal via Text-aware Masked Image Modeling	Zixiao Wang et.al.	2409.13431	link
2024-09-13	Interactive Masked Image Modeling for Multimodal Object Detection in Remote Sensing	Minh-Duc Vu et.al.	2409.08885	null
2024-09-13	Hybrid-TTA: Continual Test-time Adaptation via Dynamic Domain Shift Detection	Hyewon Park et.al.	2409.08566	null
2024-09-04	MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling	Jihye Ahn et.al.	2409.02846	null
2024-09-04	SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction	Sumin Son et.al.	2409.02513	null
2024-08-21	AttDiCNN: Attentive Dilated Convolutional Neural Network for Automatic Sleep Staging using Visibility Graph and Force-directed Layout	Md Jobayer et.al.	2409.01962	null
2024-09-14	Dual Advancement of Representation Learning and Clustering for Sparse and Noisy Images	Wenlin Li et.al.	2409.01781	link
2024-08-28	Online pre-training with long-form videos	Itsuki Kato et.al.	2408.15651	null
2024-08-23	MICM: Rethinking Unsupervised Pretraining for Enhanced Few-shot Learning	Zhenyu Zhang et.al.	2408.13385	link
2024-08-23	Symmetric masking strategy enhances the performance of Masked Image Modeling	Khanh-Binh Nguyen et.al.	2408.12772	null
2024-08-13	Membership Inference Attack Against Masked Image Modeling	Zheng Li et.al.	2408.06825	null
2024-08-13	Masked Image Modeling: A Survey	Vlad Hondru et.al.	2408.06687	null
2024-08-11	HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training	Fenghe Tang et.al.	2408.05815	link
2024-08-20	PersonViT: Large-scale Self-supervised Vision Transformer for Person Re-Identification	Bin Hu et.al.	2408.05398	link
2024-08-15	AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation	Asbjørn Munk et.al.	2408.00640	null
2024-07-29	Short-Term Forecasting of Photovoltaic Power Generation Based on Entropy during the Foggy Winter	Xuan Yang et.al.	2407.19663	null
2024-08-02	XLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training	Biao Wu et.al.	2407.19546	link
2024-07-23	QPT V2: Masked Image Modeling Advances Visual Scoring	Qizhi Xie et.al.	2407.16541	link
2024-07-22	Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning	Yibing Wei et.al.	2407.15837	link
2024-07-20	Self-supervised transformer-based pre-training method with General Plant Infection dataset	Zhengle Wang et.al.	2407.14911	null
2024-07-20	Universal Medical Imaging Model for Domain Generalization with Data Privacy	Ahmed Radwan et.al.	2407.14719	null
2024-07-18	Keypoint Aware Masked Image Modelling	Madhava Krishna et.al.	2407.13873	link
2024-07-18	X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs	Sirnam Swetha et.al.	2407.13851	null
2024-07-16	AEMIM: Adversarial Examples Meet Masked Image Modeling	Wenzhao Xiang et.al.	2407.11537	null
2024-07-16	EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis	Ruijie Yang et.al.	2407.11401	null
2024-07-13	ST-RetNet: A Long-term Spatial-Temporal Traffic Flow Prediction Method	Baichao Long et.al.	2407.11074	null
2024-07-12	On the Role of Discrete Tokenization in Visual Representation Learning	Tianqi Du et.al.	2407.09087	null
2024-07-12	Tissue-Contrastive Semi-Masked Autoencoders for Segmentation Pretraining on Chest CT	Jie Zheng et.al.	2407.08961	null
2024-07-15	Spatial-Temporal Attention Model for Traffic State Estimation with Sparse Internet of Vehicles	Jianzhe Xue et.al.	2407.08047	null
2024-07-09	D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms	Tajamul Ashraf et.al.	2407.06585	null
2024-07-16	AnatoMask: Enhancing Medical Image Segmentation with Reconstruction-guided Self-masking	Yuheng Li et.al.	2407.06468	link
2024-06-25	Investigating Self-Supervised Methods for Label-Efficient Learning	Srinivasa Rao Nandam et.al.	2406.17460	null
2024-06-25	Pseudo Labelling for Enhanced Masked Autoencoders	Srinivasa Rao Nandam et.al.	2406.17450	null
2024-06-18	GFM4MPM: Towards Geospatial Foundation Models for Mineral Prospectivity Mapping	Angel Daruna et.al.	2406.12756	null
2024-06-17	Scaling Efficient Masked Autoencoder Learning on Large Remote Sensing Dataset	Fengxiang Wang et.al.	2406.11933	link
2024-06-15	SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation	Yike Yuan et.al.	2406.10673	link
2024-06-11	Visual Representation Learning with Stochastic Frame Prediction	Huiwon Jang et.al.	2406.07398	null
2024-06-08	Medical Vision Generalist: Unifying Medical Imaging Tasks in Context	Sucheng Ren et.al.	2406.05565	link
2024-06-03	Boosting Spatial-Spectral Masked Auto-Encoder Through Mining Redundant Spectra for HSI-SAR/LiDAR Classification	Junyan Lin et.al.	2406.01235	null
2024-06-06	Whole Heart 3D+T Representation Learning Through Sparse 2D Cardiac MR Images	Yundi Zhang et.al.	2406.00329	null
2024-06-14	Enhancing Vision-Language Model with Unmasked Token Alignment	Jihao Liu et.al.	2405.19009	link
2024-05-28	Visualizing the loss landscape of Self-supervised Vision Transformer	Youngwan Lee et.al.	2405.18042	null
2024-05-23	Masked Image Modelling for retinal OCT understanding	Theodoros Pissas et.al.	2405.14788	null
2024-05-14	CLIP with Quality Captions: A Strong Pretraining for Vision Tasks	Pavan Kumar Anasosalu Vasu et.al.	2405.08911	null
2024-05-11	Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition	Zuan Gao et.al.	2405.05841	null
2024-05-09	Efficient Pretraining Model based on Multi-Scale Local Visual Field Feature Reconstruction for PCB CT Image Element Segmentation	Chen Chen et.al.	2405.05745	null
2024-05-06	Intra-task Mutual Attention based Vision Transformer for Few-Shot Learning	Weihao Jiang et.al.	2405.03109	null
2024-05-02	Self-Supervised Learning for Interventional Image Analytics: Towards Robust Device Trackers	Saahil Islam et.al.	2405.01156	null
2024-05-25	An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training	Jin Gao et.al.	2404.12210	link
2024-04-18	How to Benchmark Vision Foundation Models for Semantic Segmentation?	Tommie Kerssies et.al.	2404.12172	null
2024-04-15	XoFTR: Cross-modal Feature Matching Transformer	Önder Tuzcuoğlu et.al.	2404.09692	null
2024-04-13	Label-free Anomaly Detection in Aerial Agricultural Images with Masked Image Modeling	Sambal Shikhar et.al.	2404.08931	null
2024-04-12	Masked Image Modeling as a Framework for Self-Supervised Learning across Eye Movements	Robin Weiler et.al.	2404.08526	link
2024-04-12	Emerging Property of Masked Token for Effective Pre-training	Hyesong Choi et.al.	2404.08330	null
2024-04-12	Salience-Based Adaptive Masking: Revisiting Token Dynamics for Enhanced Pre-training	Hyesong Choi et.al.	2404.08327	null
2024-04-12	A Novel Vision Transformer based Load Profile Analysis using Load Images as Inputs	Hyeonjin Kim et.al.	2404.08175	null
2024-04-03	A Unified Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability	Jie Zhu et.al.	2404.02462	link
2024-04-01	Bridging Remote Sensors with Multisensor Geospatial Foundation Models	Boran Han et.al.	2404.01260	link
2024-04-01	SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining	Chull Hwan Song et.al.	2404.01156	null
2024-03-31	Learning to Rank Patches for Unbiased Image Redundancy Reduction	Yang Luo et.al.	2404.00680	link
2024-03-31	DailyMAE: Towards Pretraining Masked Autoencoders in One Day	Jiantao Wu et.al.	2404.00509	link
2024-03-23	Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression	Hancheng Ye et.al.	2403.15835	link
2024-03-14	Explore In-Context Segmentation via Latent Diffusion Models	Chaoyang Wang et.al.	2403.09616	null
2024-03-13	MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning	Jialv Zou et.al.	2403.08760	link
2024-03-20	Content-aware Masked Image Modeling Transformer for Stereo Image Compression	Xinjie Zhang et.al.	2403.08505	null
2024-03-07	Masked Capsule Autoencoders	Miles Everett et.al.	2403.04724	null
2024-03-04	Transformers Provably Learn Feature-Position Correlations in Masked Image Modeling	Yu Huang et.al.	2403.02233	null
2024-03-01	Learning and Leveraging World Models in Visual Representation Learning	Quentin Garrido et.al.	2403.00504	null
2024-03-01	Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training	Haowei Liu et.al.	2403.00249	null
2024-02-27	Masked Gamma-SSL: Learning Uncertainty Estimation via Masked Image Modeling	David S. W. Williams et.al.	2402.17622	null
2024-03-08	A Simple Framework Uniting Visual In-context Learning with Masked Image Modeling to Improve Ultrasound Segmentation	Yuyue Zhou et.al.	2402.14300	link
2024-02-15	MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations	Benedikt Alkin et.al.	2402.10093	link
2024-02-13	Improving Token-Based World Models with Parallel Observation Prediction	Lior Cohen et.al.	2402.05643	link
2024-02-07	Sparse Anatomical Prompt Semi-Supervised Learning with Masked Image Modeling for CBCT Tooth Segmentation	Pengyu Dai et.al.	2402.04587	null
2024-01-24	Learning Representations for Clustering via Partial Information Discrimination and Cross-Level Interaction	Hai-Xin Zhang et.al.	2401.13503	link
2024-01-23	Correlation-Embedded Transformer Tracking: A Single-Branch Framework	Fei Xie et.al.	2401.12743	link
2024-01-15	Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing	Jakob Hackstein et.al.	2401.07782	link
2024-01-15	One for All: Toward Unified Foundation Models for Earth Vision	Zhitong Xiong et.al.	2401.07527	null
2024-01-17	Frequency Masking for Universal Deepfake Detection	Chandler Timm Doloriel et.al.	2401.06506	link
2024-01-05	Fus-MAE: A cross-attention-based data fusion approach for Masked Autoencoders in remote sensing	Hugo Chan-To-Hing et.al.	2401.02764	null
2024-01-05	MOODv2: Masked Image Modeling for Out-of-Distribution Detection	Jingyao Li et.al.	2401.02611	null
2024-01-04	SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment	Ziping Ma et.al.	2401.02137	null
2024-01-03	aMUSEd: An Open MUSE Reproduction	Suraj Patil et.al.	2401.01808	link
2023-12-31	Analyzing Local Representations of Self-supervised Vision Transformers	Ani Vanyan et.al.	2401.00463	null
2023-12-30	Masked Image Modeling via Dynamic Token Morphing	Taekyung Kim et.al.	2401.00254	null
2024-01-02	USFM: A Universal Ultrasound Foundation Model Generalized to Tasks and Organs towards Label Efficient Image Analysis	Jing Jiao et.al.	2401.00153	null
2023-12-27	Learning to Embed Time Series Patches Independently	Seunghan Lee et.al.	2312.16427	link
2023-12-19	DMT: Comprehensive Distillation with Multiple Self-supervised Teachers	Yuang Liu et.al.	2312.11938	null
2023-12-13	PAD: Self-Supervised Pre-Training with Patchwise-Scale Adapter for Infrared Images	Tao Zhang et.al.	2312.08192	link
2023-12-12	Pre-trained Universal Medical Image Transformer	Lingxiao Luo et.al.	2312.07630	link
2023-12-08	MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness	Xiaoyun Xu et.al.	2312.04960	link
2023-12-07	Intelligent Anomaly Detection for Lane Rendering Using Transformer with Self-Supervised Pre-Training and Customized Fine-Tuning	Yongqi Dong et.al.	2312.04398	null
2023-12-11	Learning Cortical Anomaly through Masked Encoding for Unsupervised Heterogeneity Mapping	Hao-Chun Yang et.al.	2312.02762	null
2023-12-02	Local Masking Meets Progressive Freezing: Crafting Efficient Vision Transformers for Self-Supervised Learning	Utku Mert Topcuoglu et.al.	2312.02194	link
2023-12-01	Improve Supervised Representation Learning with Masked Image Modeling	Kaifeng Chen et.al.	2312.00950	null
2023-11-28	BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling	Yixuan Luo et.al.	2311.17218	null
2023-11-29	Cross-Axis Transformer with 2D Rotary Embeddings	Lily Erickson et.al.	2311.07184	null
2023-11-08	Self-Supervised Learning for Visual Relationship Detection through Masked Bounding Box Reconstruction	Zacharias Anastasakis et.al.	2311.04834	link
2023-11-08	SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification	Junyan Lin et.al.	2311.04442	link
2023-10-31	HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception	Junkun Yuan et.al.	2310.20695	null
2023-10-30	ViR: Vision Retention Networks	Ali Hatamizadeh et.al.	2310.19731	null
2023-10-29	BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping	Srikumar Sastry et.al.	2310.19168	link
2023-11-20	Adversarial Examples Are Not Real Features	Ang Li et.al.	2310.18936	null
2023-10-28	Pre-training with Random Orthogonal Projection Image Modeling	Maryam Haghighat et.al.	2310.18737	null
2023-10-28	Feature Guided Masked Autoencoder for Self-supervised Learning in Remote Sensing	Yi Wang et.al.	2310.18653	link
2023-10-20	Longer-range Contextualized Masked Autoencoder	Taekyung Kim et.al.	2310.13593	null
2023-10-19	Minimalist and High-Performance Semantic Segmentation with Plain Vision Transformers	Yuanduo Hong et.al.	2310.12755	link
2023-10-11	Heuristic Vision Pre-Training with Self-Supervised and Supervised Multi-Task Learning	Zhiming Qian et.al.	2310.07510	null
2023-10-10	Pre-Trained Masked Image Model for Mobile Robot Navigation	Vishnu Dutt Sharma et.al.	2310.07021	null
2023-10-31	RetSeg: Retention-based Colorectal Polyps Segmentation Network	Khaled ELKarazle et.al.	2310.05446	null
2023-10-06	Self-Supervised Neuron Segmentation with Multi-Agent Reinforcement Learning	Yinda Chen et.al.	2310.04148	link
2023-10-02	Self-distilled Masked Attention guided masked image modeling with noise Regularized Teacher (SMART) for medical image analysis	Jue Jiang et.al.	2310.01209	null
2023-10-15	Information Flow in Self-Supervised Learning	Zhiquan Tan et.al.	2309.17281	link
2023-09-26	M $^{3}$ 3D: Learning 3D priors using Multi-Modal Masked Autoencoders for 2D image and video understanding	Muhammad Abdullah Jamal et.al.	2309.15313	null
2023-10-08	Masked Image Residual Learning for Scaling Deeper Vision Transformers	Guoxi Huang et.al.	2309.14136	link
2023-10-11	RMT: Retentive Networks Meet Vision Transformers	Qihang Fan et.al.	2309.11523	null
2023-09-18	Heterogeneous Generative Knowledge Distillation with Masked Image Modeling	Ziming Wang et.al.	2309.09571	null
2023-09-18	FactoFormer: Factorized Hyperspectral Transformers with Self-Supervised Pre-Training	Shaheer Mohamed et.al.	2309.09431	link
2023-09-16	RingMo-lite: A Remote Sensing Multi-task Lightweight Network with CNN-Transformer Hybrid Framework	Yuelei Wang et.al.	2309.09003	null
2023-09-14	Unleashing the Power of Depth and Pose Estimation Neural Networks by Designing Compatible Endoscopic Images	Junyang Wu et.al.	2309.07390	null
2023-09-11	SCD-Net: Spatiotemporal Clues Disentanglement Network for Self-supervised Skeleton-based Action Recognition	Cong Wu et.al.	2309.05834	null
2023-09-11	An Effective Two-stage Training Paradigm Detector for Small Dataset	Zheng Wang et.al.	2309.05652	null
2023-09-09	BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification	Takuro Fujii et.al.	2309.04675	null
2023-09-08	AMLP:Adaptive Masking Lesion Patches for Self-supervised Medical Image Segmentation	Xiangtao Wang et.al.	2309.04312	null

(back to top)

LLM

Publish Date	Title	Authors	PDF	Code
2024-11-14	MagicQuill: An Intelligent Interactive Image Editing System	Zichen Liu et.al.	2411.09703	null
2024-11-14	Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models	Wei Wang et.al.	2411.09691	null
2024-11-14	Squeezed Attention: Accelerating Long Context Length LLM Inference	Coleman Hooper et.al.	2411.09688	null
2024-11-14	Local deployment of large-scale music AI models on commodity hardware	Xun Zhou et.al.	2411.09625	null
2024-11-14	PTR: Precision-Driven Tool Recommendation for Large Language Models	Hang Gao et.al.	2411.09613	null
2024-11-14	The Moral Foundations Weibo Corpus	Renjie Cao et.al.	2411.09612	null
2024-11-14	Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework	Ronak Pradeep et.al.	2411.09607	null
2024-11-14	Accelerating Knowledge Graph and Ontology Engineering with Large Language Models	Cogan Shimizu et.al.	2411.09601	null
2024-11-14	LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models	Zhengyi Wang et.al.	2411.09595	null
2024-11-14	Adopting RAG for LLM-Aided Future Vehicle Design	Vahid Zolfaghari et.al.	2411.09590	null
2024-11-13	The Limited Impact of Medical Adaptation of Large Language and Vision-Language Models	Daniel P. Jeong et.al.	2411.08870	null
2024-11-13	LLMStinger: Jailbreaking LLMs using RL fine-tuned LLMs	Piyush Jha et.al.	2411.08862	null
2024-11-13	Multimodal Instruction Tuning with Hybrid State Space Models	Jianing Zhou et.al.	2411.08840	null
2024-11-13	FinRobot: AI Agent for Equity Research and Valuation with Large Language Models	Tianyu Zhou et.al.	2411.08804	link
2024-11-13	Evaluating World Models with LLM for Decision Making	Chang Yang et.al.	2411.08794	null
2024-11-13	Can sparse autoencoders be used to decompose and interpret steering vectors?	Harry Mayne et.al.	2411.08790	link
2024-11-13	Separating Tongue from Thought: Activation Patching Reveals Language-Agnostic Concept Representations in Transformers	Clément Dumas et.al.	2411.08745	link
2024-11-13	A Comparative Study of Discrete Speech Tokens for Semantic-Related Tasks with Large Language Models	Dingdong Wang et.al.	2411.08742	null
2024-11-14	Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models	Somanshu Singla et.al.	2411.08733	null
2024-11-13	Polymetis:Large Language Modeling for Multiple Material Domains	Chao Huang et.al.	2411.08728	null
2024-11-12	Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data	Juanhui Li et.al.	2411.08028	null
2024-11-12	LLMPhy: Complex Physical Reasoning Using Large Language Models and World Models	Anoop Cherian et.al.	2411.08027	null
2024-11-12	Language Models as Causal Effect Generators	Lucius E. J. Bynum et.al.	2411.08019	link
2024-11-12	ExpressivityArena: Can LLMs Express Information Implicitly?	Joshua Tint et.al.	2411.08010	null
2024-11-12	Can adversarial attacks by large language models be attributed?	Manuel Cebrian et.al.	2411.08003	null
2024-11-12	Derivational Morphology Reveals Analogical Generalization in Large Language Models	Valentin Hofmann et.al.	2411.07990	null
2024-11-12	JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation	Yiyang Ma et.al.	2411.07975	null
2024-11-12	From General to Specific: Utilizing General Hallucation to Automatically Measure the Role Relationship Fidelity for Specific Role-Play Agents	Chuyi Kong et.al.	2411.07965	null
2024-11-12	Towards Low-bit Communication for Tensor Parallel LLM Inference	Harry Dong et.al.	2411.07942	null
2024-11-12	Leveraging Multimodal Models for Enhanced Neuroimaging Diagnostics in Alzheimer's Disease	Francesco Chiumento et.al.	2411.07871	null
2024-11-11	UTMath: Math Evaluation with Unit Test via Reasoning-to-Coding Thoughts	Bo Yang et.al.	2411.07240	null
2024-11-11	OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model	Sumeth Yuenyong et.al.	2411.07238	null
2024-11-11	Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving	Botao Yu et.al.	2411.07228	null
2024-11-11	TreeCoders: Trees of Transformers	Pierre Colonna D'Istria et.al.	2411.07218	null
2024-11-11	Comparing Bottom-Up and Top-Down Steering Approaches on In-Context Learning Tasks	Madeline Brumley et.al.	2411.07213	null
2024-11-11	DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID	Nyle Siddiqui et.al.	2411.07205	link
2024-11-11	The Super Weight in Large Language Models	Mengxia Yu et.al.	2411.07191	link
2024-11-11	NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics	David Robinson et.al.	2411.07186	null
2024-11-11	Continual Memorization of Factoids in Large Language Models	Howard Chen et.al.	2411.07175	link
2024-11-11	A Domain-Agnostic Neurosymbolic Approach for Big Social Data Analysis: Evaluating Mental Health Sentiment on Social Media during COVID-19	Vedant Khandelwal et.al.	2411.07163	null
2024-11-08	Recycled Attention: Efficient inference for long-context language models	Fangyuan Xu et.al.	2411.05787	null
2024-11-08	Fact or Fiction? Can LLMs be Reliable Annotators for Political Truths?	Veronica Chatrath et.al.	2411.05775	null
2024-11-08	Multi-hop Evidence Pursuit Meets the Web: Team Papelo at FEVER 2024	Christopher Malon et.al.	2411.05762	null
2024-11-08	Unmasking the Limits of Large Language Models: A Systematic Evaluation of Masked Text Processing Ability through MskQA and MskCal	Fuka Matsuzaki et.al.	2411.05665	link
2024-11-08	The influence of persona and conversational task on social interactions with a LLM-controlled embodied conversational agent	Leon O. H. Kroczek et.al.	2411.05653	null
2024-11-08	LightVA: Lightweight Visual Analytics with LLM Agent-Based Task Planning and Execution	Yuheng Zhao et.al.	2411.05651	null
2024-11-08	Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation	Long Truong To et.al.	2411.05641	null
2024-11-08	Assessing Open-Source Large Language Models on Argumentation Mining Subtasks	Mohammad Yeghaneh Abkenar et.al.	2411.05639	null
2024-11-08	A Two-Step Concept-Based Approach for Enhanced Interpretability and Trust in Skin Lesion Diagnosis	Cristiano Patrício et.al.	2411.05609	null
2024-11-08	Evaluating and Adapting Large Language Models to Represent Folktales in Low-Resource Languages	JA Meaney et.al.	2411.05593	null
2024-11-07	SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models	Muyang Li et.al.	2411.05007	link
2024-11-07	Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?	Jonathan Roberts et.al.	2411.05000	null
2024-11-07	LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation	Weiquan Huang et.al.	2411.04997	link
2024-11-07	Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models	Weixin Liang et.al.	2411.04996	null
2024-11-07	Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives	Hao Sun et.al.	2411.04991	link
2024-11-07	Enhancing Reverse Engineering: Investigating and Benchmarking Large Language Models for Vulnerability Analysis in Decompiled Binaries	Dylan Manuel et.al.	2411.04981	null
2024-11-07	SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference	Gabriele Oliaro et.al.	2411.04975	null
2024-11-07	BitNet a4.8: 4-bit Activations for 1-bit LLMs	Hongyu Wang et.al.	2411.04965	null
2024-11-07	Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability	Yanjun Gao et.al.	2411.04962	null
2024-11-07	CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM	Jingwei Xu et.al.	2411.04954	null
2024-11-06	Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress?	Daniel P. Jeong et.al.	2411.04118	null
2024-11-07	How Transformers Solve Propositional Logic Problems: A Mechanistic Analysis	Guan Zhe Hong et.al.	2411.04105	null
2024-11-06	Textual Decomposition Then Sub-motion-space Scattering for Open-Vocabulary Motion Generation	Ke Fan et.al.	2411.04079	null
2024-11-06	Beemo: Benchmark of Expert-edited Machine-generated Outputs	Ekaterina Artemova et.al.	2411.04032	null
2024-11-06	Prompt Engineering Using GPT for Word-Level Code-Mixed Language Identification in Low-Resource Dravidian Languages	Aniket Deroy et.al.	2411.04025	null
2024-11-06	$k$ NN Attention Demystified: A Theoretical Exploration for Scalable Transformers	Themistoklis Haris et.al.	2411.04013	null
2024-11-06	Customized Multiple Clustering via Multi-Modal Subspace Proxy Learning	Jiawei Yao et.al.	2411.03978	null
2024-11-06	What Really is Commonsense Knowledge?	Quyet V. Do et.al.	2411.03964	null
2024-11-06	How Does A Text Preprocessing Pipeline Affect Ontology Syntactic Matching?	Zhangcheng Qiang et.al.	2411.03962	null
2024-11-06	Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation	Yuhang Liu et.al.	2411.03957	null
2024-11-05	LLMs for Domain Generation Algorithm Detection	Reynier Leyva La O et.al.	2411.03307	null
2024-11-05	VERITAS: A Unified Approach to Reliability Evaluation	Rajkumar Ramamurthy et.al.	2411.03300	null
2024-11-05	Examining Human-AI Collaboration for Co-Writing Constructive Comments Online	Farhana Shahid et.al.	2411.03295	null
2024-11-05	Interaction2Code: How Far Are We From Automatic Interactive Webpage Generation?	Jingyu Xiao et.al.	2411.03292	null
2024-11-05	The Future of Intelligent Healthcare: A Systematic Analysis and Discussion on the Integration and Impact of Robots Using Large Language Models for Healthcare	Souren Pashangpour et.al.	2411.03287	null
2024-11-05	SMoA: Improving Multi-agent Large Language Models with Sparse Mixture-of-Agents	Dawei Li et.al.	2411.03284	link
2024-11-05	ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal	Xiujin Zhu et.al.	2411.03260	null
2024-11-05	Spontaneous Emergence of Agent Individuality through Social Interactions in LLM-Based Communities	Ryosuke Takata et.al.	2411.03252	null
2024-11-05	DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models	Ying Zhou et.al.	2411.03250	null
2024-11-05	From Pen to Prompt: How Creative Writers Integrate AI into their Writing Practice	Alicia Guo et.al.	2411.03137	null
2024-11-04	Training-free Regional Prompting for Diffusion Transformers	Anthony Chen et.al.	2411.02395	link
2024-11-04	Adaptive Length Image Tokenization via Recurrent Allocation	Shivam Duggal et.al.	2411.02393	link
2024-11-04	Improving Scientific Hypothesis Generation with Knowledge Grounded Large Language Models	Guangzhi Xiong et.al.	2411.02382	null
2024-11-04	Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI	Ramneet Kaur et.al.	2411.02381	null
2024-11-04	DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution	Yang Yue et.al.	2411.02359	link
2024-11-04	"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization	Eldar Kurtic et.al.	2411.02355	null
2024-11-04	Social-RAG: Retrieving from Group Interactions to Socially Ground Proactive AI Generation to Group Preferences	Ruotong Wang et.al.	2411.02353	null
2024-11-04	Can Large Language Models generalize analogy solving like people can?	Claire E. Stevenson et.al.	2411.02348	null
2024-11-04	WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning	Zehan Qi et.al.	2411.02337	null
2024-11-04	Sparsing Law: Towards Large Language Models with Greater Activation Sparsity	Yuqi Luo et.al.	2411.02335	null
2024-11-01	DELTA: Dense Efficient Long-range 3D Tracking for any video	Tuan Duc Ngo et.al.	2410.24211	null
2024-10-31	Length-Induced Embedding Collapse in Transformer-based Models	Yuqi Zhou et.al.	2410.24200	null
2024-11-01	SelfCodeAlign: Self-Alignment for Code Generation	Yuxiang Wei et.al.	2410.24198	link
2024-10-31	Constraint Back-translation Improves Complex Instruction Following of Large Language Models	Yunjia Qi et.al.	2410.24175	null
2024-10-31	Thought Space Explorer: Navigating and Expanding Thought Space for Large Language Model Reasoning	Jinghan Zhang et.al.	2410.24155	null
2024-10-31	Language-Driven Policy Distillation for Cooperative Driving in Multi-Agent Reinforcement Learning	Jiaqi Liu et.al.	2410.24152	null
2024-10-31	Leveraging Large Language Models for Code Translation and Software Development in Scientific Computing	Akash Dhruv et.al.	2410.24119	link
2024-10-31	Repository-Level Compositional Code Translation and Validation	Ali Reza Ibrahimzada et.al.	2410.24117	null
2024-10-31	Matchmaker: Self-Improving Large Language Model Programs for Schema Matching	Nabeel Seedat et.al.	2410.24105	null
2024-10-31	Desert Camels and Oil Sheikhs: Arab-Centric Red Teaming of Frontier LLMs	Muhammed Saeed et.al.	2410.24049	null
2024-10-30	EMMA: End-to-End Multimodal Model for Autonomous Driving	Jyh-Jing Hwang et.al.	2410.23262	null
2024-10-30	Evaluating Cultural and Social Awareness of LLM Web Agents	Haoyi Qiu et.al.	2410.23252	null
2024-10-30	Carrot and Stick: Eliciting Comparison Data and Beyond	Yiling Chen et.al.	2410.23243	null
2024-10-30	A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment	Matteo G. Mecattaf et.al.	2410.23242	null
2024-10-30	EMOTION: Expressive Motion Sequence Generation for Humanoid Robots with In-Context Learning	Peide Huang et.al.	2410.23234	null
2024-10-31	Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval	Sheryl Hsu et.al.	2410.23214	null
2024-10-30	ProTransformer: Robustify Transformers via Plug-and-Play Paradigm	Zhichao Hou et.al.	2410.23182	null
2024-10-30	ReasoningRec: Bridging Personalized Recommendations and Human-Interpretable Explanations through LLM Reasoning	Millennium Bismay et.al.	2410.23180	link
2024-10-30	SciPIP: An LLM-based Scientific Paper Idea Proposer	Wenxiao Wang et.al.	2410.23166	null
2024-10-30	Real-Time Personalization for LLM-based Recommendation with Customized In-Context Learning	Keqin Bao et.al.	2410.23136	link
2024-10-29	Enhancing Code Annotation Reliability: Generative AI's Role in Comment Quality Assessment Models	Seetharam Killivalavan et.al.	2410.22323	null
2024-10-29	Online Detecting LLM-Generated Texts via Sequential Hypothesis Testing by Betting	Can Chen et.al.	2410.22318	link
2024-10-29	Natural Language Inference Improves Compositionality in Vision-Language Models	Paola Cascante-Bonilla et.al.	2410.22315	null
2024-10-30	GPT-4o reads the mind in the eyes	James W. A. Strachan et.al.	2410.22309	null
2024-10-29	SVIP: Towards Verifiable Inference of Open-source Large Language Models	Yifan Sun et.al.	2410.22307	null
2024-10-29	Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning	Yihe Deng et.al.	2410.22304	null
2024-10-29	LLMs are Highly-Constrained Biophysical Sequence Optimizers	Angelica Chen et.al.	2410.22296	null
2024-10-29	Fine-Tuning LLMs for Code Mutation: A New Era of Cyber Threats	Mohammad Setak et.al.	2410.22293	null
2024-10-29	Embedding-based classifiers can detect prompt injection attacks	Md. Ahsan Ayub et.al.	2410.22284	link
2024-10-29	Whose ChatGPT? Unveiling Real-World Educational Inequalities Introduced by Large Language Models	Renzhe Yu et.al.	2410.22282	null
2024-10-28	Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics	Yaniv Nikankin et.al.	2410.21272	null
2024-10-28	LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior	Hanyu Wang et.al.	2410.21264	null
2024-10-28	LongReward: Improving Long-context Large Language Models with AI Feedback	Jiajie Zhang et.al.	2410.21252	null
2024-10-28	Zero-Shot Dense Retrieval with Embeddings from Relevance Feedback	Nour Jedidi et.al.	2410.21242	null
2024-10-28	Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce	Zhantao Yang et.al.	2410.21237	null
2024-10-28	Flaming-hot Initiation with Regular Execution Sampling for Large Language Models	Weizhe Chen et.al.	2410.21236	null
2024-10-28	LoRA vs Full Fine-tuning: An Illusion of Equivalence	Reece Shuttleworth et.al.	2410.21228	null
2024-10-28	Lifting the Veil on the Large Language Model Supply Chain: Composition, Risks, and Mitigations	Kaifeng Huang et.al.	2410.21218	null
2024-10-28	BongLLaMA: LLaMA for Bangla Language	Abdullah Khan Zehady et.al.	2410.21200	null
2024-10-29	Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction	Qintong Zhang et.al.	2410.21169	null
2024-10-25	The Potential and Value of AI Chatbot in Personalized Cognitive Training	Zilong Wang et.al.	2410.19733	null
2024-10-25	Counting Ability of Large Language Models and Impact of Tokenization	Xiang Zhang et.al.	2410.19730	null
2024-10-25	FISHNET: Financial Intelligence from Sub-querying, Harmonizing, Neural-Conditioning, Expert Swarms, and Task Planning	Nicole Cho et.al.	2410.19727	null
2024-10-25	2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision	Shilong Li et.al.	2410.19720	null
2024-10-25	TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning	Xiangyu Zeng et.al.	2410.19702	null
2024-10-25	IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation	Kaixian Qu et.al.	2410.19697	null
2024-10-25	Less is More: Extreme Gradient Boost Rank-1 Adaption for Efficient Finetuning of LLMs	Yifei Zhang et.al.	2410.19694	null
2024-10-25	APRICOT: Active Preference Learning and Constraint-Aware Task Planning with LLMs	Huaxiaoyue Wang et.al.	2410.19656	null
2024-10-25	Take Caution in Using LLMs as Human Surrogates: Scylla Ex Machina	Yuan Gao et.al.	2410.19599	null
2024-10-25	Diverse Sign Language Translation	Xin Shen et.al.	2410.19586	null
2024-10-24	Unbounded: A Generative Infinite Game of Character Life Simulation	Jialu Li et.al.	2410.18975	null
2024-10-24	Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms	Zhangheng Li et.al.	2410.18967	null
2024-10-24	Does Data Contamination Detection Work (Well) for LLMs? A Survey and Evaluation on Detection Assumptions	Yujuan Fu et.al.	2410.18966	null
2024-10-24	OSCAR: Operating System Control via State-Aware Reasoning and Re-Planning	Xiaoqiang Wang et.al.	2410.18963	null
2024-10-24	Bridge-Coder: Unlocking LLMs' Potential to Overcome Language Gaps in Low-Resource Code	Jipeng Zhang et.al.	2410.18957	null
2024-10-24	BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning	Yujuan Velvin Fu et.al.	2410.18955	null
2024-10-24	Dynamic Vocabulary Pruning in Early-Exit LLMs	Jort Vincenti et.al.	2410.18952	link
2024-10-24	SafeBench: A Safety Evaluation Framework for Multimodal Large Language Models	Zonghao Ying et.al.	2410.18927	null
2024-10-24	From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems	A M Muntasir Rahman et.al.	2410.18921	null
2024-10-25	A Survey on Speech Large Language Models	Jing Peng et.al.	2410.18908	null
2024-10-23	TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts	Yuxuan Xie et.al.	2410.18071	null
2024-10-23	LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering	Qingfei Zhao et.al.	2410.18050	link
2024-10-23	Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases	Anna Glazkova et.al.	2410.18040	null
2024-10-23	MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning	Jingfan Zhang et.al.	2410.18035	null
2024-10-23	GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration	Xin Li et.al.	2410.18032	link
2024-10-23	MiniFed : Integrating LLM-based Agentic-Workflow for Simulating FOMC Meeting	Sungil Seok et.al.	2410.18012	null
2024-10-23	ExpertFlow: Optimized Expert Activation and Token Allocation for Efficient Mixture-of-Experts Inference	Xin He et.al.	2410.17954	null
2024-10-23	SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains	Ran Xu et.al.	2410.17952	null
2024-10-23	Benchmarking Floworks against OpenAI & Anthropic: A Novel Framework for Enhanced LLM Function Calling	Nirav Bhan et.al.	2410.17950	null
2024-10-23	Guide for Defense (G4D): Dynamic Guidance for Robust and Balanced Defense in Large Language Models	He Cao et.al.	2410.17922	null
2024-10-22	Large Language Models Empowered Personalized Web Agents	Hongru Cai et.al.	2410.17236	null
2024-10-22	Automated Spinal MRI Labelling from Reports Using a Large Language Model	Robin Y. Park et.al.	2410.17235	link
2024-10-22	Fine-Tuning Large Language Models to Appropriately Abstain with Semantic Entropy	Benedict Aaron Tjandra et.al.	2410.17234	null
2024-10-22	Few-shot In-Context Preference Learning Using Large Language Models	Chao Yu et.al.	2410.17233	null
2024-10-22	Context-aware Prompt Tuning: Advancing In-Context Learning with Adversarial Methods	Tsachi Blau et.al.	2410.17222	null
2024-10-22	Exploring Possibilities of AI-Powered Legal Assistance in Bangladesh through Large Language Modeling	Azmine Toushik Wasi et.al.	2410.17210	link
2024-10-22	VoiceBench: Benchmarking LLM-Based Voice Assistants	Yiming Chen et.al.	2410.17196	link
2024-10-23	Non-myopic Generation of Language Model for Reasoning and Planning	Chang Ma et.al.	2410.17195	null
2024-10-22	From Attention to Activation: Unravelling the Enigmas of Large Language Models	Prannay Kaul et.al.	2410.17174	null
2024-10-22	Improving Pinterest Search Relevance Using Large Language Models	Han Wang et.al.	2410.17152	null
2024-10-21	Reflection-Bench: probing AI intelligence with reflection	Lingyu Li et.al.	2410.16270	link
2024-10-22	Mini-InternVL: A Flexible-Transfer Pocket Multimodal Model with 5% Parameters and 90% Performance	Zhangwei Gao et.al.	2410.16261	link
2024-10-21	Elucidating the design space of language models for image generation	Xuantong Liu et.al.	2410.16257	null
2024-10-21	CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution	Maosong Cao et.al.	2410.16256	link
2024-10-21	Can Knowledge Editing Really Correct Hallucinations?	Baixiang Huang et.al.	2410.16251	link
2024-10-21	Analyzing Context Contributions in LLM-based Machine Translation	Emmanouil Zaranis et.al.	2410.16246	null
2024-10-21	MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report	Samrajya Thapa et.al.	2410.16239	link
2024-10-21	IBGP: Imperfect Byzantine Generals Problem for Zero-Shot Robustness in Communicative Multi-Agent Systems	Yihuan Mao et.al.	2410.16237	null
2024-10-21	LLaVA-KD: A Framework of Distilling Multimodal Large Language Models	Yuxuan Cai et.al.	2410.16236	null
2024-10-21	ToW: Thoughts of Words Improve Reasoning in Large Language Models	Zhikun Xu et.al.	2410.16235	null
2024-10-18	Are AI Detectors Good Enough? A Survey on Quality of Datasets With Machine-Generated Texts	German Gritsai et.al.	2410.14677	null
2024-10-18	SudoLM: Learning Access Control of Parametric Knowledge with Authorization Alignment	Qin Liu et.al.	2410.14676	null
2024-10-18	Enhancing Large Language Models' Situated Faithfulness to External Contexts	Yukun Huang et.al.	2410.14675	link
2024-10-18	Decomposing The Dark Matter of Sparse Autoencoders	Joshua Engels et.al.	2410.14670	link
2024-10-18	MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps	Xiongtao Zhou et.al.	2410.14668	link
2024-10-18	A Large Language Model-Driven Reward Design Framework via Dynamic Feedback for Reinforcement Learning	Shengjie Sun et.al.	2410.14660	null
2024-10-18	EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search	Oliver Sieberling et.al.	2410.14649	null
2024-10-18	Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs	Runchu Tian et.al.	2410.14641	link
2024-10-18	GenEOL: Harnessing the Generative Power of LLMs for Training-Free Sentence Embeddings	Raghuveer Thirukovalluru et.al.	2410.14635	null
2024-10-18	DiSCo Meets LLMs: A Unified Approach for Sparse Retrieval and Contextual Distillation in Conversational Search	Simon Lupart et.al.	2410.14609	null
2024-10-17	Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens	Lijie Fan et.al.	2410.13863	null
2024-10-17	PUMA: Empowering Unified MLLM with Multi-granular Visual Generation	Rongyao Fang et.al.	2410.13861	link
2024-10-17	$γ-$ MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models	Yaxin Luo et.al.	2410.13859	null
2024-10-17	How Numerical Precision Affects Mathematical Reasoning Capabilities of LLMs	Guhao Feng et.al.	2410.13857	null
2024-10-17	Can MLLMs Understand the Deep Implication Behind Chinese Images?	Chenhao Zhang et.al.	2410.13854	link
2024-10-17	Retrospective Learning from Interactions	Zizhao Chen et.al.	2410.13852	null
2024-10-17	SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction	Xuan Zhang et.al.	2410.13846	link
2024-10-17	Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs	Tianyu Guo et.al.	2410.13835	null
2024-10-17	AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents	Ke Yang et.al.	2410.13825	null
2024-10-18	Harnessing Webpage UIs for Text-Rich Visual Understanding	Junpeng Liu et.al.	2410.13824	null
2024-10-16	Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception	Jihao Zhao et.al.	2410.12788	null
2024-10-16	In-Context Learning Enables Robot Action Prediction in LLMs	Yida Yin et.al.	2410.12782	null
2024-10-16	Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats	Chen Ziwen et.al.	2410.12781	null
2024-10-16	Identifying Task Groupings for Multi-Task Learning Using Pointwise V-Usable Information	Yingya Li et.al.	2410.12774	null
2024-10-16	StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples	Ajay Patel et.al.	2410.12757	null
2024-10-17	CREAM: Consistency Regularized Self-Rewarding Language Models	Zhaoyang Wang et.al.	2410.12735	null
2024-10-16	FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression	Zhenheng Tang et.al.	2410.12707	null
2024-10-16	Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization	Xingqi Wang et.al.	2410.12700	link
2024-10-17	Automatic Mapping of Anatomical Landmarks from Free-Text Using Large Language Models: Insights from Llama-2	Mohamad Abdi et.al.	2410.12686	null
2024-10-16	Evaluating Morphological Compositional Generalization in Large Language Models	Mete Ismayilzada et.al.	2410.12656	null
2024-10-15	GaVaMoE: Gaussian-Variational Gated Mixture of Experts for Explainable Recommendation	Fei Tang et.al.	2410.11841	null
2024-10-15	MMFuser: Multimodal Multi-Layer Feature Fuser for Fine-Grained Vision-Language Understanding	Yue Cao et.al.	2410.11829	link
2024-10-15	SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing	Zhiyuan Zhang et.al.	2410.11815	null
2024-10-15	NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models	Han Han et.al.	2410.11805	null
2024-10-15	FoundTS: Comprehensive and Unified Benchmarking of Foundation Models for Time Series Forecasting	Zhe Li et.al.	2410.11802	null
2024-10-15	Selection-p: Self-Supervised Task-Agnostic Prompt Compression for Faithfulness and Transferability	Tsz Ting Chung et.al.	2410.11786	null
2024-10-15	G-Designer: Architecting Multi-agent Communication Topologies via Graph Neural Networks	Guibin Zhang et.al.	2410.11782	null
2024-10-15	Language Models Encode Numbers Using Digit Representations in Base 10	Amit Arnold Levy et.al.	2410.11781	null
2024-10-15	MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation	Chenxi Wang et.al.	2410.11779	link
2024-10-15	Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models	Kai Yao et.al.	2410.11772	link
2024-10-14	DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads	Guangxuan Xiao et.al.	2410.10819	link
2024-10-14	Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free	Ziyue Li et.al.	2410.10814	null
2024-10-14	LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory	Di Wu et.al.	2410.10813	link
2024-10-14	Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning	Aakanksha et.al.	2410.10801	null
2024-10-15	MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling	Jian Yang et.al.	2410.10798	null
2024-10-14	Context-Parametric Inversion: Why Instruction Finetuning May Not Actually Improve Context Reliance	Sachin Goyal et.al.	2410.10796	link
2024-10-14	Focused ReAct: Improving ReAct through Reiterate and Early Stop	Shuoqiu Li et.al.	2410.10779	null
2024-10-14	AFlow: Automating Agentic Workflow Generation	Jiayi Zhang et.al.	2410.10762	link
2024-10-14	Denial-of-Service Poisoning Attacks against Large Language Models	Kuofeng Gao et.al.	2410.10760	link
2024-10-14	SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization	Akrit Mudvari et.al.	2410.10759	null
2024-10-11	AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation	Zijun Wang et.al.	2410.09040	link
2024-10-11	Semi-Supervised Learning of Noisy Mixture of Experts Models	Oh-Ran Kwon et.al.	2410.09039	null
2024-10-11	SimpleStrat: Diversifying Language Model Generation with Stratification	Justin Wong et.al.	2410.09038	null
2024-10-11	Mentor-KD: Making Small Language Models Better Multi-step Reasoners	Hojae Lee et.al.	2410.09037	link
2024-10-11	PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents	Xiangyu Yin et.al.	2410.09034	null
2024-10-11	The Impact of Visual Information in Chinese Characters: Evaluating Large Models' Ability to Recognize and Utilize Radicals	Xiaofeng Wu et.al.	2410.09013	null
2024-10-11	Software Engineering and Foundation Models: Insights from Industry Blogs Using a Jury of Foundation Models	Hao Li et.al.	2410.09012	null
2024-10-11	SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights	Ling Yang et.al.	2410.09008	link
2024-10-11	From Interaction to Impact: Towards Safer AI Agents Through Understanding and Evaluating UI Operation Impacts	Zhuohao Jerry Zhang et.al.	2410.09006	null
2024-10-11	Hypothesis-only Biases in Large Language Model-Elicited Natural Language Inference	Grace Proebsting et.al.	2410.08996	null
2024-10-10	Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training	Gen Luo et.al.	2410.08202	null
2024-10-10	From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions	Changle Qu et.al.	2410.08197	link
2024-10-10	MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code	Zimu Lu et.al.	2410.08196	link
2024-10-10	GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment	Yuancheng Xu et.al.	2410.08193	null
2024-10-10	Sample then Identify: A General Framework for Risk Control and Assessment in Multimodal Large Language Models	Qingni Wang et.al.	2410.08174	null
2024-10-10	On the Evaluation of Generative Robotic Simulations	Feng Chen et.al.	2410.08172	null
2024-10-10	Agent S: An Open Agentic Framework that Uses Computers Like a Human	Saaket Agashe et.al.	2410.08164	link
2024-10-10	Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning	Amrith Setlur et.al.	2410.08146	null
2024-10-10	Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs	Xiaoyuan Liu et.al.	2410.08145	null
2024-10-10	DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory	Yutong Wang et.al.	2410.08143	link
2024-10-09	Astute RAG: Overcoming Imperfect Retrieval Augmentation and Knowledge Conflicts for Large Language Models	Fei Wang et.al.	2410.07176	null
2024-10-09	Do better language models have crisper vision?	Jona Ruthardt et.al.	2410.07173	null
2024-10-09	Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate	Qidong Huang et.al.	2410.07167	link
2024-10-09	Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making	Manling Li et.al.	2410.07166	link
2024-10-09	Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning	Chongyu Fan et.al.	2410.07163	null
2024-10-09	Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis	Bohan Zeng et.al.	2410.07155	link
2024-10-09	Stuffed Mamba: State Collapse and State Capacity of RNN-Based Long-Context Modeling	Yingfa Chen et.al.	2410.07145	null
2024-10-09	Mental Disorders Detection in the Era of Large Language Models	Gleb Kuzmin et.al.	2410.07129	null
2024-10-09	Personalized Visual Instruction Tuning	Renjie Pi et.al.	2410.07113	null
2024-10-09	I Want to Break Free! Anti-Social Behavior and Persuasion Ability of LLMs in Multi-Agent Settings with Social Hierarchy	Gian Maria Campedelli et.al.	2410.07109	null
2024-10-07	Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models	Fei Wang et.al.	2410.05269	null
2024-10-07	PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs	Mengzhao Chen et.al.	2410.05265	link
2024-10-07	TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles	Qingchen Yu et.al.	2410.05262	link
2024-10-07	Differential Transformer	Tianzhu Ye et.al.	2410.05258	null
2024-10-07	GLEE: A Unified Framework and Benchmark for Language-based Economic Environments	Eilam Shapira et.al.	2410.05254	link
2024-10-07	Causal Micro-Narratives	Mourad Heddaya et.al.	2410.05252	null
2024-10-07	SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe	Yuxin Xiao et.al.	2410.05248	null
2024-10-07	Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents	Boyu Gou et.al.	2410.05243	null
2024-10-07	GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models	Iman Mirzadeh et.al.	2410.05229	null
2024-10-07	Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates	Avanika Narayan et.al.	2410.05224	null
2024-10-04	Enhance Reasoning by Learning from Mistakes: Peer-Review Knowledge Distillation from Multiple Large Language Models	Zhuochun Li et.al.	2410.03663	null
2024-10-04	RAFT: Realistic Attacks to Fool Text Detectors	James Wang et.al.	2410.03658	null
2024-10-04	Aligning LLMs with Individual Preferences via Interaction	Shujin Wu et.al.	2410.03642	link
2024-10-04	Large Language Model Performance Benchmarking on Mobile Platforms: A Thorough Evaluation	Jie Xiao et.al.	2410.03613	null
2024-10-04	TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation	Jonathan Cook et.al.	2410.03608	null
2024-10-04	Efficiently Identifying Watermarked Segments in Mixed-Source Texts	Xuandong Zhao et.al.	2410.03600	null
2024-10-04	Understanding Reasoning in Chain-of-Thought from the Hopfieldian View	Lijie Hu et.al.	2410.03595	null
2024-10-04	Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models	Xin Zou et.al.	2410.03577	null
2024-10-04	Towards Linguistically-Aware and Language-Independent Tokenization for Large Language Models (LLMs)	Abrar Rahman et.al.	2410.03568	null
2024-10-04	Structure-Enhanced Protein Instruction Tuning: Towards General-Purpose Protein Understanding	Wei Wu et.al.	2410.03553	null
2024-10-03	FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models	Zhipei Xu et.al.	2410.02761	null
2024-10-03	Loong: Generating Minute-level Long Videos with Autoregressive Language Models	Yuqing Wang et.al.	2410.02757	null
2024-10-03	SIEVE: General Purpose Data Filtering System Matching GPT-4o Accuracy at 1% the Cost	Jifan Zhang et.al.	2410.02755	null
2024-10-03	Training Language Models on Synthetic Edit Sequences Improves Code Synthesis	Ulyana Piterbarg et.al.	2410.02749	null
2024-10-03	CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text Generation	Han He et.al.	2410.02748	null
2024-10-03	Contrastive Localized Language-Image Pre-Training	Hong-You Chen et.al.	2410.02746	null
2024-10-03	Neutral residues: revisiting adapters for model extension	Franck Signe Talla et.al.	2410.02744	null
2024-10-03	MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions	Yekun Chai et.al.	2410.02743	null
2024-10-03	Grounding Large Language Models In Embodied Environment With Imperfect World Models	Haolan Liu et.al.	2410.02742	null
2024-10-03	Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization	Lei Xu et.al.	2410.02741	null
2024-10-02	Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads	Yuxiang Huang et.al.	2410.01805	link
2024-10-02	Efficient $1$ -bit tensor approximations	Alex W. Neal Riasanovsky et.al.	2410.01799	null
2024-10-02	Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models	Joseph Lee et.al.	2410.01795	link
2024-10-02	When a language model is optimized for reasoning, does it still show embers of autoregression? An analysis of OpenAI o1	R. Thomas McCoy et.al.	2410.01792	null
2024-10-02	Investigating on RLHF methodology	Alexey Kutalev et.al.	2410.01789	null
2024-10-02	OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models	Heng Yang et.al.	2410.01784	link
2024-10-02	Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models	Shayekh Bin Islam et.al.	2410.01782	null
2024-10-02	Trained Transformer Classifiers Generalize and Exhibit Benign Overfitting In-Context	Spencer Frei et.al.	2410.01774	null
2024-10-03	Quantifying Generalization Complexity for Large Language Models	Zhenting Qi et.al.	2410.01769	null
2024-10-03	Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks	Mengzhao Jia et.al.	2410.01744	null
2024-09-30	MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning	Haotian Zhang et.al.	2409.20566	null
2024-09-30	Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos	Md Mohaiminul Islam et.al.	2409.20557	null
2024-09-30	LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation	Ziyao Zhang et.al.	2409.20550	null
2024-09-30	Robi Butler: Remote Multimodal Interactions with Household Robot Assistant	Anxing Xiao et.al.	2409.20548	null
2024-09-30	Uncertainty-Informed Screening for Safer Solvents Used in the Synthesis of Perovskite via Language Models	Arpan Mukherjee et.al.	2409.20512	null
2024-09-30	COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models	Divyanshu Daiya et.al.	2409.20502	null
2024-10-01	Instance-adaptive Zero-shot Chain-of-Thought Prompting	Xiaosong Yuan et.al.	2409.20441	null
2024-09-30	Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation	Shan Chen et.al.	2409.20385	null
2024-09-30	The Perfect Blend: Redefining RLHF with Mixture of Judges	Tengyu Xu et.al.	2409.20370	null
2024-09-30	VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs	Ruotong Liao et.al.	2409.20365	null
2024-09-27	LML: Language Model Learning a Dataset for Data-Augmented Prediction	Praneeth Vadlapati et.al.	2409.18957	link
2024-09-27	Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models	Jiaming Li et.al.	2409.18943	link
2024-09-27	From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding	Heqing Zou et.al.	2409.18938	null
2024-09-27	AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow	Huizi Yu et.al.	2409.18924	null
2024-09-27	Soft Measures for Extracting Causal Collective Intelligence	Maryam Berijanian et.al.	2409.18911	link
2024-09-27	IDGen: Item Discrimination Induced Prompt Generation for LLM Evaluation	Fan Lin et.al.	2409.18892	null
2024-09-27	Predicting and analyzing memorization within fine-tuned Large Language Models	Jérémie Dentan et.al.	2409.18858	null
2024-09-27	Mitigating Selection Bias with Node Pruning and Auxiliary Options	Hyeong Kyu Choi et.al.	2409.18857	null
2024-09-27	LLMs4Synthesis: Leveraging Large Language Models for Scientific Synthesis	Hamed Babaei Giglou et.al.	2409.18812	null
2024-09-27	Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs	Yanyuan Qiao et.al.	2409.18794	null
2024-09-26	EgoLM: Multi-Modal Language Model of Egocentric Motions	Fangzhou Hong et.al.	2409.18127	null
2024-09-26	Multi-View and Multi-Scale Alignment for Contrastive Language-Image Pre-training in Mammography	Yuexi Du et.al.	2409.18119	null
2024-09-26	E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding	Ye Liu et.al.	2409.18111	link
2024-09-26	Infering Alt-text For UI Icons With Large Language Models During App Development	Sabrina Haque et.al.	2409.18060	null
2024-09-26	DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving	Dingrui Wang et.al.	2409.18053	null
2024-09-26	EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions	Kai Chen et.al.	2409.18042	null
2024-09-26	Compositional Hardness of Code in Large Language Models -- A Probabilistic Perspective	Yotam Wolf et.al.	2409.18028	null
2024-09-26	An Adversarial Perspective on Machine Unlearning for AI Safety	Jakub Łucki et.al.	2409.18025	null
2024-09-26	DARE: Diverse Visual Question Answering with Robustness Evaluation	Hannah Sterz et.al.	2409.18023	null
2024-09-26	Role-RL: Online Long-Context Processing with Role Reinforcement Learning for Distinct LLMs in Their Optimal Roles	Lewei He et.al.	2409.18014	null
2024-09-25	Attention Prompting on Image for Large Vision-Language Models	Runpeng Yu et.al.	2409.17143	link
2024-09-25	FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression	Fazal Mittu et.al.	2409.17141	link
2024-09-25	Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents	Junting Lu et.al.	2409.17140	null
2024-09-25	Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale	Fan Zhou et.al.	2409.17115	link
2024-09-25	Accumulator-Aware Post-Training Quantization	Ian Colbert et.al.	2409.17092	null
2024-09-25	VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models	Yifei Liu et.al.	2409.17066	link
2024-09-25	Using LLM for Real-Time Transcription and Summarization of Doctor-Patient Interactions into ePuskesmas in Indonesia	Azmul Asmar Irfan et.al.	2409.17054	null
2024-09-25	How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not	Francesco Verdini et.al.	2409.17044	null
2024-09-25	Counterfactual Token Generation in Large Language Models	Ivi Chatzi et.al.	2409.17027	null
2024-09-25	LLM-CARD: Towards a Description and Landscape of Large Language Models	Shengwei Tian et.al.	2409.17011	null
2024-09-24	LLM Echo Chamber: personalized and automated disinformation	Tony Ma et.al.	2409.16241	link
2024-09-24	Towards Enhancing Linked Data Retrieval in Conversational UIs using Large Language Models	Omar Mussa et.al.	2409.16220	null
2024-09-24	LLMCount: Enhancing Stationary mmWave Detection with Multimodal-LLM	Boyan Li et.al.	2409.16209	null
2024-09-25	CJEval: A Benchmark for Assessing Large Language Models Using Chinese Junior High School Exam Data	Qian-Wen Zhang et.al.	2409.16202	link
2024-09-24	HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models	Haoran Que et.al.	2409.16191	link
2024-09-24	Cyber Knowledge Completion Using Large Language Models	Braden K Webb et.al.	2409.16176	null
2024-09-24	Merging LoRAs like Playing LEGO: Pushing the Modularity of LoRA to Extremes Through Rank-Wise Clustering	Ziyu Zhao et.al.	2409.16167	null
2024-09-24	Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework	Lu Chen et.al.	2409.16146	null
2024-09-24	MOSS: Enabling Code-Driven Evolution and Context Management for AI Agents	Ming Zhu et.al.	2409.16120	link
2024-09-24	Exploring Hint Generation Approaches in Open-Domain Question Answering	Jamshid Mozafari et.al.	2409.16096	link
2024-09-20	Gender Representation and Bias in Indian Civil Service Mock Interviews	Somonnoy Banerjee et.al.	2409.12194	null
2024-09-18	To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning	Zayne Sprague et.al.	2409.12183	null
2024-09-18	Finetuning Language Models to Emit Linguistic Expressions of Uncertainty	Arslan Chaudhry et.al.	2409.12180	null
2024-09-18	Decoding Style: Efficient Fine-Tuning of LLMs for Image-Guided Outfit Recommendation with Preference	Najmeh Forouzandehmehr et.al.	2409.12150	null
2024-09-18	MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning	Justin Chih-Yao Chen et.al.	2409.12147	link
2024-09-18	MoRAG -- Multi-Fusion Retrieval Augmented Generation for Human Motion	Kalakonda Sai Shashank et.al.	2409.12140	null
2024-09-24	Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models	Sijing Chen et.al.	2409.12139	null
2024-09-18	Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement	An Yang et.al.	2409.12122	null
2024-09-18	Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference	Edresson Casanova et.al.	2409.12117	null
2024-09-18	Measuring Human and AI Values based on Generative Psychometrics with Large Language Models	Haoran Ye et.al.	2409.12106	link
2024-09-17	AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs	Basel Mousi et.al.	2409.11404	null
2024-09-17	NVLM: Open Frontier-Class Multimodal LLMs	Wenliang Dai et.al.	2409.11402	null
2024-09-17	Says Who? Effective Zero-Shot Annotation of Focalization	Rebecca M. M. Hicke et.al.	2409.11390	null
2024-09-17	Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement	Simon Yu et.al.	2409.11378	null
2024-09-17	Towards Time Series Reasoning with LLMs	Winnie Chow et.al.	2409.11376	null
2024-09-17	Multi-OCT-SelfNet: Integrating Self-Supervised Learning with Multi-Source Data Fusion for Enhanced Multi-Class Retinal Disease Classification	Fatema-E- Jannat et.al.	2409.11375	null
2024-09-17	CoCA: Regaining Safety-awareness of Multimodal Large Language Models with Constitutional Calibration	Jiahui Gao et.al.	2409.11365	null
2024-09-17	AI Suggestions Homogenize Writing Toward Western Styles and Diminish Cultural Nuances	Dhruv Agarwal et.al.	2409.11360	null
2024-09-17	THaMES: An End-to-End Tool for Hallucination Mitigation and Evaluation in Large Language Models	Mengfei Liang et.al.	2409.11353	null
2024-09-17	Leveraging Distillation Techniques for Document Understanding: A Case Study with FLAN-T5	Marcel Lamott et.al.	2409.11282	null
2024-09-16	RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval	Di Liu et.al.	2409.10516	null
2024-09-16	Context-aware Code Segmentation for C-to-Rust Translation using Large Language Models	Momoko Shiraishi et.al.	2409.10506	null
2024-09-16	DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction	John Wu et.al.	2409.10504	null
2024-09-16	Causal Language Modeling Can Elicit Search and Reasoning Capabilities on Logic Puzzles	Kulin Shah et.al.	2409.10502	null
2024-09-16	Code Vulnerability Detection: A Comparative Analysis of Emerging Large Language Models	Shaznin Sultana et.al.	2409.10490	null
2024-09-16	XLM for Autonomous Driving Systems: A Comprehensive Review	Sonda Fourati et.al.	2409.10484	null
2024-09-17	Schrodinger's Memory: Large Language Models	Wei Wang et.al.	2409.10482	null
2024-09-16	LLM as BT-Planner: Leveraging LLMs for Behavior Tree Generation in Robot Task Planning	Jicong Ao et.al.	2409.10444	null
2024-09-16	A Large-Scale Privacy Assessment of Android Third-Party SDKs	Mark Huasong Meng et.al.	2409.10411	null
2024-09-17	Learnings from a Large-Scale Deployment of an LLM-Powered Expert-in-the-Loop Healthcare Chatbot	Bhuvan Sachdeva et.al.	2409.10354	null
2024-09-13	Agents in Software Engineering: Survey, Landscape, and Vision	Yanxian Huang et.al.	2409.09030	link
2024-09-13	Contri(e)ve: Context + Retrieve for Scholarly Question Answering	Kanchan Shivashankar et.al.	2409.09010	null
2024-09-13	Safeguarding Decentralized Social Media: LLM Agents for Automating Community Rule Compliance	Lucio La Cava et.al.	2409.08963	null
2024-09-13	Emerging Reliance Behaviors in Human-AI Text Generation: Hallucinations, Data Quality Assessment, and Cognitive Forcing Functions	Zahra Ashktorab et.al.	2409.08937	null
2024-09-13	SynSUM -- Synthetic Benchmark with Structured and Unstructured Medical Records	Paloma Rabaey et.al.	2409.08936	link
2024-09-13	LLM-based Weak Supervision Framework for Query Intent Classification in Video Search	Farnoosh Javadi et.al.	2409.08931	null
2024-09-13	AnyBipe: An End-to-End Framework for Training and Deploying Bipedal Robots Guided by Large Language Models	Yifei Yao et.al.	2409.08904	null
2024-09-13	A Market for Lemons? Strategic Directions for a Vigilant Application of Artificial Intelligence in Entrepreneurship Research	Martin Obschonka et.al.	2409.08890	null
2024-09-13	Exploring Graph Structure Comprehension Ability of Multimodal Large Language Models: Case Studies	Zhiqiang Zhong et.al.	2409.08864	null
2024-09-13	FP-VEC: Fingerprinting Large Language Models via Efficient Vector Addition	Zhenhua Xu et.al.	2409.08846	null
2024-09-12	Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale	Rogerio Bonatti et.al.	2409.08264	link
2024-09-12	OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering	Jiahao Nick Li et.al.	2409.08250	null
2024-09-12	Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources	Alisia Lupidi et.al.	2409.08239	null
2024-09-12	LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems	Hakan T. Otal et.al.	2409.08234	link
2024-09-12	What Makes a Maze Look Like a Maze?	Joy Hsu et.al.	2409.08202	null
2024-09-12	Fine-tuning Large Language Models for Entity Matching	Aaron Steiner et.al.	2409.08185	link
2024-09-12	Faster Speech-LLaMA Inference with Multi-token Prediction	Desh Raj et.al.	2409.08148	null
2024-09-12	LLM-POTUS Score: A Framework of Analyzing Presidential Debates with Large Language Models	Zhengliang Liu et.al.	2409.08147	null
2024-09-12	The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal	Huiyuan Xie et.al.	2409.08098	null
2024-09-12	Securing Large Language Models: Addressing Bias, Misinformation, and Prompt Attacks	Benji Peng et.al.	2409.08087	null
2024-09-11	"My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays	Shengxin Hong et.al.	2409.07453	null
2024-09-11	SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories	Ben Bogin et.al.	2409.07440	link
2024-09-11	CLNX: Bridging Code and Natural Language for C/C++ Vulnerability-Contributing Commits Identification	Zeqing Qin et.al.	2409.07407	null
2024-09-11	AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge	Han Wang et.al.	2409.07394	link
2024-09-11	Demo: SGCode: A Flexible Prompt-Optimizing System for Secure Generation of Code	Khiem Ton et.al.	2409.07368	null
2024-09-11	Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation	SeongYeub Chu et.al.	2409.07355	link
2024-09-11	Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering	Weixi Weng et.al.	2409.07331	null
2024-09-11	MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications	Praveen K Kanithi et.al.	2409.07314	null
2024-09-11	STORE: Streamlining Semantic Tokenization and Generative Recommendation with A Single LLM	Qijiong Liu et.al.	2409.07276	null
2024-09-11	MiniDrive: More Efficient Vision-Language Models with Multi-Level 2D Features as Text Tokens for Autonomous Driving	Enming Zhang et.al.	2409.07267	link
2024-09-10	E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning	Zihan Liao et.al.	2409.06679	null
2024-09-10	LLaMA-Omni: Seamless Speech Interaction with Large Language Models	Qingkai Fang et.al.	2409.06666	link
2024-09-10	Human Perception of LLM-generated Text Content in Social Media Environments	Kristina Radivojevic et.al.	2409.06653	null
2024-09-10	Optimal Workload Placement on Multi-Instance GPUs	Bekir Turkkan et.al.	2409.06646	null
2024-09-10	MoWE-Audio: Multitask AudioLLMs with Mixture of Weak Encoders	Wenyu Zhang et.al.	2409.06635	null
2024-09-10	A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio	Ningyuan Xi et.al.	2409.06624	null
2024-09-10	Alleviating Hallucinations in Large Language Models with Scepticism Modeling	Yetao Wu et.al.	2409.06601	null
2024-09-10	GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering	Sacha Muller et.al.	2409.06595	null
2024-09-10	MAPS: Energy-Reliability Tradeoff Management in Autonomous Vehicles Through LLMs Penetrated Science	Mahdieh Aliazam et.al.	2409.06558	null
2024-09-10	Questioning Internal Knowledge Structure of Large Language Models Through the Lens of the Olympic Games	Juhwan Choi et.al.	2409.06518	null
2024-09-09	MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct	Run Luo et.al.	2409.05840	null
2024-09-09	Are Large Language Models a Threat to Programming Platforms? An Exploratory Study	Md Mustakim Billah et.al.	2409.05824	null
2024-09-09	GASP: Gaussian Splatting for Physic-Based Simulations	Piotr Borycki et.al.	2409.05819	null
2024-09-09	Benchmarking Chinese Knowledge Rectification in Large Language Models	Tianhe Lu et.al.	2409.05806	link
2024-09-09	Evidence from fMRI Supports a Two-Phase Abstraction Process in Language Models	Emily Cheng et.al.	2409.05771	null
2024-09-09	Model Input Verification of Large Scale Simulations	Rumyana Neykova et.al.	2409.05768	null
2024-09-09	A Novel Idea Generation Tool using a Structured Conversational AI (CAI) System	B. Sankar et.al.	2409.05747	null
2024-09-09	LLMs Will Always Hallucinate, and We Need to Live With This	Sourav Banerjee et.al.	2409.05746	null
2024-09-09	A System and Benchmark for LLM-based Q&A on Heterogeneous Data	Achille Fokoue et.al.	2409.05735	null
2024-09-09	Towards Democratizing Multilingual Large Language Models For Medicine Through A Two-Stage Instruction Fine-tuning Approach	Meng Zhou et.al.	2409.05732	null
2024-09-06	RLPF: Reinforcement Learning from Prediction Feedback for User Summarization with LLMs	Jiaxing Wu et.al.	2409.04421	null
2024-09-06	Question-Answering Dense Video Events	Hangyu Qin et.al.	2409.04388	null
2024-09-06	Learning vs Retrieval: The Role of In-Context Examples in Regression with LLMs	Aliakbar Nafar et.al.	2409.04318	null
2024-09-06	An optically accelerated extreme learning machine using hot atomic vapors	Pierre Azam et.al.	2409.04312	null
2024-09-06	Using Large Language Models to Generate Authentic Multi-agent Knowledge Work Datasets	Desiree Heim et.al.	2409.04286	null
2024-09-06	Advancing Automated Knowledge Transfer in Evolutionary Multitasking via Large Language Models	Yuxiao Huang et.al.	2409.04270	null
2024-09-06	GALLa: Graph Aligned Large Language Models for Improved Source Code Understanding	Ziyin Zhang et.al.	2409.04183	null
2024-09-06	Combining LLMs and Knowledge Graphs to Reduce Hallucinations in Question Answering	Larissa Pusch et.al.	2409.04181	null
2024-09-06	From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks	Andreas Stephan et.al.	2409.04168	null
2024-09-06	Can OpenSource beat ChatGPT? -- A Comparative Study of Large Language Models for Text-to-Code Generation	Luis Mayer et.al.	2409.04164	null
2024-09-05	Attention Heads of Large Language Models: A Survey	Zifan Zheng et.al.	2409.03752	link
2024-09-05	LLM-CI: Assessing Contextual Integrity Norms in Language Models	Yan Shvartzshnaider et.al.	2409.03735	null
2024-09-05	Safety vs. Performance: How Multi-Objective Learning Reduces Barriers to Market Entry	Meena Jagadeesan et.al.	2409.03734	null
2024-09-05	Planning In Natural Language Improves LLM Search For Code Generation	Evan Wang et.al.	2409.03733	null
2024-09-06	RAG based Question-Answering for Contextual Response Prediction System	Sriram Veturi et.al.	2409.03708	null
2024-09-05	TRACE-cs: Trustworthy Reasoning for Contrastive Explanations in Course Scheduling Problems	Stylianos Loukas Vasileiou et.al.	2409.03671	null
2024-09-05	A Fused Large Language Model for Predicting Startup Success	Abdurahman Maarouf et.al.	2409.03668	null
2024-09-05	The representation landscape of few-shot learning and fine-tuning in large language models	Diego Doimo et.al.	2409.03662	link
2024-09-06	LLM-based multi-agent poetry generation in non-cooperative environments	Ran Zhang et.al.	2409.03659	link
2024-09-05	From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents	Jifan Yu et.al.	2409.03512	null
2024-09-04	RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version)	Yao Mu et.al.	2409.02920	null
2024-09-04	LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA	jiajie Zhang et.al.	2409.02897	null
2024-09-04	LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture	Xidong Wang et.al.	2409.02889	link
2024-09-04	Historical German Text Normalization Using Type- and Token-Based Language Modeling	Anton Ehrmanntraut et.al.	2409.02841	null
2024-09-04	Exploring Sentiment Dynamics and Predictive Behaviors in Cryptocurrency Discussions by Few-Shot Learning with Large Language Models	Moein Shahiki Tash et.al.	2409.02836	null
2024-09-04	CMM-Math: A Chinese Multimodal Math Dataset To Evaluate and Enhance the Mathematics Reasoning of Large Multimodal Models	Wentao Liu et.al.	2409.02834	null
2024-09-04	ExpLLM: Towards Chain of Thought for Facial Expression Recognition	Xing Lan et.al.	2409.02828	null
2024-09-04	Design Contradictions: Help or Hindrance?	Aron E. Owen et.al.	2409.02823	null
2024-09-04	Language Understanding as a Constraint on Consensus Size in LLM Societies	Giordano De Marzo et.al.	2409.02822	null
2024-09-04	Towards a Unified View of Preference Learning for Large Language Models: A Survey	Bofei Gao et.al.	2409.02795	null
2024-08-30	SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists	Raoyuan Zhao et.al.	2408.17437	link
2024-08-30	Advancing Multi-talker ASR Performance with Large Language Models	Mohan Shi et.al.	2408.17431	null
2024-08-30	Getting Inspiration for Feature Elicitation: App Store- vs. LLM-based Approach	Jialiang Wei et.al.	2408.17404	null
2024-08-30	NDP: Next Distribution Prediction as a More Broad Target	Junhao Ruan et.al.	2408.17377	null
2024-08-30	Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain	Francesca Grasso et.al.	2408.17362	link
2024-08-30	Forget to Flourish: Leveraging Machine-Unlearning on Pretrained Language Models for Privacy Leakage	Md Rafi Ur Rashid et.al.	2408.17354	null
2024-08-30	Bridging Domain Knowledge and Process Discovery Using Large Language Models	Ali Norouzifar et.al.	2408.17316	link
2024-08-30	Flexible and Effective Mixing of Large Language Models into a Mixture of Domain Experts	Rhui Dih Lee et.al.	2408.17280	null
2024-08-30	Joint Estimation and Prediction of City-wide Delivery Demand: A Large Language Model Empowered Graph-based Learning Approach	Tong Nie et.al.	2408.17258	null
2024-08-30	VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters	Mouxiang Chen et.al.	2408.17253	link
2024-08-29	How Far Can Cantonese NLP Go? Benchmarking Cantonese Capabilities of Large Language Models	Jiyue Jiang et.al.	2408.16756	null
2024-08-29	Reinforcement Learning without Human Feedback for Last Mile Fine-Tuning of Large Language Models	Alec Solway et.al.	2408.16753	null
2024-08-29	Assessing Large Language Models for Online Extremism Research: Identification, Explanation, and New Knowledge	Beidi Dong et.al.	2408.16749	null
2024-08-29	Theoretical and Methodological Framework for Studying Texts Produced by Large Language Models	Jiří Milička et.al.	2408.16740	null
2024-08-29	GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models	Moreno D'Incà et.al.	2408.16700	link
2024-08-29	Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity	Ziniu Li et.al.	2408.16673	null
2024-08-29	Towards Efficient Modelling of String Dynamics: A Comparison of State Space and Koopman based Deep Learning Methods	Rodrigo Diaz et.al.	2408.16650	null
2024-08-29	Examination of Code generated by Large Language Models	Robin Beer et.al.	2408.16601	link
2024-08-29	Enhancing Dialogue Generation in Werewolf Game Through Situation Analysis and Persuasion Strategies	Zhiyang Qi et.al.	2408.16586	null
2024-08-29	CNIMA: A Universal Evaluation Framework and Automated Approach for Assessing Second Language Dialogues	Rena Gao et.al.	2408.16518	null
2024-08-28	Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders	Min Shi et.al.	2408.15998	link
2024-08-28	BattleAgentBench: A Benchmark for Evaluating Cooperation and Competition Capabilities of Language Models in Multi-Agent Systems	Wei Wang et.al.	2408.15971	null
2024-08-28	More Text, Less Point: Towards 3D Data-Efficient Point-Language Understanding	Yuan Tang et.al.	2408.15966	null
2024-08-28	Atari-GPT: Investigating the Capabilities of Multimodal Large Language Models as Low-Level Policies for Atari Games	Nicholas R. Waytowich et.al.	2408.15950	null
2024-08-28	Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models	Yuncheng Yang et.al.	2408.15915	null
2024-08-28	Decentralized LLM Inference over Edge Networks with Energy Harvesting	Aria Khoshsirat et.al.	2408.15907	null
2024-08-28	LLM-Based Multi-Hop Question Answering with Knowledge Graph Integration in Evolving Environments	Ruirui Chen et.al.	2408.15903	null
2024-08-28	Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts	Nikolas Gritsch et.al.	2408.15901	null
2024-08-28	Bias in LLMs as Annotators: The Effect of Party Cues on Labelling Decision by Large Language Models	Sebastian Vallejo Vera et.al.	2408.15895	null
2024-08-28	Persuasion Games using Large Language Models	Ganesh Prasath Ramani et.al.	2408.15879	null
2024-08-27	Generative Verifiers: Reward Modeling as Next-Token Prediction	Lunjun Zhang et.al.	2408.15240	null
2024-08-27	LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet	Nathaniel Li et.al.	2408.15221	null
2024-08-27	Investigating Coverage Criteria in Large Language Models: An In-Depth Study Through Jailbreak Attacks	Shide Zhou et.al.	2408.15207	null
2024-08-27	Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation	Jian Hu et.al.	2408.15205	null
2024-08-27	Can Unconfident LLM Annotations Be Used for Confident Conclusions?	Kristina Gligorić et.al.	2408.15204	null
2024-08-27	Unlocking Potential in Pre-Trained Music Language Models for Versatile Multi-Track Music Arrangement	Longshen Ou et.al.	2408.15176	null
2024-08-27	X-Reflect: Cross-Reflection Prompting for Multimodal Recommendation	Hanjia Lyu et.al.	2408.15172	null
2024-08-27	Measuring text summarization factuality using atomic facts entailment metrics in the context of retrieval augmented generation	N. E. Kriman et.al.	2408.15171	null
2024-08-27	BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline	Guosheng Dong et.al.	2408.15079	null
2024-08-27	Constraining Participation: Affordances of Feedback Features in Interfaces to Large Language Models	Ned Cooper et.al.	2408.15066	null
2024-08-27	Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models	Aradhye Agarwal et.al.	2408.14470	link
2024-08-26	Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos	Qirui Chen et.al.	2408.14469	null
2024-08-26	Explicit Inductive Inference using Large Language Models	Tianyang Liu et.al.	2408.14467	null
2024-08-26	Evaluating Large Language Models on Spatial Tasks: A Multi-Task Benchmarking Study	Liuchang Xu Shuo Zhao et.al.	2408.14438	null
2024-08-26	CHARTOM: A Visual Theory-of-Mind Benchmark for Multimodal Large Language Models	Shubham Bharti et.al.	2408.14419	null
2024-08-26	MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues	Kuluhan Binici et.al.	2408.14418	null
2024-08-26	Language-specific Calibration for Pruning Multilingual Language Models	Simon Kurz et.al.	2408.14398	null
2024-08-26	Reprogramming Foundational Large Language Models(LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning	Sakhinana Sagar Srinivas et.al.	2408.14387	null
2024-08-26	Probing Causality Manipulation of Large Language Models	Chenyang Zhang et.al.	2408.14380	link
2024-08-26	SWE-bench-java: A GitHub Issue Resolving Benchmark for Java	Daoguang Zan et.al.	2408.14354	link
2024-08-23	MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?	Yi-Fan Zhang et.al.	2408.13257	null
2024-08-23	Domain-specific long text classification from sparse relevant information	Célia D'Cruz et.al.	2408.13253	null
2024-08-23	Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time	Yingyu Liang et.al.	2408.13233	null
2024-08-23	EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods	Hongcheng Ding et.al.	2408.13214	null
2024-08-23	DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation	Qiming Zhu et.al.	2408.13204	null
2024-08-23	Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning	Hourui Deng et.al.	2408.13184	null
2024-08-23	IntelliCare: Improving Healthcare Analysis with Variance-Controlled Patient-Level Knowledge from Large Language Models	Zhihao Yu et.al.	2408.13073	null
2024-08-23	Guiding IoT-Based Healthcare Alert Systems with Large Language Models	Yulan Gao et.al.	2408.13071	null
2024-08-23	VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models	Wentao Wu et.al.	2408.13031	link
2024-08-23	In-Context Learning with Reinforcement Learning for Incomplete Utterance Rewriting	Haowei Du et.al.	2408.13028	null
2024-08-22	Controllable Text Generation for Large Language Models: A Survey	Xun Liang et.al.	2408.12599	link
2024-08-22	xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations	Can Qin et.al.	2408.12590	null
2024-08-22	RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment	Xiaohan Wang et.al.	2408.12579	null
2024-08-22	Jamba-1.5: Hybrid Transformer-Mamba Models at Scale	Jamba Team et.al.	2408.12570	null
2024-08-22	ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation	Lujia Zhong et.al.	2408.12561	link
2024-08-22	Towards Evaluating and Building Versatile Large Language Models for Medicine	Chaoyi Wu et.al.	2408.12547	link
2024-08-22	MEDCO: Medical Education Copilots Based on A Multi-Agent Framework	Hao Wei et.al.	2408.12496	null
2024-08-22	GenderCARE: A Comprehensive Framework for Assessing and Reducing Gender Bias in Large Language Models	Kunsheng Tang et.al.	2408.12494	link
2024-08-23	Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese	Khang T. Doan et.al.	2408.12480	null
2024-08-22	Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition	Bozheng Li et.al.	2408.12475	null
2024-08-21	SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs	Yuanyang Yin et.al.	2408.11813	null
2024-08-21	Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models	Yuzhou Huang et.al.	2408.11801	null
2024-08-21	PermitQA: A Benchmark for Retrieval Augmented Generation in Wind Siting and Permitting domain	Rounak Meyur et.al.	2408.11800	null
2024-08-21	EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model	Feipeng Ma et.al.	2408.11795	null
2024-08-21	Leveraging Chemistry Foundation Models to Facilitate Structure Focused Retrieval Augmented Generation in Multi-Agent Workflows for Catalyst and Materials Design	Nathaniel H. Park et.al.	2408.11793	null
2024-08-21	Critique-out-Loud Reward Models	Zachary Ankner et.al.	2408.11791	link
2024-08-21	DreamFactory: Pioneering Multi-Scene Long Video Generation with a Multi-Agent Framework	Zhifei Xie et.al.	2408.11788	null
2024-08-21	Personality Alignment of Large Language Models	Minjun Zhu et.al.	2408.11779	link
2024-08-21	Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards	Omar Erak et.al.	2408.11775	link
2024-08-21	Against All Odds: Overcoming Typology, Script, and Language Confusion in Multilingual Embedding Inversion Attacks	Yiyi Chen et.al.	2408.11749	null
2024-08-20	Revisiting VerilogEval: Newer LLMs, In-Context Learning, and Specification-to-RTL Tasks	Nathaniel Pinckney et.al.	2408.11053	null
2024-08-20	FLAME: Learning to Navigate with Multimodal LLM in Urban Environments	Yunzhe Xu et.al.	2408.11051	link
2024-08-21	MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding	Jian Chen et.al.	2408.11049	null
2024-08-20	Reconciling Methodological Paradigms: Employing Large Language Models as Novice Qualitative Research Assistants in Talent Management Research	Sreyoshi Bhaduri et.al.	2408.11043	null
2024-08-20	Scaling Law with Learning Rate Annealing	Howe Tissue et.al.	2408.11029	null
2024-08-20	Athena: Safe Autonomous Agents with Verbal Contrastive Learning	Tanmana Sadhu et.al.	2408.11021	null
2024-08-20	While GitHub Copilot Excels at Coding, Does It Ensure Responsible Output?	Wen Cheng et.al.	2408.11006	link
2024-08-20	CTP-LLM: Clinical Trial Phase Transition Prediction Using Large Language Models	Michael Reinisch et.al.	2408.10995	null
2024-08-20	Dr.Academy: A Benchmark for Evaluating Questioning Capability in Education for Large Language Models	Yuyan Chen et.al.	2408.10947	null
2024-08-20	Large Language Model Driven Recommendation	Anton Korikov et.al.	2408.10946	null
2024-08-19	Demystifying the Communication Characteristics for Distributed Transformer Models	Quentin Anthony et.al.	2408.10197	null
2024-08-19	SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation Models	Anke Tang et.al.	2408.10174	link
2024-08-19	Customizing Language Models with Instance-wise LoRA for Sequential Recommendation	Xiaoyu Kong et.al.	2408.10159	null
2024-08-19	Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models	Amey Hengle et.al.	2408.10151	null
2024-08-19	In-Context Learning with Representations: Contextual Generalization of Trained Transformers	Tong Yang et.al.	2408.10147	null
2024-08-19	Instruction Finetuning for Leaderboard Generation from Empirical AI Research	Salomon Kabongo et.al.	2408.10141	null
2024-08-19	Molecular Graph Representation Learning Integrating Large Language Models with Domain-specific Small Models	Tianyu Zhang et.al.	2408.10124	link
2024-08-20	PLUTUS: A Well Pre-trained Large Unified Transformer can Unveil Financial Time Series Regularities	Yuanjian Xu et.al.	2408.10111	null
2024-08-19	ARMADA: Attribute-Based Multimodal Data Augmentation	Xiaomeng Jin et.al.	2408.10086	null
2024-08-19	FFAA: Multimodal Large Language Model based Explainable Open-World Face Forgery Analysis Assistant	Zhengchao Huang et.al.	2408.10072	null
2024-08-19	PEDAL: Enhancing Greedy Decoding with Large Language Models using Diverse Exemplars	Sumanth Prabhu et.al.	2408.08869	null
2024-08-16	Visual Agents as Fast and Slow Thinkers	Guangyan Sun et.al.	2408.08862	null
2024-08-16	ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis	Yubao Zhao et.al.	2408.08849	null
2024-08-16	PsychoLex: Unveiling the Psychological Mind of Large Language Models	Mohammad Amin Abbasi et.al.	2408.08848	null
2024-08-16	FLEXTAF: Enhancing Table Reasoning with Flexible Tabular Formats	Xuanliang Zhang et.al.	2408.08841	link
2024-08-16	Artificial Intelligence and Strategic Decision-Making: Evidence from Entrepreneurs and Investors	Felipe A. Csaszar et.al.	2408.08811	null
2024-08-16	Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge	Ravi Raju et.al.	2408.08808	null
2024-08-16	EmoDynamiX: Emotional Support Dialogue Strategy Prediction by Modelling MiXed Emotions and Discourse Dynamics	Chenwei Wan et.al.	2408.08782	link
2024-08-16	Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions	Chenming Tang et.al.	2408.08780	null
2024-08-16	DAC: Decomposed Automation Correction for Text-to-SQL	Dingzirui Wang et.al.	2408.08779	link
2024-08-15	Can Large Language Models Understand Symbolic Graphics Programs?	Zeju Qiu et.al.	2408.08313	null
2024-08-15	ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws	Ruihang Li et.al.	2408.08310	null
2024-08-15	Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors	Usman Syed et.al.	2408.08302	null
2024-08-15	HELP: Hierarchical Embeddings-based Log Parsing	Andy Xu et.al.	2408.08300	null
2024-08-15	The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community	Shachar Don-Yehiya et.al.	2408.08291	null
2024-08-15	Autonomous Behavior Planning For Humanoid Loco-manipulation Through Grounded Language Model	Jin Wang et.al.	2408.08282	null
2024-08-15	BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts	Qizhen Zhang et.al.	2408.08274	null
2024-08-15	DaRec: A Disentangled Alignment Framework for Large Language Model and Recommender System	Xihong Yang et.al.	2408.08231	null
2024-08-15	RED-CT: A Systems Design Methodology for Using LLM-labeled Data to Train and Deploy Edge Classifiers for Computational Social Science	David Farr et.al.	2408.08217	null
2024-08-15	Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models	Javier González et.al.	2408.08210	null
2024-08-14	The Death of Schema Linking? Text-to-SQL in the Age of Well-Reasoned Language Models	Karime Maamari et.al.	2408.07702	null
2024-08-15	Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities	Enneng Yang et.al.	2408.07666	link
2024-08-14	Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models	Yi-Cheng Lin et.al.	2408.07665	null
2024-08-14	Alignment-Enhanced Decoding:Defending via Token-Level Adaptive Refining of Probability Distributions	Quan Liu et.al.	2408.07663	link
2024-08-14	WeKnow-RAG: An Adaptive Approach for Retrieval-Augmented Generation Integrating Web Search and Knowledge Graphs	Weijian Xie et.al.	2408.07611	null
2024-08-14	Transformers and Large Language Models for Efficient Intrusion Detection Systems: A Comprehensive Survey	Hamza Kheddar et.al.	2408.07583	null
2024-08-15	MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark	Minxuan Zhou et.al.	2408.07543	null
2024-08-15	Usefulness of data flow diagrams and large language models for security threat validation: a registered report	Winnie Bahati Mbaka et.al.	2408.07537	null
2024-08-14	Development of a Multi-Agent Clinical Decision Support System for Korean Triage and Acuity Scale (KTAS)-Based Triage and Treatment Planning in Emergency Departments	Seungjun Han et.al.	2408.07531	null
2024-08-14	Large Language Models Know What Makes Exemplary Contexts	Quanyu Long et.al.	2408.07505	null
2024-08-13	Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents	Kexun Zhang et.al.	2408.07060	null
2024-08-13	LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs	Yushi Bai et.al.	2408.07055	link
2024-08-13	Casper: Prompt Sanitization for Protecting User Privacy in Web-Based Large Language Models	Chun Jie Chong et.al.	2408.07004	null
2024-08-13	LLMs can Schedule	Henrik Abgaryan et.al.	2408.06993	link
2024-08-13	OpenResearcher: Unleashing AI for Accelerated Scientific Research	Yuxiang Zheng et.al.	2408.06941	link
2024-08-13	Evaluating Cultural Adaptability of a Large Language Model via Simulation of Synthetic Personas	Louis Kwok et.al.	2408.06929	null
2024-08-13	Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives	Zhihu Wang et.al.	2408.06904	null
2024-08-13	Leveraging Language Models for Emotion and Behavior Analysis in Education	Kaito Tanaka et.al.	2408.06874	null
2024-08-13	LoRA $^2$ : Multi-Scale Low-Rank Approximations for Fine-Tuning Large Language Models	Jia-Chen Zhang et.al.	2408.06854	null
2024-08-13	Causal Agent based on Large Language Model	Kairong Han et.al.	2408.06849	link
2024-08-12	Animate, or Inanimate, That is the Question for Large Language Models	Leonardo Ranaldi et.al.	2408.06332	null
2024-08-12	Can We Rely on LLM Agents to Draft Long-Horizon Plans? Let's Take TravelPlanner as an Example	Yanan Chen et.al.	2408.06318	null
2024-08-12	The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery	Chris Lu et.al.	2408.06292	link
2024-08-12	MovieSum: An Abstractive Summarization Dataset for Movie Screenplays	Rohit Saxena et.al.	2408.06281	link
2024-08-13	Review-driven Personalized Preference Reasoning with Large Language Models for Recommendation	Jieyong Kim et.al.	2408.06276	null
2024-08-13	FuxiTranyu: A Multilingual Large Language Model Trained with Balanced Data	Haoran Sun et.al.	2408.06273	null
2024-08-12	A RAG-Based Question-Answering Solution for Cyber-Attack Investigation and Attribution	Sampath Rajapaksha et.al.	2408.06272	null
2024-08-12	Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment	Karel D'Oosterlinck et.al.	2408.06266	null
2024-08-12	On Effects of Steering Latent Representation for Large Language Model Unlearning	Dang Huu-Tien et.al.	2408.06223	null
2024-08-12	Improving Structural Diversity of Blackbox LLMs via Chain-of-Specification Prompting	Halley Young et.al.	2408.06186	null
2024-08-10	Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions	Michele Miranda et.al.	2408.05212	null
2024-08-09	VITA: Towards Open-Source Interactive Omni Multimodal LLM	Chaoyou Fu et.al.	2408.05211	null
2024-08-09	Evaluating the capability of large language models to personalize science texts for diverse middle-school-age learners	Michael Vaccaro Jr et.al.	2408.05204	null
2024-08-09	TaSL: Task Skill Localization and Consolidation for Language Model Continual Learning	Yujie Feng et.al.	2408.05200	null
2024-08-09	AttackER: Towards Enhancing Cyber-Attack Attribution with a Named Entity Recognition Dataset	Pritam Deka et.al.	2408.05149	null
2024-08-09	A Hybrid RAG System with Comprehensive Enhancement on Complex Reasoning	Ye Yuan et.al.	2408.05141	null
2024-08-09	Is ChatGPT a Good Software Librarian? An Exploratory Study on the Use of ChatGPT for Software Library Recommendations	Jasmine Latendresse et.al.	2408.05128	null
2024-08-09	Large Language Models and Thematic Analysis: Human-AI Synergy in Researching Hate Speech on Social Media	Petre Breazu et.al.	2408.05126	null
2024-08-09	Sportify: Question Answering with Embedded Visualizations and Personified Narratives for Sports Video	Chunggi Lee et.al.	2408.05123	null
2024-08-09	A Survey of NL2SQL with Large Language Models: Where are we, and where are we going?	Xinyu Liu et.al.	2408.05109	null
2024-08-08	Better Alignment with Instruction Back-and-Forth Translation	Thao Nguyen et.al.	2408.04614	null
2024-08-09	Img-Diff: Contrastive Data Synthesis for Multimodal Large Language Models	Qirui Jiao et.al.	2408.04594	link
2024-08-08	Towards Resilient and Efficient LLMs: A Comparative Study of Efficiency, Performance, and Adversarial Robustness	Xiaojing Fan et.al.	2408.04585	null
2024-08-08	SCENE: Evaluating Explainable AI Techniques Using Soft Counterfactuals	Haoran Zheng et.al.	2408.04575	null
2024-08-08	Learning Fine-Grained Grounded Citations for Attributed Large Language Models	Lei Huang et.al.	2408.04568	link
2024-08-08	Bias-Aware Low-Rank Adaptation: Mitigating Catastrophic Inheritance of Large Language Models	Yupeng Chang et.al.	2408.04556	link
2024-08-08	Compromesso! Italian Many-Shot Jailbreaks Undermine the Safety of Large Language Models	Fabio Pernisi et.al.	2408.04522	null
2024-08-08	What You Need is What You Get: Theory of Mind for an LLM-Based Code Understanding Assistant	Jonan Richards et.al.	2408.04477	null
2024-08-08	Can LLMs Beat Humans in Debating? A Dynamic Multi-agent Framework for Competitive Debate	Yiqun Zhang et.al.	2408.04472	link
2024-08-08	RiskAwareBench: Towards Evaluating Physical Risk Awareness for High-level Planning of LLM-based Embodied Agents	Zihao Zhu et.al.	2408.04449	null
2024-08-07	How Well Can Vision Language Models See Image Details?	Chenhui Gou et.al.	2408.03940	null
2024-08-07	SLIM-RAFT: A Novel Fine-Tuning Approach to Improve Cross-Linguistic Performance for Mercosur Common Nomenclature	Vinícius Di Oliveira et.al.	2408.03936	null
2024-08-07	CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases	Xiangyan Liu et.al.	2408.03910	link
2024-08-07	Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models	Shachi H Kumar et.al.	2408.03907	null
2024-08-07	From Data to Story: Towards Automatic Animated Data Video Creation with LLM-based Multi-Agent Systems	Leixian Shen et.al.	2408.03876	null
2024-08-07	PackMamba: Efficient Processing of Variable-Length Sequences in Mamba training	Haoran Xu et.al.	2408.03865	null
2024-08-07	GAIA -- A Large Language Model for Advanced Power Dispatch	Yuheng Cheng et.al.	2408.03847	null
2024-08-07	MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models	Yuchen Dong et.al.	2408.03841	null
2024-08-07	WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models	Prannaya Gupta et.al.	2408.03837	null
2024-08-07	Target Prompting for Information Extraction with Vision Language Model	Dipankar Medhi et.al.	2408.03834	null
2024-08-06	TextIM: Part-aware Interactive Motion Synthesis from Text	Siyuan Fan et.al.	2408.03302	null
2024-08-06	KaPO: Knowledge-aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models	Ruizhe Zhang et.al.	2408.03297	null
2024-08-07	StructEval: Deepen and Broaden Large Language Model Assessment via Structured Evaluation	Boxi Cao et.al.	2408.03281	link
2024-08-06	Synthesizing Text-to-SQL Data from Weak and Strong LLMs	Jiaxi Yang et.al.	2408.03256	null
2024-08-06	Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons	Yifei Wang et.al.	2408.03247	null
2024-08-06	Crab Pulsar: IXPE Observations Reveal Unified Polarization Properties Across Optical and Soft X-Ray Bands	Denis González-Caniulef et.al.	2408.03245	null
2024-08-06	Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi	Pranita Deshmukh et.al.	2408.03172	null
2024-08-06	Conditioning LLMs with Emotion in Neural Machine Translation	Charles Brazier et.al.	2408.03150	null
2024-08-06	Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations	Leo Donisch et.al.	2408.03130	null
2024-08-06	Lisbon Computational Linguists at SemEval-2024 Task 2: Using A Mistral 7B Model and Data Augmentation	Artur Guimarães et.al.	2408.03127	null
2024-08-05	Can Reinforcement Learning Unlock the Hidden Dangers in Aligned Large Language Models?	Mohammad Bahrami Karkevandi et.al.	2408.02651	null
2024-08-05	SEAS: Self-Evolving Adversarial Safety Optimization for Large Language Models	Muxi Diao et.al.	2408.02632	null
2024-08-05	LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba	Yunxiang Fu et.al.	2408.02615	null
2024-08-05	Progressively Selective Label Enhancement for Language Model Alignment	Biao Liu et.al.	2408.02599	null
2024-08-05	Leveraging the Power of LLMs: A Fine-Tuning Approach for High-Quality Aspect-Based Summarization	Ankan Mullick et.al.	2408.02584	null
2024-08-05	Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information	Yauwai Yim et.al.	2408.02559	null
2024-08-05	Generative AI as a Service in 6G Edge-Cloud: Generation Task Offloading by In-context Learning	Hao Zhou et.al.	2408.02549	null
2024-08-05	RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation	Daniel Fleischer et.al.	2408.02545	null
2024-08-05	Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions	Xinbei Ma et.al.	2408.02544	null
2024-08-05	Towards Coarse-grained Visual Language Navigation Task Planning Enhanced by Event Knowledge Graph	Zhao Kaichen et.al.	2408.02535	null
2024-08-02	Prompt Recursive Search: A Living Framework with Adaptive Growth in LLM Auto-Prompting	Xiangyu Zhao et.al.	2408.01423	null
2024-08-02	Mission Impossible: A Statistical Perspective on Jailbreaking LLMs	Jingtong Su et.al.	2408.01420	null
2024-08-02	DebateQA: Evaluating Question Answering on Debatable Knowledge	Rongwu Xu et.al.	2408.01419	null
2024-08-02	Talk Less, Interact Better: Evaluating In-context Conversational Adaptation in Multimodal LLMs	Yilun Hua et.al.	2408.01417	null
2024-08-02	Pre-trained Language Models Improve the Few-shot Prompt Ability of Decision Transformer	Yu Yang et.al.	2408.01402	null
2024-08-02	Coalitions of Large Language Models Increase the Robustness of AI Agents	Prattyush Mangal et.al.	2408.01380	null
2024-08-02	Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation	Jheng-Hong Yang et.al.	2408.01363	null
2024-08-05	Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs	Peng Ding et.al.	2408.01355	null
2024-08-02	MCGMark: An Encodable and Robust Online Watermark for LLM-Generated Malicious Code	Kaiwen Ning et.al.	2408.01354	null
2024-08-02	Prompt Refinement or Fine-tuning? Best Practices for using LLMs in Computational Social Science Tasks	Anders Giovanni Møller et.al.	2408.01346	null
2024-08-01	AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation	Mengkang Hu et.al.	2408.00764	null
2024-08-01	Tamper-Resistant Safeguards for Open-Weight LLMs	Rishub Tamirisa et.al.	2408.00761	null
2024-08-01	DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency	Jovan Stojkovic et.al.	2408.00741	null
2024-08-01	Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions	Guangzhi Xiong et.al.	2408.00727	null
2024-08-01	An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models	Yangzhen Wu et.al.	2408.00724	null
2024-08-01	Pathway to Secure and Trustworthy 6G for LLMs: Attacks, Defense, and Opportunities	Sunder Ali Khowaja et.al.	2408.00722	null
2024-08-02	Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning	Trapoom Ukarapol et.al.	2408.00690	null
2024-08-01	Can Developers Prompt? A Controlled Experiment for Code Documentation Generation	Hans-Alexander Kruse et.al.	2408.00686	null
2024-08-01	AutoM3L: An Automated Multimodal Machine Learning Framework with Large Language Models	Daqin Luo et.al.	2408.00665	null
2024-08-01	Disentangling Dense Embeddings with Sparse Autoencoders	Charles O'Neill et.al.	2408.00657	null
2024-07-31	Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs	Shi Liu et.al.	2407.21771	null
2024-07-31	ReplanVLM: Replanning Robotic Tasks with Visual Language Models	Aoran Mei et.al.	2407.21762	null
2024-07-31	Adaptive Retrieval-Augmented Generation for Conversational Systems	Xi Wang et.al.	2407.21712	null
2024-07-31	CEAR: Automatic construction of a knowledge graph of chemical entities and roles from scientific literature	Stefan Langer et.al.	2407.21708	null
2024-07-31	TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities	Ming Zhang et.al.	2407.21693	null
2024-07-31	Synth-Empathy: Towards High-Quality Synthetic Empathy Data	Hao Liang et.al.	2407.21669	null
2024-07-31	LLM-for-X: Application-agnostic Integration of Large Language Models to Support Personal Writing Workflows	Lukas Teufelberger et.al.	2407.21593	null
2024-07-31	A Performance Study of LLM-Generated Code on Leetcode	Tristan Coignion et.al.	2407.21579	null
2024-07-31	PMoE: Progressive Mixture of Experts with Asymmetric Transformer for Continual Learning	Min Jae Jung et.al.	2407.21571	null
2024-07-31	CXSimulator: A User Behavior Simulation using LLM Embeddings for Web-Marketing Campaign Assessment	Akira Kasuga et.al.	2407.21553	null
2024-07-30	ThinK: Thinner Key Cache by Query-Driven Pruning	Yuhui Xu et.al.	2407.21018	null
2024-07-30	CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning	Yuexi Du et.al.	2407.21011	link
2024-07-31	MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning	Yupeng Chen et.al.	2407.20999	null
2024-07-30	From Feature Importance to Natural Language Explanations Using LLMs with RAG	Sule Tekkesinoglu et.al.	2407.20990	null
2024-07-30	Large Language Models (LLMs) for Semantic Communication in Edge-based IoT Networks	Alakesh Kalita et.al.	2407.20970	null
2024-07-30	Automated Review Generation Method Based on Large Language Models	Shican Wu et.al.	2407.20906	link
2024-07-30	ThinkRepair: Self-Directed Automated Program Repair	Xin Yin et.al.	2407.20898	link
2024-07-30	Effective Black Box Testing of Sentiment Analysis Classification Networks	Parsa Karbasizadeh et.al.	2407.20884	null
2024-07-30	Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification	Boyang Zhang et.al.	2407.20859	null
2024-07-30	Learn by Selling: Equipping Large Language Models with Product Knowledge for Context-Driven Recommendations	Sarthak Anand et.al.	2407.20856	null
2024-07-29	Specify and Edit: Overcoming Ambiguity in Text-Based Image Editing	Ekaterina Iakovleva et.al.	2407.20232	null
2024-07-29	Can Editing LLMs Inject Harm?	Canyu Chen et.al.	2407.20224	null
2024-07-29	QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval	Hongming Tan et.al.	2407.20207	null
2024-07-29	MindSearch: Mimicking Human Minds Elicits Deep AI Searcher	Zehui Chen et.al.	2407.20183	link
2024-07-29	Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning	Xingchen Zeng et.al.	2407.20174	link
2024-07-29	Diffusion Feedback Helps CLIP See Better	Wenxuan Wang et.al.	2407.20171	null
2024-07-29	Language-Conditioned Offline RL for Multi-Robot Navigation	Steven Morad et.al.	2407.20164	null
2024-07-29	rLLM: Relational Table Learning with LLMs	Weichen Li et.al.	2407.20157	link
2024-07-29	ByteCheckpoint: A Unified Checkpointing System for LLM Development	Borui Wan et.al.	2407.20143	null
2024-07-29	Orca: Ocean Significant Wave Height Estimation with Spatio-temporally Aware Large Language Models	Zhe Li et.al.	2407.20053	null
2024-07-26	Small Molecule Optimization with Large Language Models	Philipp Guevorguian et.al.	2407.18897	link
2024-07-26	Human-artificial intelligence teaming for scientific information extraction from data-driven additive manufacturing research using large language models	Mutahar Safdar et.al.	2407.18827	null
2024-07-26	Automatic Detection of Moral Values in Music Lyrics	Vjosa Preniqi et.al.	2407.18787	null
2024-07-26	The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs	Aleix Sant et.al.	2407.18786	null
2024-07-26	TAGIFY: LLM-powered Tagging Interface for Improved Data Findability on OGD portals	Kevin Kliimask et.al.	2407.18764	null
2024-07-29	Knowledge Graph Structure as Prompt: Improving Small Language Models Capabilities for Knowledge-based Causal Discovery	Yuni Susanti et.al.	2407.18752	link
2024-07-26	Towards Effective and Efficient Continual Pre-training of Large Language Models	Jie Chen et.al.	2407.18743	null
2024-07-26	Towards Generalized Offensive Language Identification	Alphaeus Dmonte et.al.	2407.18738	null
2024-07-26	LLASP: Fine-tuning Large Language Models for Answer Set Programming	Erica Coppolillo et.al.	2407.18723	null
2024-07-26	Neurosymbolic AI for Enhancing Instructability in Generative AI	Amit Sheth et.al.	2407.18722	null
2024-07-26	Recursive Introspection: Teaching Language Model Agents How to Self-Improve	Yuxiao Qu et.al.	2407.18219	null
2024-07-26	Exploring Scaling Trends in LLM Robustness	Nikolaus Howe et.al.	2407.18213	null
2024-07-25	Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models	Sanae Lotfi et.al.	2407.18158	null
2024-07-26	Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic	Fakhraddin Alwajih et.al.	2407.18129	null
2024-07-25	Fine-Tuning Large Language Models for Stock Return Prediction Using Newsflow	Tian Guo et.al.	2407.18103	null
2024-07-25	PEFT-U: Parameter-Efficient Fine-Tuning for User Personalization	Christopher Clarke et.al.	2407.18078	link
2024-07-25	C2P: Featuring Large Language Models with Causal Reasoning	Abdolmahdi Bagheri et.al.	2407.18069	null
2024-07-25	ComPeer: A Generative Conversational Agent for Proactive Peer Support	Tianjian Liu et.al.	2407.18064	null
2024-07-25	Audio Entailment: Assessing Deductive Reasoning for Audio Understanding	Soham Deshmukh et.al.	2407.18062	link
2024-07-25	Difficulty Estimation and Simplification of French Text Using LLMs	Henri Jamet et.al.	2407.18061	null
2024-07-24	I Could've Asked That: Reformulating Unanswerable Questions	Wenting Zhao et.al.	2407.17469	link
2024-07-24	WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries	Wenting Zhao et.al.	2407.17468	null
2024-07-24	CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models	Jiawei Gu et.al.	2407.17467	null
2024-07-24	$VILA^2$ : VILA Augmented VILA	Yunhao Fang et.al.	2407.17453	null
2024-07-24	Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?	Michael-Andrei Panaitescu-Liess et.al.	2407.17417	null
2024-07-24	(PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork	Tianjin Huang et.al.	2407.17412	null
2024-07-24	Grammar-based Game Description Generation using Large Language Models	Tsunehiko Tanaka et.al.	2407.17404	null
2024-07-24	3D Question Answering for City Scene Understanding	Penglei Sun et.al.	2407.17398	null
2024-07-24	ViPer: Visual Personalization of Generative Models via Individual Preference Learning	Sogand Salehi et.al.	2407.17365	null
2024-07-24	Scalify: scale propagation for efficient low-precision LLM training	Paul Balança et.al.	2407.17353	link
2024-07-23	Can Large Language Models Automatically Jailbreak GPT-4V?	Yuanwei Wu et.al.	2407.16686	null
2024-07-23	RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent	Huiyu Xu et.al.	2407.16667	null
2024-07-23	Course-Correction: Safety Alignment Using Synthetic Preferences	Rongwu Xu et.al.	2407.16637	null
2024-07-23	Lawma: The Power of Specialization for Legal Tasks	Ricardo Dominguez-Olmedo et.al.	2407.16615	null
2024-07-23	Shared Imagination: LLMs Hallucinate Alike	Yilun Zhou et.al.	2407.16604	null
2024-07-23	Exploring Automatic Cryptographic API Misuse Detection in the Era of LLMs	Yifan Xia et.al.	2407.16576	null
2024-07-23	Retrieve, Generate, Evaluate: A Case Study for Medical Paraphrases Generation with Small Language Models	Ioana Buhnila et.al.	2407.16565	null
2024-07-23	Patched RTC: evaluating LLMs for diverse software development tasks	Asankhaya Sharma et.al.	2407.16557	null
2024-07-24	MicroEmo: Time-Sensitive Multimodal Emotion Recognition with Micro-Expression Dynamics in Video Dialogues	Liyun Zhang et.al.	2407.16552	null
2024-07-23	HAPFI: History-Aware Planning based on Fused Information	Sujin Jeon et.al.	2407.16533	null
2024-07-22	AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description	Junyu Xie et.al.	2407.15850	link
2024-07-22	LLMmap: Fingerprinting For Large Language Models	Dario Pasquini et.al.	2407.15847	null
2024-07-22	SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models	Mingze Xu et.al.	2407.15841	null
2024-07-22	MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity	Yangzhou Liu et.al.	2407.15838	null
2024-07-22	dMel: Speech Tokenization made Simple	He Bai et.al.	2407.15835	null
2024-07-22	Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight	Ziyuan Huang et.al.	2407.15819	null
2024-07-22	Extracting Structured Insights from Financial News: An Augmented LLM Driven Approach	Rian Dolphin et.al.	2407.15788	null
2024-07-22	MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation	Marco Simoni et.al.	2407.15748	null
2024-07-22	OMoS-QA: A Dataset for Cross-Lingual Extractive Question Answering in a German Migration Context	Steffen Kleinle et.al.	2407.15736	null
2024-07-22	TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON	John Chong Min Tan et.al.	2407.15734	null
2024-07-19	Internal Consistency and Self-Feedback in Large Language Models: A Survey	Xun Liang et.al.	2407.14507	link
2024-07-19	On Pre-training of Multimodal Language Models Customized for Chart Understanding	Wan-Cyuan Fan et.al.	2407.14506	null
2024-07-19	Evaluating the Reliability of Self-Explanations in Large Language Models	Korbinian Randl et.al.	2407.14487	link
2024-07-19	Contrastive Learning with Counterfactual Explanations for Radiology Report Generation	Mingjie Li et.al.	2407.14474	null
2024-07-19	Check-Eval: A Checklist-based Approach for Evaluating Text Quality	Jayr Pereira et.al.	2407.14467	null
2024-07-19	Undermining Mental Proof: How AI Can Make Cooperation Harder by Making Thinking Easier	Zachary Wojtowicz et.al.	2407.14452	null
2024-07-19	Token-level Correlation-guided Compression for Efficient Multimodal Document Understanding	Renshan Zhang et.al.	2407.14439	link
2024-07-19	The Vision of Autonomic Computing: Can LLMs Make It a Reality?	Zhiyang Zhang et.al.	2407.14402	null
2024-07-19	Open Artificial Knowledge	Vadim Borisov et.al.	2407.14371	null
2024-07-19	Enhancing Zero-shot Audio Classification using Sound Attribute Knowledge from Large Language Models	Xuenan Xu et.al.	2407.14355	null
2024-07-18	SegPoint: Segment Any Point Cloud via Large Language Model	Shuting He et.al.	2407.13761	null
2024-07-18	Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion	Boyang Deng et.al.	2407.13759	null
2024-07-18	Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models	Zhuo Chen et.al.	2407.13757	null
2024-07-18	CellularLint: A Systematic Approach to Identify Inconsistent Behavior in Cellular Network Specifications	Mirza Masfiqur Rahman et.al.	2407.13742	null
2024-07-18	Baba Is AI: Break the Rules to Beat the Benchmark	Nathan Cloos et.al.	2407.13729	null
2024-07-18	CoDefeater: Using LLMs To Find Defeaters in Assurance Cases	Usman Gohar et.al.	2407.13717	null
2024-07-18	Understanding Reference Policies in Direct Preference Optimization	Yixin Liu et.al.	2407.13709	null
2024-07-18	A Comprehensive Review of Recommender Systems: Transitioning from Theory to Practice	Shaina Raza et.al.	2407.13699	null
2024-07-18	Prover-Verifier Games improve legibility of LLM outputs	Jan Hendrik Kirchner et.al.	2407.13692	null
2024-07-18	COMCAT: Leveraging Human Judgment to Improve Automatic Documentation and Summarization	Skyler Grandel et.al.	2407.13648	null
2024-07-17	EchoSight: Advancing Visual-Language Models with Wiki Knowledge	Yibin Yan et.al.	2407.12735	null
2024-07-17	NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model	Zhongqun Zhang et.al.	2407.12727	null
2024-07-17	Is Sarcasm Detection A Step-by-Step Reasoning Process in Large Language Models?	Ben Yao et.al.	2407.12725	null
2024-07-17	The Future of Learning: Large Language Models through the Lens of Students	He Zhang et.al.	2407.12723	null
2024-07-17	MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models	Leyang Shen et.al.	2407.12709	link
2024-07-17	Patch-Level Training for Large Language Models	Chenze Shao et.al.	2407.12665	link
2024-07-17	Zero-shot Text-guided Infinite Image Synthesis with LLM guidance	Soyeong Kwon et.al.	2407.12642	null
2024-07-17	Harnessing the Power of Artificial Intelligence to Vitalize Endangered Indigenous Languages: Technologies and Experiences	Claudio Pinhanez et.al.	2407.12620	null
2024-07-17	AudienceView: AI-Assisted Interpretation of Audience Feedback in Journalism	William Brannon et.al.	2407.12613	link
2024-07-17	E5-V: Universal Embeddings with Multimodal Large Language Models	Ting Jiang et.al.	2407.12580	link
2024-07-16	UrbanWorld: An Urban World Model for 3D City Generation	Yu Shang et.al.	2407.11965	null
2024-07-16	NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?	Mo Li et.al.	2407.11963	link
2024-07-16	Code Documentation and Analysis to Secure Software Development	Paul Attie et.al.	2407.11934	null
2024-07-16	What's Wrong? Refining Meeting Summaries with LLM Feedback	Frederic Kirstein et.al.	2407.11919	null
2024-07-16	Ascend-CC: Confidential Computing on Heterogeneous NPU for Emerging Generative AI Workloads	Aritra Dhar et.al.	2407.11888	null
2024-07-16	Schema Matching with Large Language Models: an Experimental Study	Marcel Parciak et.al.	2407.11852	link
2024-07-16	LoFTI: Localization and Factuality Transfer to Indian Locales	Sona Elza Simon et.al.	2407.11833	link
2024-07-16	GPT Assisted Annotation of Rhetorical and Linguistic Features for Interpretable Propaganda Technique Detection in News Text	Kyle Hamilton et.al.	2407.11827	null
2024-07-16	PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation	Branden Butler et.al.	2407.11798	null
2024-07-16	Large Language Models as Misleading Assistants in Conversation	Betty Li Hou et.al.	2407.11789	null
2024-07-15	VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation	Bocheng Zou et.al.	2407.10972	link
2024-07-15	Q-Sparse: All Large Language Models can be Fully Sparsely-Activated	Hongyu Wang et.al.	2407.10969	null
2024-07-15	Fast Matrix Multiplications for Lookup Table-Quantized LLMs	Han Guo et.al.	2407.10960	null
2024-07-15	MMM: Multilingual Mutual Reinforcement Effect Mix Datasets & Test with Open-domain Information Extraction Large Language Models	Chengguang Gan et.al.	2407.10953	null
2024-07-15	Can Textual Semantics Mitigate Sounding Object Segmentation Preference?	Yaoting Wang et.al.	2407.10947	link
2024-07-15	GRUtopia: Dream General Robots in a City at Scale	Hanqing Wang et.al.	2407.10943	link
2024-07-15	OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting	Penglei Gao et.al.	2407.10923	null
2024-07-15	FinDKG: Dynamic Knowledge Graphs with Large Language Models for Detecting Global Trends in Financial Markets	Xiaohui Victor Li et.al.	2407.10909	null
2024-07-15	Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique	Mark Russinovich et.al.	2407.10887	null
2024-07-15	SLIP: Securing LLMs IP Using Weights Decomposition	Yehonathan Refael et.al.	2407.10886	null
2024-07-12	FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3	Georgios Makridis et.al.	2407.09467	null
2024-07-12	Human-like Episodic Memory for Infinite Context LLMs	Zafeirios Fountas et.al.	2407.09450	null
2024-07-12	ASTPrompter: Weakly Supervised Automated Language Model Red-Teaming to Identify Likely Toxic Prompts	Amelia F. Hardy et.al.	2407.09447	null
2024-07-12	MUSCLE: A Model Update Strategy for Compatible LLM Evolution	Jessica Echterhoff et.al.	2407.09435	null
2024-07-12	Open (Clinical) LLMs are Sensitive to Instruction Phrasings	Alberto Mario Ceballos Arroyo et.al.	2407.09429	null
2024-07-12	TelecomGPT: A Framework to Build Telecom-Specfic Large Language Models	Hang Zou et.al.	2407.09424	null
2024-07-12	Mitigating Entity-Level Hallucination in Large Language Models	Weihang Su et.al.	2407.09417	link
2024-07-12	SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers	Shraman Pramanick et.al.	2407.09413	link
2024-07-12	PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents	Saber Zerhoudi et.al.	2407.09394	null
2024-07-12	GAVEL: Generating Games Via Evolution and Language Models	Graham Todd et.al.	2407.09388	null
2024-07-11	MAVIS: Mathematical Visual Instruction Tuning	Renrui Zhang et.al.	2407.08739	link
2024-07-11	Real-Time Anomaly Detection and Reactive Planning with Large Language Models	Rohan Sinha et.al.	2407.08735	null
2024-07-11	Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist	Zihao Zhou et.al.	2407.08733	null
2024-07-11	A Taxonomy for Data Contamination in Large Language Models	Medha Palavalli et.al.	2407.08716	null
2024-07-11	GTA: A Benchmark for General Tool Agents	Jize Wang et.al.	2407.08713	link
2024-07-11	Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models	Zhening Xing et.al.	2407.08701	null
2024-07-11	Mitigating Catastrophic Forgetting in Language Transfer via Model Merging	Anton Alexandrov et.al.	2407.08699	null
2024-07-11	Cloud Atlas: Efficient Fault Localization for Cloud Systems using Language Models and Causal Insight	Zhiqiang Xie et.al.	2407.08694	null
2024-07-11	SEED-Story: Multimodal Long Story Generation with Large Language Model	Shuai Yang et.al.	2407.08683	link
2024-07-11	Uncertainty Estimation of Large Language Models in Medical Question Answering	Jiaxin Wu et.al.	2407.08662	null
2024-07-10	Training on the Test Task Confounds Evaluation and Emergence	Ricardo Dominguez-Olmedo et.al.	2407.07890	link
2024-07-10	Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization	Junkang Wu et.al.	2407.07880	link
2024-07-10	FACTS About Building Retrieval Augmented Generation-based Chatbots	Rama Akkiraju et.al.	2407.07858	null
2024-07-10	OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training	Sami Jaghouar et.al.	2407.07852	link
2024-07-10	Natural Language Mechanisms via Self-Resolution with Foundation Models	Nicolas Della Penna et.al.	2407.07845	null
2024-07-10	Transformer Alignment in Large Language Models	Murdock Aubry et.al.	2407.07810	null
2024-07-10	Attribute or Abstain: Large Language Models as Long Document Assistants	Jan Buchmann et.al.	2407.07799	link
2024-07-11	Evaluating Large Language Models with Grid-Based Game Competitions: An Extensible LLM Benchmark and Leaderboard	Oguzhan Topsakal et.al.	2407.07796	link
2024-07-10	Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities	Tianjie Ju et.al.	2407.07791	link
2024-07-10	WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment	Jiefu Ou et.al.	2407.07778	null
2024-07-09	AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning	Jiaxi Cui et.al.	2407.07094	link
2024-07-09	FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation	Liqun Ma et.al.	2407.07093	link
2024-07-09	Hypothetical Minds: Scaffolding Theory of Mind for Multi-Agent Tasks with Large Language Models	Logan Cross et.al.	2407.07086	link
2024-07-09	Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities	Shaltiel Shmidman et.al.	2407.07080	null
2024-07-09	Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps	Yung-Sung Chuang et.al.	2407.07071	link
2024-07-09	Prompting Techniques for Secure Code Generation: A Systematic Investigation	Catherine Tony et.al.	2407.07064	null
2024-07-10	Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence	Weize Chen et.al.	2407.07061	link
2024-07-10	Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model	Wenqi Zhang et.al.	2407.07053	link
2024-07-09	Using Large Language Models for Generating Smart Contracts for Health Insurance from Textual Policies	Inwon Kang et.al.	2407.07019	null
2024-07-09	End-To-End Causal Effect Estimation from Unstructured Natural Language Data	Nikita Dhawan et.al.	2407.07018	null
2024-07-08	Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision	Orr Zohar et.al.	2407.06189	link
2024-07-08	CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation	Xinying Guo et.al.	2407.06188	null
2024-07-08	On Speeding Up Language Model Evaluation	Jin Peng Zhou et.al.	2407.06172	null
2024-07-08	What's Wrong with Your Code Generated by Large Language Models? An Extensive Study	Shihan Dou et.al.	2407.06153	null
2024-07-09	Using Grammar Masking to Ensure Syntactic Validity in LLM-based Modeling Tasks	Lukas Netz et.al.	2407.06146	null
2024-07-08	ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation	Ethan Chern et.al.	2407.06135	link
2024-07-09	Evaluating the Semantic Profiling Abilities of LLMs for Natural Language Utterances in Data Visualization	Hannah K. Bako et.al.	2407.06129	link
2024-07-08	Depression Detection and Analysis using Large Language Models on Textual and Audio-Visual Modalities	Avinash Anand et.al.	2407.06125	null
2024-07-08	Artificial Intuition: Efficient Classification of Scientific Abstracts	Harsh Sakhrani et.al.	2407.06093	null
2024-07-08	Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models	Jinliang Lu et.al.	2407.06089	null
2024-07-05	Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs	Rudolf Laine et.al.	2407.04694	null
2024-07-05	ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models	Yuzhe Gu et.al.	2407.04693	null
2024-07-05	Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge	Yuanze Lin et.al.	2407.04681	null
2024-07-05	Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition	Ye Bai et.al.	2407.04675	null
2024-07-05	Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement	Yongji Wu et.al.	2407.04656	null
2024-07-05	Entity Decomposition with Filtering: A Zero-Shot Clinical Named Entity Recognition Framework	Reza Averly et.al.	2407.04629	null
2024-07-05	On scalable oversight with weak LLMs judging strong LLMs	Zachary Kenton et.al.	2407.04622	null
2024-07-05	Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions	Shumaila Javaid et.al.	2407.04581	null
2024-07-05	VRSD: Rethinking Similarity and Diversity for Retrieval in Large Language Models	Hang Gao et.al.	2407.04573	null
2024-07-05	PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Posts	Ana-Cristina Rogoz et.al.	2407.04541	link
2024-07-03	Universal Length Generalization with Turing Programs	Kaiying Hou et.al.	2407.03310	null
2024-07-03	Large Language Models for JSON Schema Discovery	Michael J. Mior et.al.	2407.03286	null
2024-07-03	LLM Internal States Reveal Hallucination Risk Faced With a Query	Ziwei Ji et.al.	2407.03282	null
2024-07-03	Programming universal unitary transformations on a general-purpose silicon photonics platform	Jose Roberto Rausell-Campo et.al.	2407.03235	null
2024-07-03	Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning	Zhili Shen et.al.	2407.03227	null
2024-07-03	How Does Quantization Affect Multilingual LLMs?	Kelly Marchisio et.al.	2407.03211	null
2024-07-03	TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts	Ruida Wang et.al.	2407.03203	link
2024-07-03	Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models	Haritz Puerto et.al.	2407.03181	link
2024-07-03	Investigating Decoder-only Large Language Models for Speech-to-text Translation	Chao-Wei Huang et.al.	2407.03169	null
2024-07-03	SOS! Soft Prompt Attack Against Open-Source Large Language Models	Ziqing Yang et.al.	2407.03160	null
2024-07-02	MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention	Huiqiang Jiang et.al.	2407.02490	link
2024-07-02	Neurocache: Efficient Vector Retrieval for Long-range Language Modeling	Ali Safaya et.al.	2407.02486	link
2024-07-02	RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs	Yue Yu et.al.	2407.02485	null
2024-07-02	MMedAgent: Learning to Use Medical Tools with Multi-modal Agent	Binxu Li et.al.	2407.02483	null
2024-07-02	Understanding Alignment in Multimodal LLMs: A Comprehensive Study	Elmira Amirloo et.al.	2407.02477	null
2024-07-02	Open Scene Graphs for Open World Object-Goal Navigation	Joel Loo et.al.	2407.02473	null
2024-07-02	Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I	Harrie Oosterhuis et.al.	2407.02464	null
2024-07-03	Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs	Jinmin Li et.al.	2407.02411	null
2024-07-02	CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models	Song Wang et.al.	2407.02408	null
2024-07-02	Assessing the Code Clone Detection Capability of Large Language Models	Zixian Zhang et.al.	2407.02402	null
2024-06-28	Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs	Sukmin Yun et.al.	2406.20098	link
2024-06-28	LLaRA: Supercharging Robot Learning Data for Vision-Language Policy	Xiang Li et.al.	2406.20095	link
2024-06-28	Scaling Synthetic Data Creation with 1,000,000,000 Personas	Xin Chan et.al.	2406.20094	null
2024-06-28	LLaVolta: Efficient Multi-modal Models via Stage-wise Visual Context Compression	Jieneng Chen et.al.	2406.20092	link
2024-06-28	ProgressGym: Alignment with a Millennium of Moral Progress	Tianyi Qiu et.al.	2406.20087	null
2024-06-28	Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language	Yicheng Chen et.al.	2406.20085	null
2024-06-28	Molecular Facts: Desiderata for Decontextualization in LLM Fact Verification	Anisha Gunjal et.al.	2406.20079	link
2024-07-02	BMW Agents -- A Framework For Task Automation Through Multi-Agent Collaboration	Noel Crawford et.al.	2406.20041	null
2024-06-28	LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models	Renzhi Wang et.al.	2406.20030	null
2024-06-28	ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models	Yuxiang Zhang et.al.	2406.20015	link
2024-06-27	ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos	Jr-Jen Chen et.al.	2406.19392	link
2024-06-27	The Remarkable Robustness of LLMs: Stages of Inference?	Vedang Lad et.al.	2406.19384	link
2024-06-27	Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model	Haobo Yuan et.al.	2406.19369	null
2024-06-27	The Model Arena for Cross-lingual Sentiment Analysis: A Comparative Study in the Era of Large Language Models	Xiliang Zhu et.al.	2406.19358	null
2024-06-27	DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions	Nigel Fernandez et.al.	2406.19356	null
2024-06-27	IndoToxic2024: A Demographically-Enriched Dataset of Hate Speech and Toxicity Types for Indonesian Language	Lucky Susanto et.al.	2406.19349	null
2024-06-27	Efficient World Models with Context-Aware Tokenization	Vincent Micheli et.al.	2406.19320	link
2024-06-27	Jump Starting Bandits with LLM-Generated Prior Knowledge	Parand A. Alamdari et.al.	2406.19317	null
2024-06-27	From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data	Zheyang Xiong et.al.	2406.19292	null
2024-06-27	PhysioLLM: Supporting Personalized Health Insights with Wearables and Large Language Models	Cathy Mengying Fang et.al.	2406.19283	null
2024-06-26	Symbolic Learning Enables Self-Evolving Agents	Wangchunshu Zhou et.al.	2406.18532	link
2024-06-26	PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation	Christoph Leiter et.al.	2406.18528	null
2024-06-26	CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs	Zirui Wang et.al.	2406.18521	null
2024-06-26	"Is ChatGPT a Better Explainer than My Professor?": Evaluating the Explanation Capabilities of LLMs in Conversation Compared to a Human Baseline	Grace Li et.al.	2406.18512	null
2024-06-26	Mental Modeling of Reinforcement Learning Agents by Language Models	Wenhao Lu et.al.	2406.18505	null
2024-06-26	Is In-Context Learning a Type of Gradient-Based Learning? Evidence from the Inverse Frequency Effect in Structural Priming	Zhenghao Zhou et.al.	2406.18501	null
2024-06-26	LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism	Diandian Gu et.al.	2406.18485	null
2024-06-26	Role-Play Zero-Shot Prompting with Large Language Models for Open-Domain Human-Machine Conversation	Ahmed Njifenjou et.al.	2406.18460	null
2024-06-26	Cascading Large Language Models for Salient Event Graph Generation	Xingwei Tan et.al.	2406.18449	null
2024-06-26	New intelligent empowerment for digital transformation	Peng Yifeng et.al.	2406.18440	null
2024-06-25	MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning	Xiangyu Zhao et.al.	2406.17770	link
2024-06-25	BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning	Ercong Nie et.al.	2406.17764	null
2024-06-25	CaLMQA: Exploring culturally specific long-form question answering across 23 languages	Shane Arora et.al.	2406.17761	link
2024-06-25	Accelerating Clinical Evidence Synthesis with Large Language Models	Zifeng Wang et.al.	2406.17755	null
2024-06-25	Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language	Amalie Brogaard Pauli et.al.	2406.17753	null
2024-06-25	LLM Targeted Underperformance Disproportionately Impacts Vulnerable Users	Elinor Poole-Dayan et.al.	2406.17737	null
2024-06-25	FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model	Feijie Wu et.al.	2406.17706	null
2024-06-25	From Distributional to Overton Pluralism: Investigating Large Language Model Alignment	Thom Lake et.al.	2406.17692	link
2024-06-26	VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation	Kun Qian et.al.	2406.17681	link
2024-06-25	Quantifying AI Psychology: A Psychometrics Benchmark for Large Language Models	Yuan Li et.al.	2406.17675	null
2024-06-24	EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees	Yuhui Li et.al.	2406.16858	null
2024-06-24	From Decoding to Meta-Generation: Inference-time Algorithms for Large Language Models	Sean Welleck et.al.	2406.16838	null
2024-06-24	USDC: A Dataset of $\underline{U}$ser $\underline{S}$tance and $\underline{D}$ogmatism in Long $\underline{C}$ onversations	Mounika Marreddy et.al.	2406.16833	null
2024-06-24	Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track	Ronak Pradeep et.al.	2406.16828	null
2024-06-24	RES-Q: Evaluating Code-Editing Large Language Model Systems at the Repository Scale	Beck LaBash et.al.	2406.16801	link
2024-06-25	Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs	Ashwinee Panda et.al.	2406.16797	link
2024-06-24	M2Lingual: Enhancing Multilingual, Multi-Turn Instruction Alignment in Large Language Models	Rishabh Maheshwary et.al.	2406.16783	null
2024-06-24	It Is Not About What You Say, It Is About How You Say It: A Surprisingly Simple Approach for Improving Reading Comprehension	Sagi Shaier et.al.	2406.16779	null
2024-06-24	Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024	Sai Koneru et.al.	2406.16777	null
2024-06-24	WARP: On the Benefits of Weight Averaged Rewarded Policies	Alexandre Ramé et.al.	2406.16768	null
2024-06-21	GenoTEX: A Benchmark for Evaluating LLM-Based Exploration of Gene Expression Data in Alignment with Bioinformaticians	Haoyang Liu et.al.	2406.15341	link
2024-06-21	Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance	Haoling Li et.al.	2406.15330	null
2024-06-21	Bug In the Code Stack: Can LLMs Find Bugs in Large Python Code Stacks	Hokyung Lee et.al.	2406.15325	null
2024-06-21	Towards Fine-Grained Citation Evaluation in Generated Text: A Comparative Analysis of Faithfulness Metrics	Weijia Zhang et.al.	2406.15264	null
2024-06-21	Detecting Synthetic Lyrics with Few-Shot Inference	Yanis Labrak et.al.	2406.15231	null
2024-06-21	A LLM-Based Ranking Method for the Evaluation of Automatic Counter-Narrative Generation	Irune Zubiaga et.al.	2406.15227	null
2024-06-21	Unsupervised Extraction of Dialogue Policies from Conversations	Makesh Narsimhan Sreedhar et.al.	2406.15214	null
2024-06-21	Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding	Mohan Li et.al.	2406.15209	null
2024-06-21	Exploring the Efficacy of Robotic Assistants with ChatGPT and Claude in Enhancing ADHD Therapy: Innovating Treatment Paradigms	Santiago Berrezueta-Guzman et.al.	2406.15198	null
2024-06-21	UDA: A Benchmark Suite for Retrieval Augmented Generation in Real-world Document Analysis	Yulong Hui et.al.	2406.15187	link
2024-06-20	Model Merging and Safety Alignment: One Bad Model Spoils the Bunch	Hasan Abed Al Kader Hammoud et.al.	2406.14563	null
2024-06-20	Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities	Sachit Menon et.al.	2406.14562	null
2024-06-21	Asynchronous Large Language Model Enhanced Planner for Autonomous Driving	Yuan Chen et.al.	2406.14556	null
2024-06-20	GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models	Shilong Li et.al.	2406.14550	null
2024-06-20	Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models	Sunny Duan et.al.	2406.14549	null
2024-06-20	Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data	Johannes Treutlein et.al.	2406.14546	link
2024-06-20	Unmasking Database Vulnerabilities: Zero-Knowledge Schema Inference Attacks in Text-to-SQL Systems	Đorđe Klisura et.al.	2406.14545	null
2024-06-20	Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs	Yuxuan Qiao et.al.	2406.14544	link
2024-06-21	Are LLMs Naturally Good at Synthetic Tabular Data Generation?	Shengzhe Xu et.al.	2406.14541	link
2024-06-20	PostMark: A Robust Blackbox Watermark for Large Language Models	Yapei Chang et.al.	2406.14517	link
2024-06-18	DrVideo: Document Retrieval Based Long Video Understanding	Ziyu Ma et.al.	2406.12846	null
2024-06-18	Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts	Haoxiang Wang et.al.	2406.12845	link
2024-06-18	Synergizing Foundation Models and Federated Learning: A Survey	Shenghui Li et.al.	2406.12844	null
2024-06-18	LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation	Seyedarmin Azizi et.al.	2406.12832	link
2024-06-18	Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models?	Pinzhen Chen et.al.	2406.12822	null
2024-06-18	Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones?	Zhe Yang et.al.	2406.12809	null
2024-06-18	Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents	Zehao Wang et.al.	2406.12806	null
2024-06-18	Supporting Human Raters with the Detection of Harmful Content using Large Language Models	Kurt Thomas et.al.	2406.12800	null
2024-06-18	ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools	Team GLM et.al.	2406.12793	null
2024-06-18	UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions	Xunzhi Wang et.al.	2406.12784	null
2024-06-17	LLaNA: Large Language and NeRF Assistant	Andrea Amaduzzi et.al.	2406.11840	null
2024-06-17	mDPO: Conditional Preference Optimization for Multimodal Large Language Models	Fei Wang et.al.	2406.11839	null
2024-06-17	Unveiling Encoder-Free Vision-Language Models	Haiwen Diao et.al.	2406.11832	link
2024-06-17	Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models	Bingqi Ma et.al.	2406.11831	null
2024-06-17	WPO: Enhancing RLHF with Weighted Preference Optimization	Wenxuan Zhou et.al.	2406.11827	link
2024-06-17	Embodied Instruction Following in Unknown Environments	Zhenyu Wu et.al.	2406.11818	null
2024-06-17	VideoLLM-online: Online Video Large Language Model for Streaming Video	Joya Chen et.al.	2406.11816	null
2024-06-17	How Do Large Language Models Acquire Factual Knowledge During Pretraining?	Hoyeon Chang et.al.	2406.11813	null
2024-06-17	RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content	Joao Monteiro et.al.	2406.11811	null
2024-06-17	Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations	Rima Hazra et.al.	2406.11801	link
2024-06-14	Quantifying Variance in Evaluation Benchmarks	Lovish Madaan et.al.	2406.10229	null
2024-06-14	Semantic Membership Inference Attack against Large Language Models	Hamid Mozaffari et.al.	2406.10218	null
2024-06-14	Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs	Rui Yang et.al.	2406.10216	null
2024-06-14	Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs	Abhimanyu Hans et.al.	2406.10209	link
2024-06-14	TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners	Tomas de la Rosa et.al.	2406.10196	null
2024-06-14	Detecting and Evaluating Medical Hallucinations in Large Vision Language Models	Jiawei Chen et.al.	2406.10185	null
2024-06-14	Practical offloading for fine-tuning LLM on commodity GPU via learned subspace projectors	Siyuan Chen et.al.	2406.10181	null
2024-06-14	Datasets for Multilingual Answer Sentence Selection	Matteo Gabburo et.al.	2406.10172	null
2024-06-14	Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models	Carson Denison et.al.	2406.10162	link
2024-06-14	BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack	Yuri Kuratov et.al.	2406.10149	null
2024-06-13	VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding	Muhammad Maaz et.al.	2406.09418	link
2024-06-13	Explore the Limits of Omni-modal Pretraining at Scale	Yiyuan Zhang et.al.	2406.09412	link
2024-06-13	Aligning Vision Models with Human Aesthetics in Retrieval: Benchmarks and Algorithms	Miaosen Zhang et.al.	2406.09397	null
2024-06-13	Too Many Frames, not all Useful:Efficient Strategies for Long-Form Video QA	Jongwoo Park et.al.	2406.09396	null
2024-06-13	Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs	Zijia Zhao et.al.	2406.09367	link
2024-06-13	ElicitationGPT: Text Elicitation Mechanisms via Language Models	Yifan Wu et.al.	2406.09363	null
2024-06-13	DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding	Suwon Shon et.al.	2406.09345	null
2024-06-13	REVS: Unlearning Sensitive Information in Language Models via Rank Editing in the Vocabulary Space	Tomer Ashuach et.al.	2406.09325	null
2024-06-13	Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs	Zhao Xu et.al.	2406.09324	link
2024-06-13	JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models	Delong Ran et.al.	2406.09321	link
2024-06-12	Improving LLMs for Recommendation with Out-Of-Vocabulary Tokens	Ting-Ji Huang et.al.	2406.08477	null
2024-06-13	Real2Code: Reconstruct Articulated Objects via Code Generation	Zhao Mandi et.al.	2406.08474	null
2024-06-12	Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing	Zhangchen Xu et.al.	2406.08464	null
2024-06-12	TasTe: Teaching Large Language Models to Translate through Self-Reflection	Yutong Wang et.al.	2406.08434	link
2024-06-12	Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL	Zijin Hong et.al.	2406.08426	null
2024-06-12	State Soup: In-Context Skill Learning, Retrieval and Mixing	Maciej Pióro et.al.	2406.08423	null
2024-06-12	OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text	Qingyun Li et.al.	2406.08418	link
2024-06-12	Discovering Preference Optimization Algorithms with and for Large Language Models	Chris Lu et.al.	2406.08414	link
2024-06-12	Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference	Christopher Wolters et.al.	2406.08413	null
2024-06-12	Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models	Chun-Yi Kuan et.al.	2406.08402	link
2024-06-11	Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena	Aidar Myrzakhan et.al.	2406.07545	link
2024-06-11	QuickLLaMA: Query-aware Inference Acceleration for Large Language Models	Jingyao Li et.al.	2406.07528	link
2024-06-11	Beyond Model Collapse: Scaling Up with Synthesized Data Requires Reinforcement	Yunzhen Feng et.al.	2406.07515	null
2024-06-11	THaLLE: Text Hyperlocally Augmented Large Language Extension -- Technical Report	KBTG Labs et.al.	2406.07505	null
2024-06-11	Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions	Renjie Pi et.al.	2406.07502	link
2024-06-11	TextGrad: Automatic "Differentiation" via Text	Mert Yuksekgonul et.al.	2406.07496	link
2024-06-12	CADS: A Systematic Literature Review on the Challenges of Abstractive Dialogue Summarization	Frederic Kirstein et.al.	2406.07494	null
2024-06-11	PITCH: Productivity and Mental Well-being Coaching through Daily Conversational Interaction	Adnan Abbas et.al.	2406.07485	null
2024-06-11	Advancing Annotation of Stance in Social Media Posts: A Comparative Analysis of Large Language Models and Crowd Sourcing	Mao Li et.al.	2406.07483	null
2024-06-11	VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs	Zesen Cheng et.al.	2406.07476	link
2024-06-10	Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation	Peize Sun et.al.	2406.06525	link
2024-06-10	UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor	Shivani Upadhyay et.al.	2406.06519	link
2024-06-10	NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative	Asmar Nadeem et.al.	2406.06499	null
2024-06-10	Parallelizing Linear Transformers with the Delta Rule over Sequence Length	Songlin Yang et.al.	2406.06484	null
2024-06-10	Towards a Personal Health Large Language Model	Justin Cosentino et.al.	2406.06474	null
2024-06-10	AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction	Zhen Xing et.al.	2406.06465	null
2024-06-11	Transforming Wearable Data into Health Insights using Large Language Model Agents	Mike A. Merrill et.al.	2406.06464	null
2024-06-11	Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies	Junlin Wang et.al.	2406.06461	null
2024-06-10	Evaluating the Retrieval Component in LLM-Based Question Answering Systems	Ashkan Alinejad et.al.	2406.06458	null
2024-06-10	A Large Language Model Pipeline for Breast Cancer Oncology	Tristen Pool et.al.	2406.06455	null
2024-06-07	3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs	Jianing Yang et.al.	2406.05132	null
2024-06-07	An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models	Xiongtao Zhou et.al.	2406.05130	null
2024-06-07	Towards Semantic Equivalence of Tokenization in Multimodal LLM	Shengqiong Wu et.al.	2406.05127	null
2024-06-07	LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration	Tavor Lipman et.al.	2406.05107	null
2024-06-07	Multi-Head RAG: Solving Multi-Aspect Problems with LLMs	Maciej Besta et.al.	2406.05085	link
2024-06-07	Are Large Language Models More Empathetic than Humans?	Anuradha Welivita et.al.	2406.05063	null
2024-06-07	Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions	Shi-Yu Tian et.al.	2406.05055	null
2024-06-07	Hints-In-Browser: Benchmarking Language Models for Programming Feedback Generation	Nachiket Kotalwar et.al.	2406.05053	null
2024-06-07	Bootstrapping Referring Multi-Object Tracking	Yani Zhang et.al.	2406.05039	null
2024-06-07	Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional SSMs	Shentong Mo et.al.	2406.05038	null
2024-06-06	Verbalized Machine Learning: Revisiting Machine Learning with Language Models	Tim Z. Xiao et.al.	2406.04344	null
2024-06-06	RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation	Jiaming Liu et.al.	2406.04339	null
2024-06-06	Coherent Zero-Shot Visual Instruction Generation	Quynh Phung et.al.	2406.04337	null
2024-06-06	DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs	Lingchen Meng et.al.	2406.04334	null
2024-06-06	PaCE: Parsimonious Concept Engineering for Large Language Models	Jinqi Luo et.al.	2406.04331	link
2024-06-06	Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step	Zhanhao Liang et.al.	2406.04314	null
2024-06-06	Semantically Diverse Language Generation for Uncertainty Estimation in Language Models	Lukas Aichberger et.al.	2406.04306	link
2024-06-06	Quixer: A Quantum Transformer Model	Nikhil Khatri et.al.	2406.04305	null
2024-06-06	Text-to-Drive: Diverse Driving Behavior Synthesis via Large Language Models	Phat Nguyen et.al.	2406.04300	null
2024-06-07	What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages	Nadav Borenstein et.al.	2406.04289	null
2024-06-05	Wings: Learning Multimodal LLMs without Text-only Forgetting	Yi-Kai Zhang et.al.	2406.03496	null
2024-06-06	Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training	Ao Sun et.al.	2406.03488	null
2024-06-05	Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends	Sanjana Ramprasad et.al.	2406.03487	null
2024-06-05	BIPED: Pedagogically Informed Tutoring System for ESL Education	Soonwoo Kwon et.al.	2406.03486	null
2024-06-05	Does your data spark joy? Performance gains from domain upsampling at the end of training	Cody Blakeney et.al.	2406.03476	null
2024-06-05	AD-H: Autonomous Driving with Hierarchical Agents	Zaibin Zhang et.al.	2406.03474	null
2024-06-05	What is the Best Way for ChatGPT to Translate Poetry?	Shanshan Wang et.al.	2406.03450	null
2024-06-05	Pre-trained Large Language Models Use Fourier Features to Compute Addition	Tianyi Zhou et.al.	2406.03445	null
2024-06-05	Cycles of Thought: Measuring LLM Confidence through Stable Explanations	Evan Becker et.al.	2406.03441	null
2024-06-05	Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach	Saehyung Lee et.al.	2406.03411	link
2024-06-04	Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks	Tianyu He et.al.	2406.02550	link
2024-06-04	Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning	Alex Jinpeng Wang et.al.	2406.02547	link
2024-06-04	To Believe or Not to Believe Your LLM	Yasin Abbasi Yadkori et.al.	2406.02543	null
2024-06-04	Loki: Low-Rank Keys for Efficient Sparse Attention	Prajwal Singhania et.al.	2406.02542	null
2024-06-04	Parrot: Multilingual Visual Instruction Tuning	Hai-Long Sun et.al.	2406.02539	null
2024-06-04	Mitigate Position Bias in Large Language Models via Scaling a Single Dimension	Yijiong Yu et.al.	2406.02536	null
2024-06-04	SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices	Ruslan Svirschevski et.al.	2406.02532	null
2024-06-04	Scalable MatMul-free Language Modeling	Rui-Jie Zhu et.al.	2406.02528	link
2024-06-04	CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks	Maciej Besta et.al.	2406.02524	null
2024-06-04	RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots	Soroush Nasiriany et.al.	2406.02523	null
2024-05-31	Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis	Chaoyou Fu et.al.	2405.21075	null
2024-05-31	Grammar-Aligned Decoding	Kanghee Park et.al.	2405.21047	null
2024-05-31	Direct Alignment of Language Models via Quality-Aware Self-Refinement	Runsheng Yu et.al.	2405.21040	null
2024-05-31	Standards for Belief Representations in LLMs	Daniel A. Herrmann et.al.	2405.21030	null
2024-05-31	LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models	Elias Stengel-Eskin et.al.	2405.21028	link
2024-05-31	You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet	Zhen Qin et.al.	2405.21022	null
2024-05-31	Improved Techniques for Optimization-Based Jailbreaking on Large Language Models	Xiaojun Jia et.al.	2405.21018	link
2024-05-31	DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models	Linli Yao et.al.	2405.20985	null
2024-05-31	Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training	Feiteng Fang et.al.	2405.20978	null
2024-05-31	SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales	Tianyang Xu et.al.	2405.20974	link
2024-05-30	MotionLLM: Understanding Human Behaviors from Human Motions and Videos	Ling-Hao Chen et.al.	2405.20340	null
2024-05-30	Visual Perception by Large Language Model's Weights	Feipeng Ma et.al.	2405.20339	null
2024-05-30	OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving	Lening Wang et.al.	2405.20337	link
2024-05-30	Xwin-LM: Strong and Scalable Alignment Practice for LLMs	Bolin Ni et.al.	2405.20335	link
2024-05-31	ParSEL: Parameterized Shape Editing with Language	Aditya Ganeshan et.al.	2405.20319	null
2024-05-30	CausalQuest: Collecting Natural Causal Questions for AI Agents	Roberto Ceraolo et.al.	2405.20318	link
2024-05-30	ANAH: Analytical Annotation of Hallucinations in Large Language Models	Ziwei Ji et.al.	2405.20315	link
2024-05-30	Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone Generation	Guillaume Huguet et.al.	2405.20313	null
2024-05-30	Large Language Models Can Self-Improve At Web Agent Tasks	Ajay Patel et.al.	2405.20309	null
2024-05-30	Group Robust Preference Optimization in Reward-free RLHF	Shyam Sundhar Ramesh et.al.	2405.20304	link
2024-05-29	X-VILA: Cross-Modality Alignment for Large Language Model	Hanrong Ye et.al.	2405.19335	null
2024-05-29	LLMs Meet Multimodal Generation and Editing: A Survey	Yingqing He et.al.	2405.19334	link
2024-05-29	Multi-Modal Generative Embedding Model	Feipeng Ma et.al.	2405.19333	null
2024-05-29	Self-Exploring Language Models: Active Preference Elicitation for Online Alignment	Shenao Zhang et.al.	2405.19332	link
2024-05-29	Normative Modules: A Generative Agent Architecture for Learning Norms that Supports Multi-Agent Cooperation	Atrisha Sarkar et.al.	2405.19328	null
2024-05-30	MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series	Ge Zhang et.al.	2405.19327	null
2024-05-29	Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models	Tianrun Chen et.al.	2405.19326	null
2024-05-29	Nearest Neighbor Speculative Decoding for LLM Generation and Attribution	Minghan Li et.al.	2405.19325	null
2024-05-29	Are Large Language Models Chameleons?	Mingmeng Geng et.al.	2405.19323	null
2024-05-29	Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF	Shicong Cen et.al.	2405.19320	null
2024-05-28	DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention	Lianghui Zhu et.al.	2405.18428	link
2024-05-29	ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention	Bencheng Liao et.al.	2405.18425	link
2024-05-28	Don't Forget to Connect! Improving RAG with Graph-based Reranking	Jialin Dong et.al.	2405.18414	null
2024-05-29	Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning	Yixiao Zhang et.al.	2405.18386	link
2024-05-28	OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning	Pengxiang Li et.al.	2405.18380	link
2024-05-28	LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models	Anthony Sarah et.al.	2405.18377	null
2024-05-28	Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning	Dongjie Chen et.al.	2405.18376	link
2024-05-28	Thai Winograd Schemas: A Benchmark for Thai Commonsense Reasoning	Phakphum Artkaew et.al.	2405.18375	null
2024-05-28	PromptWizard: Task-Aware Agent-driven Prompt Optimization Framework	Eshaan Agarwal et.al.	2405.18369	null
2024-05-28	Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?	Yifan Bai et.al.	2405.18361	null
2024-05-27	Matryoshka Multimodal Models	Mu Cai et.al.	2405.17430	null
2024-05-27	NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models	Chankyu Lee et.al.	2405.17428	null
2024-05-27	Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model	Kuan-Chih Huang et.al.	2405.17427	link
2024-05-27	LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence	Zhuoling Li et.al.	2405.17424	null
2024-05-27	Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation	Jiaming Liu et.al.	2405.17418	null
2024-05-27	THREAD: Thinking Deeper with Recursive Spawning	Philip Schroeder et.al.	2405.17402	null
2024-05-27	MindMerger: Efficient Boosting LLM Reasoning in non-English Languages	Zixian Huang et.al.	2405.17386	null
2024-05-27	Unlocking the Secrets of Linear Complexity Sequence Model from A Unified Perspective	Zhen Qin et.al.	2405.17383	null
2024-05-27	ReMoDetect: Reward Models Recognize Aligned LLM's Generations	Hyunseok Lee et.al.	2405.17382	null
2024-05-27	Various Lengths, Constant Speed: Efficient Language Modeling with Lightning Attention	Zhen Qin et.al.	2405.17381	link
2024-05-24	Scaling Laws for Discriminative Classification in Large Language Models	Dean Wyatte et.al.	2405.15765	null
2024-05-24	Large Language Models Reflect Human Citation Patterns with a Heightened Citation Bias	Andres Algaba et.al.	2405.15739	null
2024-05-24	LM4LV: A Frozen Large Language Model for Low-level Vision Tasks	Boyang Zheng et.al.	2405.15734	null
2024-05-24	Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks	Jerome Sieber et.al.	2405.15731	link
2024-05-24	Optimizing Large Language Models for OpenAPI Code Completion	Bohdan Petryshyn et.al.	2405.15729	null
2024-05-24	Prompt-Aware Adapter: Towards Learning Adaptive Visual Tokens for Multimodal Large Language Models	Yue Zhang et.al.	2405.15684	null
2024-05-24	What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models	Abdelrahman Abdelhamed et.al.	2405.15668	null
2024-05-24	Class Machine Unlearning for Complex Data via Concepts Inference and Data Poisoning	Wenhan Chang et.al.	2405.15662	null
2024-05-24	$$\mathbf{L^2\cdot M = C^2}$$ Large Language Models as Covert Channels... a Systematic Analysis	Simen Gaure et.al.	2405.15652	null
2024-05-24	LLM-based Robot Task Planning with Exceptional Handling for General Purpose Service Robots	Ruoyu Wang et.al.	2405.15646	null
2024-05-23	A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns	Asaf Yehudai et.al.	2405.14863	null
2024-05-23	Bitune: Bidirectional Instruction-Tuning	Dawid J. Kopiczko et.al.	2405.14862	null
2024-05-23	PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression	Vladimir Malinovskii et.al.	2405.14852	null
2024-05-23	HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models	Bernal Jiménez Gutiérrez et.al.	2405.14831	null
2024-05-23	Can LLMs Solve longer Math Word Problems Better?	Xin Xu et.al.	2405.14804	null
2024-05-23	Lessons from the Trenches on Reproducible Evaluation of Language Models	Stella Biderman et.al.	2405.14782	null
2024-05-23	WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models	Peng Wang et.al.	2405.14768	link
2024-05-23	FinRobot: An Open-Source AI Agent Platform for Financial Applications using Large Language Models	Hongyang Yang et.al.	2405.14767	link
2024-05-23	Evaluating Large Language Models for Public Health Classification and Extraction Tasks	Joshua Harris et.al.	2405.14766	null
2024-05-23	Large language models can be zero-shot anomaly detectors for time series?	Sarah Alnegheimish et.al.	2405.14755	null
2024-05-21	Reducing Transformer Key-Value Cache Size with Cross-Layer Attention	William Brandon et.al.	2405.12981	null
2024-05-21	Energy Rank Alignment: Using Preference Optimization to Search Chemical Space at Scale	Shriram Chennakesavalu et.al.	2405.12961	null
2024-05-21	Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models	Zhangyue Yin et.al.	2405.12939	null
2024-05-21	Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs	Bilgehan Sel et.al.	2405.12933	null
2024-05-21	Code-mixed Sentiment and Hate-speech Prediction	Anjali Yadav et.al.	2405.12929	null
2024-05-21	Streamlining Software Reviews: Efficient Predictive Modeling with Minimal Examples	Tim Menzies et.al.	2405.12920	null
2024-05-21	G-DIG: Towards Gradient-based DIverse and hiGh-quality Instruction Data Selection for Machine Translation	Xingyuan Pan et.al.	2405.12915	null
2024-05-21	An Empirical Study and Analysis of Text-to-Image Generation Using Large Language Model-Powered Textual Representation	Zhiyu Tan et.al.	2405.12914	null
2024-05-21	Topic Modelling Case Law Using a Large Language Model and a New Taxonomy for UK Law: AI Insights into Summary Judgment	Holli Sargeant et.al.	2405.12910	link
2024-05-21	Adversarial DPO: Harnessing Harmful Data for Reducing Toxicity with Minimal Impact on Coherence and Evasiveness in Dialogue Agents	San Kim et.al.	2405.12900	null
2024-05-20	Adapting Large Multimodal Models to Distribution Shifts: The Role of In-Context Learning	Guanglin Zhou et.al.	2405.12217	link
2024-05-20	MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark	Hongwei Liu et.al.	2405.12209	link
2024-05-20	Developers' Perceptions on the Impact of ChatGPT in Software Development: A Survey	Thiago S. Vaillant et.al.	2405.12195	null
2024-05-20	CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models	Haoxiang Shi et.al.	2405.12174	null
2024-05-20	Fennec: Fine-grained Language Model Evaluation and Correction Extended through Branching and Bridging	Xiaobo Liang et.al.	2405.12163	link
2024-05-20	Eliciting Problem Specifications via Large Language Models	Robert E. Wray et.al.	2405.12147	null
2024-05-20	MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning	Ting Jiang et.al.	2405.12130	link
2024-05-20	Reindex-Then-Adapt: Improving Large Language Models for Conversational Recommendation	Zhankui He et.al.	2405.12119	null
2024-05-20	Imp: Highly Capable Large Multimodal Models for Mobile Devices	Zhenwei Shao et.al.	2405.12107	link
2024-05-20	DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction	Hao Chen et.al.	2405.12100	null
2024-05-17	A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers	Kaiyu Huang et.al.	2405.10936	link
2024-05-17	The Local Interaction Basis: Identifying Computationally-Relevant and Sparsely Interacting Features in Neural Networks	Lucius Bushnaq et.al.	2405.10928	null
2024-05-17	COGNET-MD, an evaluation framework and dataset for Large Language Model benchmarks in the medical domain	Dimitrios P. Panagoulias et.al.	2405.10893	null
2024-05-17	Application of Artificial Intelligence in Schizophrenia Rehabilitation Management: Systematic Literature Review	Hongyi Yang et.al.	2405.10883	null
2024-05-17	The Future of Large Language Model Pre-training is Federated	Lorenzo Sani et.al.	2405.10853	null
2024-05-17	Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities	Hao Zhou et.al.	2405.10825	null
2024-05-17	ActiveLLM: Large Language Model-based Active Learning for Textual Few-Shot Scenarios	Markus Bayer et.al.	2405.10808	null
2024-05-17	Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix	Jixun Yao et.al.	2405.10786	null
2024-05-17	Empowering Small-Scale Knowledge Graphs: A Strategy of Leveraging General-Purpose Knowledge Graphs for Enriched Embeddings	Albert Sawczyn et.al.	2405.10745	null
2024-05-17	Efficient Multimodal Large Language Models: A Survey	Yizhang Jin et.al.	2405.10739	link
2024-05-16	UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models	Sahel Sharifymoghaddam et.al.	2405.10311	null
2024-05-16	4D Panoptic Scene Graph Generation	Jingkang Yang et.al.	2405.10305	link
2024-05-16	HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models	Rhea Sanjay Sukthanker et.al.	2405.10299	link
2024-05-16	Timeline-based Sentence Decomposition with In-Context Learning for Temporal Fact Extraction	Jianhao Chen et.al.	2405.10288	null
2024-05-16	Revisiting OPRO: The Limitations of Small-Scale LLMs as Optimizers	Tuo Zhang et.al.	2405.10276	null
2024-05-16	Keep It Private: Unsupervised Privatization of Online Text	Calvin Bao et.al.	2405.10260	link
2024-05-16	When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models	Xianzheng Ma et.al.	2405.10255	null
2024-05-16	A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks	Xuanfan Ni et.al.	2405.10251	null
2024-05-16	IntelliExplain: Enhancing Interactive Code Generation through Natural Language Explanations for Non-Professional Programmers	Hao Yan et.al.	2405.10250	null
2024-05-16	CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations	Jiahao Zhao et.al.	2405.10212	null
2024-05-15	Modeling Bilingual Sentence Processing: Evaluating RNN and Transformer Architectures for Cross-Language Structural Priming	Bushi Xiao et.al.	2405.09508	null
2024-05-15	Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts	Donya Rooein et.al.	2405.09482	null
2024-05-15	Tell Me Why: Explainable Public Health Fact-Checking with Large Language Models	Majid Zarharan et.al.	2405.09454	link
2024-05-15	Facilitating Opinion Diversity through Hybrid NLP Approaches	Michiel van der Meer et.al.	2405.09439	null
2024-05-15	MicroPython Testbed for Federated Learning Algorithms	Miroslav Popovic et.al.	2405.09423	null
2024-05-15	Matching domain experts by training from scratch on domain knowledge	Xiaoliang Luo et.al.	2405.09395	null
2024-05-15	PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language Models	Devansh Jain et.al.	2405.09373	null
2024-05-15	Analysis of the Geometric Structure of Neural Networks and Neural ODEs via Morse Functions	Christian Kuehn et.al.	2405.09351	null
2024-05-15	Large Language Model Bias Mitigation from the Perspective of Knowledge Editing	Ruizhe Chen et.al.	2405.09341	null
2024-05-15	Prompting-based Synthetic Data Generation for Few-Shot Question Answering	Maximilian Schmidt et.al.	2405.09335	null
2024-05-14	Towards Enhanced RAC Accessibility: Leveraging Datasets and LLMs	Edison Jair Bejarano Sepulveda et.al.	2405.08792	null
2024-05-14	Incorporating Clinical Guidelines through Adapting Multi-modal Large Language Model for Prostate Cancer PI-RADS Scoring	Tiantian Zhang et.al.	2405.08786	null
2024-05-14	Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Intent Resolution in LLMs	Akhila Yerukola et.al.	2405.08760	link
2024-05-14	Distributed Threat Intelligence at the Edge Devices: A Large Language Model-Driven Approach	Syed Mhamudul Hasan et.al.	2405.08755	null
2024-05-14	Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding	Zhimin Li et.al.	2405.08748	link
2024-05-15	ALMol: Aligned Language-Molecule Translation LLMs through Offline Preference Contrastive Optimisation	Dimitris Gkoumas et.al.	2405.08619	null
2024-05-14	A Comprehensive Survey of Large Language Models and Multimodal Large Language Models in Medicine	Hanguang Xiao et.al.	2405.08603	null
2024-05-15	EVDA: Evolving Deepfake Audio Detection Continual Learning Benchmark	Xiaohui Zhang et.al.	2405.08596	null
2024-05-14	Falcon 7b for Software Mention Detection in Scholarly Documents	AmeerAli Khan et.al.	2405.08514	null
2024-05-14	Archimedes-AUEB at SemEval-2024 Task 5: LLM explains Civil Procedure	Odysseas S. Chlapanis et.al.	2405.08502	null
2024-05-14	MambaOut: Do We Really Need Mamba for Vision?	Weihao Yu et.al.	2405.07992	link
2024-05-13	Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots	Chengyue Wu et.al.	2405.07990	null
2024-05-13	A Generalist Learner for Multifaceted Medical Image Interpretation	Hong-Yu Zhou et.al.	2405.07988	null
2024-05-13	OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition	Qiuchi Xiang et.al.	2405.07966	link
2024-05-13	PyZoBot: A Platform for Conversational Information Extraction and Synthesis from Curated Zotero Reference Libraries through Advanced Retrieval-Augmented Generation	Suad Alshammari et.al.	2405.07963	null
2024-05-13	AgentClinic: a multimodal agent benchmark to evaluate AI in simulated clinical environments	Samuel Schmidgall et.al.	2405.07960	null
2024-05-13	EconLogicQA: A Question-Answering Benchmark for Evaluating Large Language Models in Economic Sequential Reasoning	Yinzhu Quan et.al.	2405.07938	null
2024-05-14	PARDEN, Can You Repeat That? Defending against Jailbreaks via Repetition	Ziyang Zhang et.al.	2405.07932	link
2024-05-13	Can Better Text Semantics in Prompt Tuning Improve VLM Generalization?	Hari Chandana Kuchibhotla et.al.	2405.07921	null
2024-05-13	A Systematic Investigation of Distilling Large Language Models into Cross-Encoders for Passage Re-ranking	Ferdinand Schlatt et.al.	2405.07920	null
2024-05-10	Linearizing Large Language Models	Jean Mercat et.al.	2405.06640	link
2024-05-10	Value Augmented Sampling for Language Model Alignment and Personalization	Seungwook Han et.al.	2405.06639	link
2024-05-10	Characterizing the Accuracy - Efficiency Trade-off of Low-rank Decomposition in Language Models	Chakshu Moar et.al.	2405.06626	null
2024-05-10	Non-Uniform Spatial Alignment Errors in sUAS Imagery From Wide-Area Disasters	Thomas Manzini et.al.	2405.06593	null
2024-05-10	What Can Natural Language Processing Do for Peer Review?	Ilia Kuznetsov et.al.	2405.06563	null
2024-05-10	Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval	Mengjia Niu et.al.	2405.06545	null
2024-05-10	Prompting Large Language Models with Knowledge Graphs for Question Answering Involving Long-tail Facts	Wenyu Huang et.al.	2405.06524	null
2024-05-10	UniDM: A Unified Framework for Data Manipulation with Large Language Models	Yichen Qian et.al.	2405.06510	null
2024-05-10	Storypark: Leveraging Large Language Models to Enhance Children Story Learning Through Child-AI collaboration Storytelling	Lyumanshan Ye et.al.	2405.06495	null
2024-05-10	Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions?	Hunter McNichols et.al.	2405.06414	null
2024-05-09	Natural Language Processing RELIES on Linguistics	Juri Opitz et.al.	2405.05966	null
2024-05-09	OpenBA-V2: Reaching 77.3% High Compression Ratio with Fast Multi-Stage Pruning	Dan Qiao et.al.	2405.05957	link
2024-05-09	Probing Multimodal LLMs as World Models for Driving	Shiva Sreeram et.al.	2405.05956	link
2024-05-09	Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning	Junzhi Chen et.al.	2405.05955	null
2024-05-09	CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts	Jiachen Li et.al.	2405.05949	link
2024-05-09	Trustworthy AI-Generative Content in Intelligent 6G Network: Adversarial, Privacy, and Fairness	Siyuan Li et.al.	2405.05930	null
2024-05-09	Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?	Zorik Gekhman et.al.	2405.05904	null
2024-05-09	Co-driver: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes	Ziang Guo et.al.	2405.05885	null
2024-05-09	FlockGPT: Guiding UAV Flocking with Linguistic Orchestration	Artem Lykov et.al.	2405.05872	null
2024-05-09	Robots Can Feel: LLM-based Framework for Robot Ethical Reasoning	Artem Lykov et.al.	2405.05824	link
2024-05-09	You Only Cache Once: Decoder-Decoder Architectures for Language Models	Yutao Sun et.al.	2405.05254	null
2024-05-08	Open Source Language Models Can Provide Feedback: Evaluating LLMs' Ability to Help Students Using GPT-4-As-A-Judge	Charles Koutcheme et.al.	2405.05253	link
2024-05-09	LLMs with Personalities in Multi-issue Negotiation Games	Sean Noh et.al.	2405.05248	null
2024-05-08	SuFIA: Language-Guided Augmented Dexterity for Robotic Surgical Assistants	Masoud Moghani et.al.	2405.05226	null
2024-05-08	Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers	Jiuxiang Gu et.al.	2405.05219	null
2024-05-08	MIDGARD: Self-Consistency Using Minimum Description Length for Structured Commonsense Reasoning	Inderjeet Nair et.al.	2405.05189	null
2024-05-08	Air Gap: Protecting Privacy-Conscious Conversational Agents	Eugene Bagdasaryan et.al.	2405.05175	null
2024-05-08	XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples	Peiqin Lin et.al.	2405.05116	null
2024-05-08	QFMTS: Generating Query-Focused Summaries over Multi-Table Inputs	Weijia Zhang et.al.	2405.05109	null
2024-05-08	Concerns on Bias in Large Language Models when Creating Synthetic Personae	Helena A. Haxvig et.al.	2405.05080	null
2024-05-07	ChatHuman: Language-driven 3D Human Understanding with Retrieval-Augmented Tool Reasoning	Jing Lin et.al.	2405.04533	null
2024-05-07	QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving	Yujun Lin et.al.	2405.04532	link
2024-05-07	NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Prompts	Shudan Zhang et.al.	2405.04520	null
2024-05-07	xLSTM: Extended Long Short-Term Memory	Maximilian Beck et.al.	2405.04517	null
2024-05-07	A Transformer with Stack Attention	Jiaoda Li et.al.	2405.04515	link
2024-05-08	Unveiling Disparities in Web Task Handling Between Human and Web Agent	Kihoon Son et.al.	2405.04497	null
2024-05-07	Toward In-Context Teaching: Adapting Examples to Students' Misconceptions	Alexis Ross et.al.	2405.04495	null
2024-05-07	The Silicone Ceiling: Auditing GPT's Race and Gender Biases in Hiring	Lena Armstrong et.al.	2405.04412	null
2024-05-07	Vision Mamba: A Comprehensive Survey and Taxonomy	Xiao Liu et.al.	2405.04404	link
2024-05-07	Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks	Georgios Pantazopoulos et.al.	2405.04403	link
2024-05-06	Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs	Muhammad Uzair Khattak et.al.	2405.03690	null
2024-05-06	Large Language Models Reveal Information Operation Goals, Tactics, and Narrative Frames	Keith Burghardt et.al.	2405.03688	null
2024-05-06	Language-Image Models with 3D Understanding	Jang Hyun Cho et.al.	2405.03685	null
2024-05-06	AtomGPT: Atomistic Generative Pre-trained Transformer for Forward and Inverse Materials Design	Kamal Choudhary et.al.	2405.03680	null
2024-05-06	When LLMs Meet Cybersecurity: A Systematic Literature Review	Jie Zhang et.al.	2405.03644	null
2024-05-06	A Controlled Experiment on the Energy Efficiency of the Source Code Generated by Code Llama	Vlad-Andrei Cursaru et.al.	2405.03616	null
2024-05-06	Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment	Abhinav Agarwalla et.al.	2405.03594	null
2024-05-06	AlphaMath Almost Zero: process Supervision without process	Guoxin Chen et.al.	2405.03553	null
2024-05-06	MAmmoTH2: Scaling Instructions from the Web	Xiang Yue et.al.	2405.03548	null
2024-05-06	Position Paper: Leveraging Foundational Models for Black-Box Optimization: Benefits, Challenges, and Future Directions	Xingyou Song et.al.	2405.03547	null
2024-05-03	Leveraging Large Language Models to Enhance Domain Expert Inclusion in Data Science Workflows	Jasmine Y. Shih et.al.	2405.02260	null
2024-05-03	What matters when building vision-language models?	Hugo Laurençon et.al.	2405.02246	null
2024-05-03	REASONS: A benchmark for REtrieval and Automated citationS Of scieNtific Sentences using Public and Proprietary LLMs	Deepa Tilwani et.al.	2405.02228	null
2024-05-03	FairEvalLLM. A Comprehensive Framework for Benchmarking Fairness in Large Language Model Recommender Systems	Yashar Deldjoo et.al.	2405.02219	null
2024-05-03	Automatic Programming: Large Language Models and Beyond	Michael R. Lyu et.al.	2405.02213	null
2024-05-03	Assessing and Verifying Task Utility in LLM-Powered Applications	Negar Arabzadeh et.al.	2405.02178	null
2024-05-03	The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates	Giuseppe Russo Latona et.al.	2405.02150	null
2024-05-03	MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain	Chao Jiang et.al.	2405.02144	null
2024-05-03	Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection	Guillem Ramírez et.al.	2405.02134	null
2024-05-06	Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets	Xuelong Geng et.al.	2405.02132	null
2024-05-02	Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks	Murtaza Dalal et.al.	2405.01534	null
2024-05-02	OmniDrive: A Holistic LLM-Agent Framework for Autonomous Driving with 3D Perception, Reasoning and Planning	Shihao Wang et.al.	2405.01533	null
2024-05-02	FLAME: Factuality-Aware Alignment for Large Language Models	Sheng-Chieh Lin et.al.	2405.01525	null
2024-05-02	Transformer-Aided Semantic Communications	Matin Mortaheb et.al.	2405.01521	null
2024-05-02	Analyzing the Role of Semantic Representations in the Era of Large Language Models	Zhijing Jin et.al.	2405.01502	link
2024-05-02	Supporting Business Document Workflows via Collection-Centric Information Foraging with Large Language Models	Raymond Fok et.al.	2405.01501	null
2024-05-02	Controllable Text Generation in the Instruction-Tuning Era	Dhananjay Ashok et.al.	2405.01490	null
2024-05-02	NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment	Gerald Shen et.al.	2405.01481	link
2024-05-02	A Systematic Literature Review on Large Language Models for Automated Program Repair	Quanjun Zhang et.al.	2405.01466	null
2024-05-02	Natural Language to Verilog: Design of a Recurrent Spiking Neural Network using Large Language Models and ChatGPT	Paola Vitolo et.al.	2405.01419	null
2024-05-01	Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3	Junsang Yoon et.al.	2405.00664	null
2024-05-01	HalluVault: A Novel Logic Programming-aided Metamorphic Testing Framework for Detecting Fact-Conflicting Hallucinations in Large Language Models	Ningke Li et.al.	2405.00648	null
2024-05-01	When Quantization Affects Confidence of Large Language Models?	Irina Proskurina et.al.	2405.00632	null
2024-05-01	"I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust	Sunnie S. Y. Kim et.al.	2405.00623	null
2024-05-01	Addressing Topic Granularity and Hallucination in Large Language Models for Topic Modelling	Yida Mu et.al.	2405.00611	null
2024-05-01	Investigating Automatic Scoring and Feedback using Large Language Models	Gloria Ashiya Katuka et.al.	2405.00602	null
2024-05-01	Are Models Biased on Text without Gender-related Language?	Catarina G Belém et.al.	2405.00588	link
2024-05-01	The Real, the Better: Aligning Large Language Models with Online Human Behaviors	Guanying Jiang et.al.	2405.00578	null
2024-05-01	EALD-MLLM: Emotion Analysis in Long-sequential and De-identity videos with Multi-modal Large Language Model	Deng Li et.al.	2405.00574	null
2024-05-01	NumLLM: Numeric-Sensitive Large Language Model for Chinese Finance	Huan-Yi Su et.al.	2405.00566	null
2024-04-30	Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation	Yunhao Ge et.al.	2404.19752	null
2024-04-30	PrivComp-KG : Leveraging Knowledge Graph and Large Language Models for Privacy Policy Compliance Verification	Leon Garza et.al.	2404.19744	null
2024-04-30	Better & Faster Large Language Models via Multi-token Prediction	Fabian Gloeckle et.al.	2404.19737	null
2024-04-30	A Framework for Leveraging Human Computation Gaming to Enhance Knowledge Graphs for Accuracy Critical Generative AI Applications	Steph Buongiorno et.al.	2404.19729	null
2024-04-30	PANGeA: Procedural Artificial Narrative using Generative AI for Turn-Based Video Games	Steph Buongiorno et.al.	2404.19721	null
2024-04-30	Assessing LLMs in Malicious Code Deobfuscation of Real-world Malware Campaigns	Constantinos Patsakis et.al.	2404.19715	null
2024-04-30	Automated Generation of High-Quality Medical Simulation Scenarios Through Integration of Semi-Structured Data and Large Language Models	Scott Sumpter et.al.	2404.19713	null
2024-04-30	When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively	Tiziano Labruna et.al.	2404.19705	null
2024-04-30	Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners	Chun Feng et.al.	2404.19696	null
2024-04-30	On Training a Neural Network to Explain Binaries	Alexander Interrante-Grant et.al.	2404.19631	null
2024-04-29	Hallucination of Multimodal Large Language Models: A Survey	Zechen Bai et.al.	2404.18930	link
2024-04-29	DPO Meets PPO: Reinforced Token Optimization for RLHF	Han Zhong et.al.	2404.18922	null
2024-04-29	TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation	Junhao Cheng et.al.	2404.18919	null
2024-04-29	Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting	Fangcheng Liu et.al.	2404.18911	null
2024-04-29	Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking	Hong Jin Kang et.al.	2404.18881	link
2024-04-29	More RLHF, More Trust? On The Impact of Human Preference Alignment On Language Model Trustworthiness	Aaron J. Li et.al.	2404.18870	link
2024-04-29	Truth-value judgment in language models: belief directions are context sensitive	Stefan F. Schouten et.al.	2404.18865	null
2024-04-29	Performance-Aligned LLMs for Generating Fast Code	Daniel Nichols et.al.	2404.18864	null
2024-04-29	A Survey on Vision Mamba: Models, Applications and Challenges	Rui Xu et.al.	2404.18861	link
2024-04-29	VERT: Verified Equivalent Rust Transpilation with Few-Shot Learning	Aidan Z. H. Yang et.al.	2404.18852	null
2024-04-26	Probabilistic Inference in Language Models via Twisted Sequential Monte Carlo	Stephen Zhao et.al.	2404.17546	null
2024-04-26	Large Language Model Agent as a Mechanical Designer	Yayati Jadhav et.al.	2404.17525	null
2024-04-29	On the Use of Large Language Models to Generate Capability Ontologies	Luis Miguel Vieira da Silva et.al.	2404.17524	null
2024-04-26	Enhancing Legal Compliance and Regulation Analysis with Large Language Models	Shabnam Hassani et.al.	2404.17522	null
2024-04-26	A Comprehensive Evaluation on Event Reasoning of Large Language Models	Zhengwei Tao et.al.	2404.17513	link
2024-04-26	Ruffle&Riley: Insights from Designing and Evaluating a Large Language Model-Based Conversational Tutoring System	Robin Schmucker et.al.	2404.17460	null
2024-04-26	"ChatGPT Is Here to Help, Not to Replace Anybody" -- An Evaluation of Students' Opinions On Integrating ChatGPT In CS Courses	Bruno Pereira Cipriano et.al.	2404.17443	null
2024-04-26	InspectorRAGet: An Introspection Platform for RAG Evaluation	Kshitij Fadnis et.al.	2404.17347	null
2024-04-26	When to Trust LLMs: Aligning Confidence with Response Quality	Shuchang Tao et.al.	2404.17287	null
2024-04-26	Reinforcement Retrieval Leveraging Fine-grained Feedback for Fact Checking News Claims with Black-Box LLM	Xuan Zhang et.al.	2404.17283	link
2024-04-25	Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials	Ye Fang et.al.	2404.16829	null
2024-04-25	How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites	Zhe Chen et.al.	2404.16821	link
2024-04-25	IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs on Indic Languages	Harman Singh et.al.	2404.16816	null
2024-04-26	Make Your LLM Fully Utilize the Context	Shengnan An et.al.	2404.16811	link
2024-04-25	Improving Diversity of Commonsense Generation by Large Language Models via In-Context Learning	Tianhui Zhang et.al.	2404.16807	null
2024-04-25	Weak-to-Strong Extrapolation Expedites Alignment	Chujie Zheng et.al.	2404.16792	link
2024-04-25	SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension	Bohao Li et.al.	2404.16790	link
2024-04-25	Continual Learning of Large Language Models: A Comprehensive Survey	Haizhou Shi et.al.	2404.16789	link
2024-04-25	Prefix Text as a Yarn: Eliciting Non-English Alignment in Foundation Language Model	Runzhe Zhan et.al.	2404.16766	null
2024-04-25	RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis	Xiaoman Zhang et.al.	2404.16754	null
2024-04-24	Hybrid LLM/Rule-based Approaches to Business Insights Generation from Structured Data	Aliaksei Vertsel et.al.	2404.15604	null
2024-04-24	ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction	Henry Peng Zou et.al.	2404.15592	link
2024-04-24	Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations?	Hossein Salami et.al.	2404.15578	null
2024-04-23	PRISM: Patient Records Interpretation for Semantic Clinical Trial Matching using Large Language Models	Shashi Kant Gupta et.al.	2404.15549	null
2024-04-23	Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models	Mihir Parmar et.al.	2404.15522	link
2024-04-23	Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval	Young Kyun Jang et.al.	2404.15516	null
2024-04-23	ToM-LM: Delegating Theory Of Mind Reasoning to External Symbolic Executors in Large Language Models	Weizhi Tang et.al.	2404.15515	null
2024-04-23	IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents	Jean-Philippe Corbeil et.al.	2404.15488	link
2024-04-23	Large Language Models Spot Phishing Emails with Surprising Accuracy: A Comparative Analysis of Performance	Het Patel et.al.	2404.15485	null
2024-04-23	Can Large Language Models Learn the Physics of Metamaterials? An Empirical Study with ChatGPT	Darui Lu et.al.	2404.15458	null
2024-04-23	Aligning LLM Agents by Learning Latent Preference from User Edits	Ge Gao et.al.	2404.15269	null
2024-04-23	XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts	Yifeng Ding et.al.	2404.15247	link
2024-04-23	Revisiting Unnaturalness for Automated Program Repair in the Era of Large Language Models	Aidan Z. H. Yang et.al.	2404.15236	null
2024-04-23	Re-Thinking Inverse Graphics With Large Language Models	Peter Kulits et.al.	2404.15228	null
2024-04-23	Setting up the Data Printer with Improved English to Ukrainian Machine Translation	Yurii Paniv et.al.	2404.15196	null
2024-04-23	Regressive Side Effects of Training Language Models to Mimic Student Misconceptions	Shashank Sonkar et.al.	2404.15156	null
2024-04-23	Bias patterns in the application of LLMs for clinical decision support: A comprehensive study	Raphael Poulain et.al.	2404.15149	null
2024-04-23	Rethinking LLM Memorization through the Lens of Adversarial Compression	Avi Schwarzschild et.al.	2404.15146	null
2024-04-23	Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation	Xun Wu et.al.	2404.15100	null
2024-04-23	A Short Review for Ontology Learning from Text: Stride from Shallow Learning, Deep Learning to Large Language Models Trend	Rick Du et.al.	2404.14991	null
2024-04-22	AutoAD III: The Prequel -- Back to the Pixels	Tengda Han et.al.	2404.14412	null
2024-04-22	SpaceByte: Towards Deleting Tokenization from Large Language Modeling	Kevin Slagle et.al.	2404.14408	link
2024-04-22	RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?	Adrian de Wynter et.al.	2404.14397	null
2024-04-22	A Survey on Self-Evolution of Large Language Models	Zhengwei Tao et.al.	2404.14387	null
2024-04-22	Beyond Scaling: Predicting Patent Approval with Domain-specific Fine-grained Claim Dependency Graph	Xiaochen Kev Gao et.al.	2404.14372	link
2024-04-23	Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data	Fahim Tajwar et.al.	2404.14367	link
2024-04-22	Better Synthetic Data by Retrieving and Transforming Existing Datasets	Saumya Gandhi et.al.	2404.14361	link
2024-04-22	Rethinking Legal Compliance Automation: Opportunities with Large Language Models	Shabnam Hassani et.al.	2404.14356	null
2024-04-22	Automated Long Answer Grading with RiceChem Dataset	Shashank Sonkar et.al.	2404.14316	null
2024-04-22	Explaining Arguments' Strength: Unveiling the Role of Attacks and Supports (Technical Report)	Xiang Yin et.al.	2404.14304	null
2024-04-19	MoVA: Adapting Mixture of Vision Experts to Multimodal Context	Zhuofan Zong et.al.	2404.13046	link
2024-04-19	Unified Scene Representation and Reconstruction for 3D Large Language Models	Tao Chu et.al.	2404.13044	null
2024-04-19	Data Alignment for Zero-Shot Concept Generation in Dermatology AI	Soham Gadgil et.al.	2404.13043	null
2024-04-19	Sample Design Engineering: An Empirical Study of What Makes Good Downstream Fine-Tuning Samples for LLMs	Biyang Guo et.al.	2404.13033	link
2024-04-19	When Life gives you LLMs, make LLM-ADE: Large Language Models with Adaptive Data Engineering	Stephen Choi et.al.	2404.13028	null
2024-04-19	Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models	Chuofan Ma et.al.	2404.13013	null
2024-04-19	Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs	Clemencia Siro et.al.	2404.12994	link
2024-04-19	FineRec:Exploring Fine-grained Sequential Recommendation	Xiaokun Zhang et.al.	2404.12975	null
2024-04-19	Eyes Can Deceive: Benchmarking Counterfactual Reasoning Abilities of Multi-modal Large Language Models	Yian Li et.al.	2404.12966	null
2024-04-19	Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction	Qinyuan Wu et.al.	2404.12957	null
2024-04-18	BLINK: Multimodal Large Language Models Can See but Not Perceive	Xingyu Fu et.al.	2404.12390	null
2024-04-18	MedThink: Explaining Medical Visual Question Answering via Multimodal Decision-Making Rationale	Xiaotang Gai et.al.	2404.12372	null
2024-04-18	When LLMs are Unfit Use FastFit: Fast and Effective Text Classification with Many Classes	Asaf Yehudai et.al.	2404.12365	null
2024-04-19	Towards a Foundation Model for Partial Differential Equations: Multi-Operator Learning and Extrapolation	Jingmin Sun et.al.	2404.12355	link
2024-04-18	V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning	Hang Hua et.al.	2404.12353	null
2024-04-18	Large Language Models in Targeted Sentiment Analysis	Nicolay Rusnachenko et.al.	2404.12342	link
2024-04-18	Normative Requirements Operationalization with Large Language Models	Nick Feng et.al.	2404.12335	null
2024-04-18	Large Language Models for Synthetic Participatory Planning of Shared Automated Electric Mobility Systems	Jiangbo Yu et.al.	2404.12317	null
2024-04-18	Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair	Yusuke Sakai et.al.	2404.12299	null
2024-04-18	Augmenting emotion features in irony detection with Large language modeling	Yucheng Lin et.al.	2404.12291	null
2024-04-17	A Deep Dive into Large Language Models for Automated Bug Localization and Repair	Soneya Binta Hossain et.al.	2404.11595	null
2024-04-17	LLMTune: Accelerate Database Knob Tuning with Large Language Models	Xinmei Huang et.al.	2404.11581	null
2024-04-17	MoA: Mixture-of-Attention for Subject-Context Disentanglement in Personalized Image Generation	Kuan-Chieh et.al.	2404.11565	null
2024-04-17	Quantifying Multilingual Performance of Large Language Models Across Languages	Zihao Li et.al.	2404.11553	null
2024-04-17	Pack of LLMs: Model Fusion at Test-Time via Perplexity Optimization	Costas Mavromatis et.al.	2404.11531	null
2024-04-17	Embedding Privacy in Computational Social Science and Artificial Intelligence Research	Keenan Jones et.al.	2404.11515	null
2024-04-17	Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models	Yushuo Chen et.al.	2404.11502	link
2024-04-17	Paraphrase and Solve: Exploring and Exploiting the Impact of Surface Form on Mathematical Reasoning in Large Language Models	Yue Zhou et.al.	2404.11500	link
2024-04-18	Octopus v3: Technical Report for On-device Sub-billion Multimodal AI Agent	Wei Chen et.al.	2404.11459	null
2024-04-17	Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models	Sunhao Dai et.al.	2404.11457	link
2024-04-16	Nearly Optimal Algorithms for Contextual Dueling Bandits from Adversarial Feedback	Qiwei Di et.al.	2404.10776	null
2024-04-16	Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification	Yu-Yang Li et.al.	2404.10757	null
2024-04-16	Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study	Shusheng Xu et.al.	2404.10719	null
2024-04-16	An empirical study on code review activity prediction in practice	Doriane Olewicki et.al.	2404.10703	null
2024-04-16	Automating REST API Postman Test Cases Using LLM	S Deepika Sri et.al.	2404.10678	null
2024-04-16	Self-playing Adversarial Language Game Enhances LLM Reasoning	Pengyu Cheng et.al.	2404.10642	link
2024-04-16	HLAT: High-quality Large Language Model Pre-trained on AWS Trainium	Haozheng Fan et.al.	2404.10630	null
2024-04-16	Private Attribute Inference from Images with Vision-Language Models	Batuhan Tömekçe et.al.	2404.10618	null
2024-04-16	Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases	Yanze Li et.al.	2404.10595	null
2024-04-16	Construction of Domain-specified Japanese Large Language Model for Finance through Continual Pre-training	Masanori Hirano et.al.	2404.10555	null
2024-04-15	KG-CTG: Citation Generation through Knowledge Graph-guided Large Language Models	Avinash Anand et.al.	2404.09763	null
2024-04-15	Resilience of Large Language Models for Noisy Instructions	Bin Wang et.al.	2404.09754	null
2024-04-15	Personalized Collaborative Fine-Tuning for On-Device Large Language Models	Nicolas Wagner et.al.	2404.09753	null
2024-04-15	Quantization of Large Language Models with an Overdetermined Basis	Daniil Merkulov et.al.	2404.09737	null
2024-04-15	Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model	Hyunsoo Cho et.al.	2404.09717	null
2024-04-15	Enhancing Robot Explanation Capabilities through Vision-Language Models: a Preliminary Study by Interpreting Visual Inputs for Improved Human-Robot Interaction	David Sobrín-Hidalgo et.al.	2404.09705	null
2024-04-15	Generative AI for Game Theory-based Mobile Networking	Long He et.al.	2404.09699	null
2024-04-15	Are Large Language Models Reliable Argument Quality Annotators?	Nailia Mirzakhmedova et.al.	2404.09696	null
2024-04-15	LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models	Guangyan Li et.al.	2404.09695	null
2024-04-15	Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation	Juhwan Choi et.al.	2404.09682	null
2024-04-15	Do LLMs Understand Visual Anomalies? Uncovering LLM Capabilities in Zero-shot Anomaly Detection	Jiaqi Zhu et.al.	2404.09654	null
2024-04-15	Bridging Vision and Language Spaces with Assignment Prediction	Jungin Park et.al.	2404.09632	link
2024-04-12	Enhancing Visual Question Answering through Question-Driven Image Captions as Prompts	Övgü Özdemir et.al.	2404.08589	link
2024-04-12	Enhancing Autonomous Vehicle Training with Language Model Integration and Critical Scenario Generation	Hanlin Tian et.al.	2404.08570	null
2024-04-12	RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs	Shreyas Chaudhari et.al.	2404.08555	null
2024-04-12	Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward	Xuan Xie et.al.	2404.08517	null
2024-04-12	Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction	Haoran Qiu et.al.	2404.08509	link
2024-04-12	LaSagnA: Language-based Segmentation Assistant for Complex Queries	Cong Wei et.al.	2404.08506	link
2024-04-12	Strategic Interactions between Large Language Models-based Agents in Beauty Contests	Siting Lu et.al.	2404.08492	null
2024-04-12	Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian	Stefano De Paoli et.al.	2404.08488	null
2024-04-12	Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task	Hassan Ali et.al.	2404.08424	null
2024-04-12	AdapterSwap: Continuous Training of LLMs with Data Removal and Access-Control Guarantees	William Fleshman et.al.	2404.08417	null
2024-04-11	OpenBias: Open-set Bias Detection in Text-to-Image Generative Models	Moreno D'Incà et.al.	2404.07990	null
2024-04-11	Manipulating Large Language Models to Increase Product Visibility	Aounon Kumar et.al.	2404.07981	link
2024-04-11	LLoCO: Learning Long Contexts Offline	Sijun Tan et.al.	2404.07979	link
2024-04-11	Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models	Haotian Zhang et.al.	2404.07973	null
2024-04-11	Leveraging Large Language Models (LLMs) to Support Collaborative Human-AI Online Risk Data Annotation	Jinkyung Park et.al.	2404.07926	null
2024-04-11	LaVy: Vietnamese Multimodal Large Language Model	Chi Tran et.al.	2404.07922	null
2024-04-11	AmpleGCG: Learning a Universal and Transferable Generative Model of Adversarial Suffixes for Jailbreaking Both Open and Closed LLMs	Zeyi Liao et.al.	2404.07921	link
2024-04-11	DesignQA: A Multimodal Benchmark for Evaluating Large Language Models' Understanding of Engineering Documentation	Anna C. Doris et.al.	2404.07917	link
2024-04-11	HGRN2: Gated Linear RNNs with State Expansion	Zhen Qin et.al.	2404.07904	link
2024-04-11	High-Dimension Human Value Representation in Large Language Models	Samuel Cahyawijaya et.al.	2404.07900	null
2024-04-10	UMBRAE: Unified Multimodal Decoding of Brain Signals	Weihao Xia et.al.	2404.07202	null
2024-04-10	Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention	Tsendsuren Munkhdalai et.al.	2404.07143	null
2024-04-10	Continuous Language Model Interpolation for Dynamic and Controllable Text Generation	Sara Kangaslahti et.al.	2404.07117	link
2024-04-11	From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications	Yongqiang Ma et.al.	2404.07108	null
2024-04-10	3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion	Yixuan Li et.al.	2404.07106	null
2024-04-10	Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs	Bowen Jin et.al.	2404.07103	null
2024-04-10	Dynamic Generation of Personalities with Large Language Models	Jianzhi Liu et.al.	2404.07084	null
2024-04-10	VLLMs Provide Better Context for Emotion Understanding Through Common Sense Reasoning	Alexandros Xenos et.al.	2404.07078	link
2024-04-10	Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?	Mingyu Jin et.al.	2404.07066	link
2024-04-10	Groundedness in Retrieval-augmented Long-form Generation: An Empirical Study	Alessandro Stolfo et.al.	2404.07060	null
2024-04-09	Pitfalls of Conversational LLMs on News Debiasing	Ipek Baris Schlicht et.al.	2404.06488	null
2024-04-10	Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks	Chonghua Wang et.al.	2404.06480	link
2024-04-09	Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of Large Language Models	Zihan Fang et.al.	2404.06448	null
2024-04-09	Large Language Models to the Rescue: Deadlock Resolution in Multi-Robot Systems	Kunal Garg et.al.	2404.06413	null
2024-04-09	AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM Agents	Luca Gioacchini et.al.	2404.06411	link
2024-04-09	Take a Look at it! Rethinking How to Evaluate Language Model Jailbreak	Hongyu Cai et.al.	2404.06407	link
2024-04-09	Apprentices to Research Assistants: Advancing Research with Large Language Models	M. Namvarpour et.al.	2404.06404	null
2024-04-09	MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies	Shengding Hu et.al.	2404.06395	link
2024-04-10	MuPT: A Generative Symbolic Music Pretrained Transformer	Xingwei Qu et.al.	2404.06393	null
2024-04-09	Latent Distance Guided Alignment Training for Large Language Models	Haotian Luo et.al.	2404.06390	null
2024-04-08	MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding	Bo He et.al.	2404.05726	null
2024-04-08	Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs	Keen You et.al.	2404.05719	null
2024-04-08	Evaluating Mathematical Reasoning Beyond Accuracy	Shijie Xia et.al.	2404.05692	link
2024-04-08	Retrieval-Augmented Open-Vocabulary Object Detection	Jooyeon Kim et.al.	2404.05687	link
2024-04-08	MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation	Kunpeng Song et.al.	2404.05674	null
2024-04-08	CoReS: Orchestrating the Dance of Reasoning and Segmentation	Xiaoyi Bao et.al.	2404.05673	null
2024-04-09	Fighting crime with Transformers: Empirical analysis of address parsing methods in payment data	Haitham Hammami et.al.	2404.05632	link
2024-04-08	LTNER: Large Language Model Tagging for Named Entity Recognition with Contextualized Entity Marking	Faren Yan et.al.	2404.05624	null
2024-04-08	MedExpQA: Multilingual Benchmarking of Large Language Models for Medical Question Answering	Iñigo Alonso et.al.	2404.05590	null
2024-04-08	360°REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System	Shen Gao et.al.	2404.05569	null
2024-04-05	Physical Property Understanding from Language-Embedded Feature Fields	Albert J. Zhai et.al.	2404.04242	null
2024-04-05	Cleared for Takeoff? Compositional & Conditional Reasoning may be the Achilles Heel to (Flight-Booking) Language Agents	Harsh Kohli et.al.	2404.04237	null
2024-04-05	Social Skill Training with Large Language Models	Diyi Yang et.al.	2404.04204	null
2024-04-05	Ambiguity in the use of SIR models to fit epidemic incidence data	B Shayak et.al.	2404.04181	null
2024-04-05	Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model	Xinrun Du et.al.	2404.04167	null
2024-04-05	Large language models as oracles for instantiating ontologies with domain-specific knowledge	Giovanni Ciatto et.al.	2404.04108	link
2024-04-05	Robust Preference Optimization with Provable Noise Tolerance for LLMs	Xize Liang et.al.	2404.04102	null
2024-04-05	Assessing the quality of information extraction	Filip Seitl et.al.	2404.04068	null
2024-04-05	CLUE: A Clinical Language Understanding Evaluation for LLMs	Amin Dada et.al.	2404.04067	null
2024-04-05	VoicePilot: Harnessing LLMs as Speech Interfaces for Physically Assistive Robots	Akhil Padmanabha et.al.	2404.04066	null
2024-04-04	AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent	Hanyu Lai et.al.	2404.03648	link
2024-04-04	Capabilities of Large Language Models in Control Engineering: A Benchmark Study on GPT-4, Claude 3 Opus, and Gemini 1.0 Ultra	Darioush Kevian et.al.	2404.03647	null
2024-04-04	Training LLMs over Neurally Compressed Text	Brian Lester et.al.	2404.03626	null
2024-04-04	Unveiling LLMs: The Evolution of Latent Representations in a Temporal Knowledge Graph	Marco Bronzini et.al.	2404.03623	null
2024-04-04	Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models	Wenshan Wu et.al.	2404.03622	null
2024-04-04	DeViDe: Faceted medical knowledge for improved medical vision-language pre-training	Haozhe Luo et.al.	2404.03618	null
2024-04-04	Sailor: Open Language Models for South-East Asia	Longxu Dou et.al.	2404.03608	link
2024-04-04	Evaluating LLMs at Detecting Errors in LLM Responses	Ryo Kamoi et.al.	2404.03602	link
2024-04-04	Intent Detection and Entity Extraction from BioMedical Literature	Ankan Mullick et.al.	2404.03598	link
2024-04-04	SemGrasp: Semantic Grasp Generation via Language Aligned Discretization	Kailin Li et.al.	2404.03590	null
2024-04-03	ALOHa: A New Measure for Hallucination in Captioning Models	Suzanne Petryk et.al.	2404.02904	null
2024-04-03	MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment	Duygu Ceylan et.al.	2404.02899	null
2024-04-03	ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline	Yifan Xu et.al.	2404.02893	null
2024-04-03	Linear Attention Sequence Parallelism	Weigao Sun et.al.	2404.02882	link
2024-04-03	Integrating Explanations in Learning LTL Specifications from Demonstrations	Ashutosh Gupta et.al.	2404.02872	null
2024-04-03	Toward Inference-optimal Mixture-of-Expert Large Language Models	Longfei Yun et.al.	2404.02852	null
2024-04-03	I-Design: Personalized LLM Interior Designer	Ata Çelen et.al.	2404.02838	null
2024-04-03	Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models	Wanyun Cui et.al.	2404.02837	null
2024-04-03	Retrieving Examples from Memory for Retrieval Augmented Neural Machine Translation: A Systematic Comparison	Maxime Bouthors et.al.	2404.02835	null
2024-04-03	Empowering Biomedical Discovery with AI Agents	Shanghua Gao et.al.	2404.02831	null
2024-04-02	Topic-based Watermarks for LLM-Generated Text	Alexander Nemecek et.al.	2404.02138	null
2024-04-02	Exploring Automated Distractor Generation for Math Multiple-choice Questions via Large Language Models	Wanyong Feng et.al.	2404.02124	null
2024-04-02	CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems	Sara Rosenthal et.al.	2404.02103	link
2024-04-02	Advancing LLM Reasoning Generalists with Preference Trees	Lifan Yuan et.al.	2404.02078	link
2024-04-02	SPMamba: State-space model is all you need in speech separation	Kai Li et.al.	2404.02063	link
2024-04-02	Digital Forgetting in Large Language Models: A Survey of Unlearning Methods	Alberto Blanco-Justicia et.al.	2404.02062	null
2024-04-02	Long-context LLMs Struggle with Long In-context Learning	Tianle Li et.al.	2404.02060	link
2024-04-02	Deconstructing In-Context Learning: Understanding Prompts via Corruption	Namrata Shivagunde et.al.	2404.02054	link
2024-04-02	A Survey on Large Language Model-Based Game Agents	Sihao Hu et.al.	2404.02039	link
2024-04-02	MultiParaDetox: Extending Text Detoxification with Parallel Data to New Languages	Daryna Dementieva et.al.	2404.02037	null
2024-03-29	Gecko: Versatile Text Embeddings Distilled from Large Language Models	Jinhyuk Lee et.al.	2403.20327	null
2024-03-29	Convolutional Prompting meets Language Models for Continual Learning	Anurag Roy et.al.	2403.20317	null
2024-03-29	Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference	Jovan Stojkovic et.al.	2403.20306	null
2024-03-29	Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain	Burcu Sayin et.al.	2403.20288	null
2024-03-29	LUQ: Long-text Uncertainty Quantification for LLMs	Caiqi Zhang et.al.	2403.20279	null
2024-04-01	Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want	Weifeng Lin et.al.	2403.20271	link
2024-03-29	Latxa: An Open Language Model and Evaluation Suite for Basque	Julen Etxaniz et.al.	2403.20266	link
2024-03-29	ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models	Thibaut Thonet et.al.	2403.20262	null
2024-03-29	Using LLMs to Model the Beliefs and Preferences of Targeted Populations	Keiichi Namikoshi et.al.	2403.20252	null
2024-03-29	Unleashing the Potential of Large Language Models for Predictive Tabular Tasks in Data Science	Yazheng Yang et.al.	2403.20208	null
2024-03-28	InterDreamer: Zero-Shot Text to 3D Dynamic Human-Object Interaction	Sirui Xu et.al.	2403.19652	null
2024-03-28	MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions	Kai Zhang et.al.	2403.19651	null
2024-03-28	Change-Agent: Towards Interactive Comprehensive Change Interpretation and Analysis from Change Detection and Change Captioning	Chenyang Liu et.al.	2403.19646	link
2024-03-28	Retrieval-Enhanced Knowledge Editing for Multi-Hop Question Answering in Language Models	Yucheng Shi et.al.	2403.19631	null
2024-03-29	Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers	Pingcheng Dong et.al.	2403.19591	link
2024-03-28	WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models	Piotr Molenda et.al.	2403.19548	null
2024-03-28	LLMs as Academic Reading Companions: Extending HCI Through Synthetic Personae	Celia Chen et.al.	2403.19506	null
2024-03-28	Evolving Assembly Code in an Adversarial Environment	Irina Maliukov et.al.	2403.19489	null
2024-03-28	JDocQA: Japanese Document Question Answering Dataset for Generative Language Models	Eri Onami et.al.	2403.19454	null
2024-03-28	Mixed Preference Optimization: Reinforcement Learning with Data Selection and Better Reference Model	Qi Gou et.al.	2403.19443	null
2024-03-27	Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models	Yanwei Li et.al.	2403.18814	link
2024-03-27	Long-form factuality in large language models	Jerry Wei et.al.	2403.18802	link
2024-03-27	3P-LLM: Probabilistic Path Planning using Large Language Model for Autonomous Robot Navigation	Ehsan Latif et.al.	2403.18778	null
2024-03-27	CheckEval: Robust Evaluation Framework using Large Language Model via Checklist	Yukyung Lee et.al.	2403.18771	null
2024-03-27	MLDT: Multi-Level Decomposition for Complex Long-Horizon Robotic Task Planning with Open-Source Large Language Model	Yike Wu et.al.	2403.18760	null
2024-03-27	Understanding the Learning Dynamics of Alignment with Human Feedback	Shawn Im et.al.	2403.18742	null
2024-03-27	PhysicsAssistant: An LLM-Powered Interactive Learning Robot for Physics Lab Investigations	Ehsan Latif et.al.	2403.18721	null
2024-03-27	NL-ITI: Optimizing Probing and Intervention for Improvement of ITI Method	Jakub Hoscilowicz et.al.	2403.18680	link
2024-03-27	An Exploratory Study on Upper-Level Computing Students' Use of Large Language Models as Tools in a Semester-Long Project	Ben Arie Tanay et.al.	2403.18679	null
2024-03-27	SDSAT: Accelerating LLM Inference through Speculative Decoding with Semantic Adaptive Tokens	Chengbo Liu et.al.	2403.18647	null
2024-03-26	Towards Explaining Hypercomplex Neural Networks	Eleonora Lopez et.al.	2403.17929	null
2024-03-26	MAGIS: LLM-Based Multi-Agent Framework for GitHub Issue Resolution	Wei Tao et.al.	2403.17927	null
2024-03-26	LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning	Rui Pan et.al.	2403.17919	null
2024-03-26	Addressing Social Misattributions of Large Language Models: An HCXAI-based Approach	Andrea Ferrario et.al.	2403.17873	null
2024-03-26	Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications	Philip Lippmann et.al.	2403.17860	null
2024-03-26	ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages	Bhawna Piryani et.al.	2403.17859	link
2024-03-26	Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs	David R. Mortensen et.al.	2403.17856	null
2024-03-26	ArabicaQA: A Comprehensive Dataset for Arabic Question Answering	Abdelrahman Abdallah et.al.	2403.17848	link
2024-03-26	Assessment of Multimodal Large Language Models in Alignment with Human Values	Zhelun Shi et.al.	2403.17830	null
2024-03-26	Accelerating Radio Spectrum Regulation Workflows with Large Language Models (LLMs)	Amir Ghasemi et.al.	2403.17819	null
2024-03-25	Synapse: Learning Preferential Concepts from Visual Demonstrations	Sadanand Modak et.al.	2403.16689	null
2024-03-25	Investigation of the effectiveness of applying ChatGPT in Dialogic Teaching Using Electroencephalography	Jiayue Zhang et.al.	2403.16687	null
2024-03-26	RU22Fact: Optimizing Evidence for Multilingual Explainable Fact-Checking on Russia-Ukraine Conflict	Yirong Zeng et.al.	2403.16662	link
2024-03-26	CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment	Feiteng Fang et.al.	2403.16649	null
2024-03-25	Virtual Co-Pilot: Multimodal Large Language Model-enabled Quick-access Procedures for Single Pilot Operations	Fan Li et.al.	2403.16645	null
2024-03-25	Conversational Grounding: Annotation and Analysis of Grounding Acts and Grounding Units	Biswesh Mohapatra et.al.	2403.16609	null
2024-03-25	TrustAI at SemEval-2024 Task 8: A Comprehensive Analysis of Multi-domain Machine Generated Text Detection Techniques	Ashok Urlana et.al.	2403.16592	null
2024-03-25	Can Large Language Models (or Humans) Distill Text?	Nicolas Audinet de Pieuchon et.al.	2403.16584	null
2024-03-25	NSINA: A News Corpus for Sinhala	Hansi Hettiarachchi et.al.	2403.16571	link
2024-03-25	Elysium: Exploring Object-level Perception in Videos via MLLM	Han Wang et.al.	2403.16558	link
2024-03-22	LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models	Yuzhang Shang et.al.	2403.15388	null
2024-03-22	Can large language models explore in-context?	Akshay Krishnamurthy et.al.	2403.15371	null
2024-03-22	CoLLEGe: Concept Embedding Generation for Large Language Models	Ryan Teehan et.al.	2403.15362	null
2024-03-22	Sphere Neural-Networks for Rational Reasoning	Tiansi Dong et.al.	2403.15297	null
2024-03-22	Measuring Gender and Racial Biases in Large Language Models	Jiafu An et.al.	2403.15281	null
2024-03-22	Bioinformatics and Biomedical Informatics with ChatGPT: Year One Review	Jinge Wang et.al.	2403.15274	null
2024-03-22	Event Temporal Relation Extraction based on Retrieval-Augmented on LLMs	Xiaobin Zhang et.al.	2403.15273	null
2024-03-22	Imagination Augmented Generation: Learning to Imagine Richer Context for Question Answering over Large Language Models	Huanxuan Liao et.al.	2403.15268	link
2024-03-22	FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions	Orion Weller et.al.	2403.15246	null
2024-03-22	An Exploratory Investigation into Code License Infringements in Large Language Model Training Datasets	Jonathan Katzy et.al.	2403.15230	null
2024-03-21	MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?	Renrui Zhang et.al.	2403.14624	null
2024-03-21	Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey	Zeyu Han et.al.	2403.14608	null
2024-03-21	Large Language Models for Multi-Choice Question Classification of Medical Subjects	Víctor Ponce-López et.al.	2403.14582	null
2024-03-21	RAmBLA: A Framework for Evaluating the Reliability of LLMs as Assistants in the Biomedical Domain	William James Bolton et.al.	2403.14578	link
2024-03-21	A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science	Clayton Cohn et.al.	2403.14565	null
2024-03-21	EDT: Improving Large Language Models' Generation by Entropy-based Dynamic Temperature Sampling	Shimao Zhang et.al.	2403.14541	null
2024-03-22	Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference	Han Zhao et.al.	2403.14520	null
2024-03-21	The Ethics of ChatGPT in Medicine and Healthcare: A Systematic Review on Large Language Models (LLMs)	Joschka Haltaufderheide et.al.	2403.14473	null
2024-03-21	Detoxifying Large Language Models via Knowledge Editing	Mengru Wang et.al.	2403.14472	link
2024-03-21	ChatGPT Alternative Solutions: Large Language Models Survey	Hanieh Alipour et.al.	2403.14469	null
2024-03-20	RAR: Retrieving And Ranking Augmented MLLMs for Visual Recognition	Ziyu Liu et.al.	2403.13805	null
2024-03-20	Learning from Models and Data for Visual Grounding	Ruozhen He et.al.	2403.13804	null
2024-03-20	ZigMa: Zigzag Mamba Diffusion Model	Vincent Tao Hu et.al.	2403.13802	null
2024-03-20	Reverse Training to Nurse the Reversal Curse	Olga Golovneva et.al.	2403.13799	null
2024-03-20	Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts	Guangzeng Han et.al.	2403.13786	null
2024-03-20	EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation	Atnafu Lambebo Tonja et.al.	2403.13737	null
2024-03-20	Large Language Models meet Network Slicing Management and Orchestration	Abdulhalim Dandoush et.al.	2403.13721	null
2024-03-21	RoleInteract: Evaluating the Social Interaction of Role-Playing Agents	Hongzhan Chen et.al.	2403.13679	null
2024-03-20	H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation	Renkai Wu et.al.	2403.13642	link
2024-03-21	Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese	Meet Doshi et.al.	2403.13638	null
2024-03-19	Dated Data: Tracing Knowledge Cutoffs in Large Language Models	Jeffrey Cheng et.al.	2403.12958	null
2024-03-19	Automatic Information Extraction From Employment Tribunal Judgements Using Large Language Models	Joana Ribeiro de Faria et.al.	2403.12936	null
2024-03-19	Rapid AIdeation: Generating Ideas With the Self and in Collaboration With Large Language Models	Gionnieve Lim et.al.	2403.12928	null
2024-03-19	Supporting Energy Policy Research with Large Language Models	Grant Buster et.al.	2403.12924	null
2024-03-19	Semantic Layering in Room Segmentation via LLMs	Taehyeon Kim et.al.	2403.12920	null
2024-03-19	Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference	Baolin Li et.al.	2403.12900	null
2024-03-19	mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding	Anwen Hu et.al.	2403.12895	link
2024-03-20	MEDBind: Unifying Language and Multimodal Medical Data Embeddings	Yuan Gao et.al.	2403.12894	null
2024-03-19	HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning	Fucai Ke et.al.	2403.12884	null
2024-03-19	Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models	Zehui Chen et.al.	2403.12881	link
2024-03-18	HDLdebugger: Streamlining HDL debugging with Large Language Models	Xufeng Yao et.al.	2403.11671	null
2024-03-18	Let's Focus on Neuron: Neuron-Level Supervised Fine-tuning for Large Language Model	Haoyun Xu et.al.	2403.11621	null
2024-03-18	Linguacodus: A Synergistic Framework for Transformative Code Generation in Machine Learning Pipelines	Ekaterina Trofimova et.al.	2403.11585	null
2024-03-18	Sensitivity Assessment of Multi-Criteria Decision-Making Methods in Chemical Engineering Optimization Applications	Seyed Reza Nabavi et.al.	2403.11569	null
2024-03-18	Reinforcement Learning with Token-level Feedback for Controllable Text Generation	Wendi Li et.al.	2403.11558	null
2024-03-18	LLM^3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning	Shu Wang et.al.	2403.11552	link
2024-03-18	DEE: Dual-stage Explainable Evaluation Method for Text Generation	Shenyu Zhang et.al.	2403.11509	null
2024-03-18	VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding	Yue Fan et.al.	2403.11481	null
2024-03-18	HateCOT: An Explanation-Enhanced Dataset for Generalizable Offensive Speech Detection via Large Language Models	Huy Nghiem et.al.	2403.11456	link
2024-03-18	LLM Guided Evolution - The Automation of Models Advancing Models	Clint Morris et.al.	2403.11446	null
2024-03-15	VideoAgent: Long-form Video Understanding with Large Language Model as Agent	Xiaohan Wang et.al.	2403.10517	null
2024-03-15	Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization	Ratnadira Widyasari et.al.	2403.10507	null
2024-03-15	ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment	Xiaofeng Wu et.al.	2403.10504	null
2024-03-15	Reconfigurable Robot Identification from Motion Data	Yuhang Hu et.al.	2403.10496	null
2024-03-15	Can a GPT4-Powered AI Agent Be a Good Enough Performance Attribution Analyst?	Bruno de Melo et.al.	2403.10482	null
2024-03-15	Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases	Jiarui Li et.al.	2403.10446	link
2024-03-15	Optimal Block-Level Draft Verification for Accelerating Speculative Decoding	Ziteng Sun et.al.	2403.10444	null
2024-03-15	Using an LLM to Turn Sign Spottings into Spoken Language Sentences	Ozge Mercanoglu Sincan et.al.	2403.10434	null
2024-03-15	SocialGenPod: Privacy-Friendly Generative AI Social Web Applications with Decentralised Personal Data Stores	Vidminas Vizgirda et.al.	2403.10408	link
2024-03-15	A Thorough Comparison of Cross-Encoders and LLMs for Reranking SPLADE	Hervé Déjean et.al.	2403.10407	null
2024-03-14	Dynamic Memory Compression: Retrofitting LLMs for Accelerated Inference	Piotr Nawrot et.al.	2403.09636	null
2024-03-14	3D-VLA: A 3D Vision-Language-Action Generative World Model	Haoyu Zhen et.al.	2403.09631	null
2024-03-14	Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding	Guo Chen et.al.	2403.09626	link
2024-03-14	Compute-first optical detection for noise-resilient visual perception	Jungmin Kim et.al.	2403.09612	null
2024-03-14	MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training	Brandon McKinzie et.al.	2403.09611	null
2024-03-14	Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey	Xiaoyu Liu et.al.	2403.09606	null
2024-03-14	Logical Discrete Graphical Models Must Supplement Large Language Models for Information Synthesis	Gregory Coppola et.al.	2403.09599	null
2024-03-15	ExploRLLM: Guiding Exploration in Reinforcement Learning with Large Language Models	Runyu Ma et.al.	2403.09583	null
2024-03-14	Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation	Yunhao Gou et.al.	2403.09572	null
2024-03-14	Enhancing Trust in Autonomous Agents: An Architecture for Accountability and Explainability through Blockchain and Large Language Models	Laura Fernández-Becerra et.al.	2403.09567	null
2024-03-13	Simple and Scalable Strategies to Continually Pre-train Large Language Models	Adam Ibrahim et.al.	2403.08763	null
2024-03-13	Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework	Jingling Li et.al.	2403.08743	null
2024-03-13	The Garden of Forking Paths: Observing Dynamic Parameters Distribution in Large Language Models	Carlo Nicolini et.al.	2403.08739	null
2024-03-13	Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization	Renjie Pi et.al.	2403.08730	null
2024-03-14	SOTOPIA- $π$ : Interactive Learning of Socially Intelligent Language Agents	Ruiyi Wang et.al.	2403.08715	link
2024-03-13	Review of Generative AI Methods in Cybersecurity	Yagmur Yigit et.al.	2403.08701	null
2024-03-13	TeaMs-RL: Teaching LLMs to Teach Themselves Better Instructions via Reinforcement Learning	Shangding Gu et.al.	2403.08694	null
2024-03-14	Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records	Erlend Frayling et.al.	2403.08664	null
2024-03-13	Human Alignment of Large Language Models through Online Preference Optimisation	Daniele Calandriello et.al.	2403.08635	null
2024-03-13	MedInsight: A Multi-Source Context Augmentation Framework for Generating Patient-Centric Medical Responses using Large Language Models	Subash Neupane et.al.	2403.08607	null
2024-03-12	Beyond Text: Frozen Large Language Models in Visual Signal Comprehension	Lei Zhu et.al.	2403.07874	link
2024-03-12	Rethinking Generative Large Language Model Evaluation for Semantic Comprehension	Fangyun Wei et.al.	2403.07872	null
2024-03-12	Exploring Safety Generalization Challenges of Large Language Models via Code	Qibing Ren et.al.	2403.07865	null
2024-03-12	DeliGrasp: Inferring Object Mass, Friction, and Compliance with LLMs for Adaptive and Minimally Deforming Grasp Policies	William Xie et.al.	2403.07832	null
2024-03-12	The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing	Jianchen Wang et.al.	2403.07825	null
2024-03-12	Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM	Sainbayar Sukhbaatar et.al.	2403.07816	null
2024-03-12	Fine-tuning Large Language Models with Sequential Instructions	Hanxu Hu et.al.	2403.07794	link
2024-03-12	Transforming Competition into Collaboration: The Revolutionary Role of Multi-Agent Systems and Language Models in Modern Organizations	Carlos Jose Xavier Cruz et.al.	2403.07769	link
2024-03-12	Synth $^2$ : Boosting Visual-Language Models with Synthetic Captions and Image Embeddings	Sahand Sharifzadeh et.al.	2403.07750	null
2024-03-12	FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models	Yan Liu et.al.	2403.07747	null
2024-03-11	Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena	Leonie Weissweiler et.al.	2403.06965	null
2024-03-11	Materials science in the era of large language models: a perspective	Ge Lei et.al.	2403.06949	null
2024-03-11	Naming, Describing, and Quantifying Visual Objects in Humans and LLMs	Alberto Testoni et.al.	2403.06935	null
2024-03-11	ERA-CoT: Improving Chain-of-Thought through Entity Relationship Analysis	Yanming Liu et.al.	2403.06932	link
2024-03-12	MEND: Meta dEmonstratioN Distillation for Efficient and Effective In-Context Learning	Yichuan Li et.al.	2403.06914	null
2024-03-11	Exploring Large Language Models and Hierarchical Frameworks for Classification of Large Unstructured Legal Documents	Nishchal Prasad et.al.	2403.06872	null
2024-03-11	Development of a Reliable and Accessible Caregiving Language Model (CaLM)	Bambang Parmanto et.al.	2403.06857	null
2024-03-11	DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation	Guosheng Zhao et.al.	2403.06845	null
2024-03-11	RA-ISF: Learning to Answer and Understand from Retrieval Augmentation via Iterative Self-Feedback	Yanming Liu et.al.	2403.06840	link
2024-03-11	ACFIX: Guiding LLMs with Mined Common RBAC Practices for Context-Aware Repair of Access Control Vulnerabilities in Smart Contracts	Lyuye Zhang et.al.	2403.06838	null
2024-03-08	Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context	Machel Reid et.al.	2403.05530	null
2024-03-08	GEAR: An Efficient KV Cache Compression Recipefor Near-Lossless Generative Inference of LLM	Hao Kang et.al.	2403.05527	link
2024-03-08	Beyond Finite Data: Towards Data-free Out-of-distribution Generalization via Extrapola	Yijiang Li et.al.	2403.05523	null
2024-03-08	Will GPT-4 Run DOOM?	Adrian de Wynter et.al.	2403.05468	null
2024-03-08	Cost-Performance Optimization for Processing Low-Resource Language Tasks Using Commercial LLMs	Arijit Nag et.al.	2403.05434	null
2024-03-08	Explaining Pre-Trained Language Models with Attribution Scores: An Analysis in Low-Resource Settings	Wei Zhou et.al.	2403.05338	null
2024-03-08	ChatASU: Evoking LLM's Reflexion to Truly Understand Aspect Sentiment in Dialogues	Yiding Liu et.al.	2403.05326	null
2024-03-08	RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation	Zihao Wang et.al.	2403.05313	null
2024-03-08	Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis Agents	Jinyang Li et.al.	2403.05307	null
2024-03-08	ACLSum: A New Dataset for Aspect-based Summarization of Scientific Publications	Sotaro Takeshita et.al.	2403.05303	link
2024-03-07	iScore: Visual Analytics for Interpreting How Language Models Automatically Score Summaries	Adam Coscia et.al.	2403.04760	link
2024-03-07	KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts	Adam Coscia et.al.	2403.04758	link
2024-03-07	LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error	Boshi Wang et.al.	2403.04746	link
2024-03-07	ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes	Hashmat Shadab Malik et.al.	2403.04701	null
2024-03-07	Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification	Ekaterina Fadeeva et.al.	2403.04696	null
2024-03-07	Telecom Language Models: Must They Be Large?	Nicola Piovesan et.al.	2403.04666	null
2024-03-07	Teaching Large Language Models to Reason with Reinforcement Learning	Alex Havrilla et.al.	2403.04642	null
2024-03-07	CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios	Qilang Ye et.al.	2403.04640	link
2024-03-07	A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds	Xuenan Xu et.al.	2403.04594	null
2024-03-07	Wiki-TabNER:Advancing Table Interpretation Through Named Entity Recognition	Aneta Koleva et.al.	2403.04577	null
2024-03-06	Bridging Language and Items for Retrieval and Recommendation	Yupeng Hou et.al.	2403.03952	link
2024-03-06	Did Translation Models Get More Robust Without Anyone Even Noticing?	Ben Peters et.al.	2403.03923	null
2024-03-06	Fuzzing BusyBox: Leveraging LLM and Crash Reuse for Embedded Bug Unearthing	Asmita et.al.	2403.03897	null
2024-03-06	SaulLM-7B: A pioneering Large Language Model for Law	Pierre Colombo et.al.	2403.03883	null
2024-03-06	Learning to Decode Collaboratively with Multiple Language Models	Shannon Zejiang Shen et.al.	2403.03870	link
2024-03-06	On the Origins of Linear Representations in Large Language Models	Yibo Jiang et.al.	2403.03867	null
2024-03-06	KIWI: A Dataset of Knowledge-Intensive Writing Instructions for Answering Research Questions	Fangyuan Xu et.al.	2403.03866	null
2024-03-06	Are Language Models Puzzle Prodigies? Algorithmic Puzzles Unveil Serious Challenges in Multimodal Reasoning	Deepanway Ghosal et.al.	2403.03864	link
2024-03-06	X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification	Hanzi Xu et.al.	2403.03863	link
2024-03-06	Emojinize : Enriching Any Text with Emoji Translations	Lars Henning Klein et.al.	2403.03857	null
2024-03-05	The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning	Nathaniel Li et.al.	2403.03218	null
2024-03-05	CLEVR-POC: Reasoning-Intensive Visual Question Answering in Partially Observable Environments	Savitha Sam Abraham et.al.	2403.03203	null
2024-03-05	Towards Democratized Flood Risk Management: An Advanced AI Assistant Enabled by GPT-4 for Enhanced Interpretability and Public Engagement	Rafaela Martelo et.al.	2403.03188	link
2024-03-05	How Well Can Transformers Emulate In-context Newton's Method?	Angeliki Giannou et.al.	2403.03183	null
2024-03-05	Behavior Generation with Latent Actions	Seungjae Lee et.al.	2403.03181	link
2024-03-05	SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection	Peng Qi et.al.	2403.03170	null
2024-03-05	PARADISE: Evaluating Implicit Planning Skills of Language Models with Procedural Warnings and Tips Dataset	Arda Uzunoğlu et.al.	2403.03167	link
2024-03-05	Quantum Many-Body Physics Calculations with Large Language Models	Haining Pan et.al.	2403.03154	null
2024-03-05	Language Guided Exploration for RL Agents in Text Environments	Hitesh Golchha et.al.	2403.03141	null
2024-03-05	Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution	Flor Miriam Plaza-del-Arco et.al.	2403.03121	null
2024-03-02	LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems	Tasnim Ahmed et.al.	2403.01342	null
2024-03-02	Chaining thoughts and LLMs to learn DNA structural biophysics	Tyler D. Ross et.al.	2403.01332	null
2024-03-02	VBART: The Turkish LLM	Meliksah Turker et.al.	2403.01308	null
2024-03-02	Improving the Validity of Automatically Generated Feedback via Reinforcement Learning	Alexander Scarlatos et.al.	2403.01304	link
2024-03-02	NoMAD-Attention: Efficient LLM Inference on CPUs Through Multiply-add-free Attention	Tianyi Zhang et.al.	2403.01273	null
2024-03-02	Employing LLMs for Incident Response Planning and Review	Sam Hays et.al.	2403.01271	null
2024-03-02	Dissecting Language Models: Machine Unlearning via Selective Pruning	Nicholas Pochinkov et.al.	2403.01267	null
2024-03-02	Accelerating Greedy Coordinate Gradient via Probe Sampling	Yiran Zhao et.al.	2403.01251	link
2024-03-02	SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code	Ziniu Hu et.al.	2403.01248	null
2024-03-02	Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal	Jianheng Huang et.al.	2403.01244	null
2024-02-29	The All-Seeing Project V2: Towards General Relation Comprehension of the Open World	Weiyun Wang et.al.	2402.19474	link
2024-02-29	Loose LIPS Sink Ships: Asking Questions in Battleship with Language-Informed Program Sampling	Gabriel Grand et.al.	2402.19471	null
2024-02-29	Towards Tracing Trustworthiness Dynamics: Revisiting Pre-training Period of Large Language Models	Chen Qian et.al.	2402.19465	link
2024-02-29	Curiosity-driven Red-teaming for Large Language Models	Zhang-Wei Hong et.al.	2402.19464	link
2024-02-29	ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL	Yifei Zhou et.al.	2402.19446	link
2024-02-29	Compositional API Recommendation for Library-Oriented Code Generation	Zexiong Ma et.al.	2402.19431	null
2024-02-29	Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models	Soham De et.al.	2402.19427	null
2024-02-29	Crafting Knowledge: Exploring the Creative Mechanisms of Chat-Based Search Engines	Lijia Ma et.al.	2402.19421	null
2024-02-29	On the Scaling Laws of Geographical Representation in Language Models	Nathan Godey et.al.	2402.19406	null
2024-02-29	Entity-Aware Multimodal Alignment Framework for News Image Captioning	Junzhe Zhang et.al.	2402.19404	null
2024-02-28	Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards	Haoxiang Wang et.al.	2402.18571	link
2024-02-28	A Categorization of Complexity Classes for Information Retrieval and Synthesis Using Natural Logic	Gregory Coppola et.al.	2402.18566	null
2024-02-28	Implicit Bias of Next-Token Prediction	Christos Thrampoulidis et.al.	2402.18551	null
2024-02-28	RNNs are not Transformers (Yet): The Key Bottleneck on In-context Retrieval	Kaiyue Wen et.al.	2402.18510	link
2024-02-28	Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling	Mahdi Karami et.al.	2402.18508	null
2024-02-28	Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification	Garima Chhikara et.al.	2402.18502	null
2024-02-28	Language Models Represent Beliefs of Self and Others	Wentao Zhu et.al.	2402.18496	null
2024-02-28	Meta-Task Prompting Elicits Embedding from Large Language Models	Yibin Lei et.al.	2402.18458	null
2024-02-28	Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication	Weize Chen et.al.	2402.18439	link
2024-02-28	Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models	Ercong Nie et.al.	2402.18397	null
2024-02-27	ShapeLLM: Universal 3D Object Understanding for Embodied Interaction	Zekun Qi et.al.	2402.17766	link
2024-02-27	The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits	Shuming Ma et.al.	2402.17764	null
2024-02-27	Massive Activations in Large Language Models	Mingjie Sun et.al.	2402.17762	link
2024-02-27	Evaluating Very Long-Term Conversational Memory of LLM Agents	Adyasha Maharana et.al.	2402.17753	null
2024-02-27	Tower: An Open Multilingual Large Language Model for Translation-Related Tasks	Duarte M. Alves et.al.	2402.17733	null
2024-02-27	AmbigNLG: Addressing Task Ambiguity in Instruction for NLG	Ayana Niwa et.al.	2402.17717	null
2024-02-27	Case-Based or Rule-Based: How Do Transformers Do the Math?	Yi Hu et.al.	2402.17709	link
2024-02-27	NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents	Tamara Czinczoll et.al.	2402.17682	null
2024-02-27	The Emergence of Large Language Models in Static Analysis: A First Look through Micro-Benchmarks	Ashwin Prasad Shivarpatna Venkatesh et.al.	2402.17679	null
2024-02-27	Beyond prompt brittleness: Evaluating the reliability and consistency of political worldviews in LLMs	Tanise Ceron et.al.	2402.17649	null
2024-02-26	Integrating Large Language Models with Graphical Session-Based Recommendation	Naicheng Guo et.al.	2402.16539	null
2024-02-26	LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments	Junzhe Chen et.al.	2402.16499	null
2024-02-26	Unveiling ChatGPT's Usage in Open Source Projects: A Mining-based Study	Rosalia Tufano et.al.	2402.16480	null
2024-02-26	Defending LLMs against Jailbreaking Attacks via Backtranslation	Yihan Wang et.al.	2402.16459	null
2024-02-26	ProLLaMA: A Protein Large Language Model for Multi-Task Protein Language Processing	Liuzhenghao Lv et.al.	2402.16445	null
2024-02-26	ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors	Zhexin Zhang et.al.	2402.16444	link
2024-02-26	Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models	Tianyi Tang et.al.	2402.16438	null
2024-02-26	RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions	Yuansen Zhang et.al.	2402.16431	null
2024-02-26	From RAGs to riches: Using large language models to write documents for clinical trials	Nigel Markey et.al.	2402.16406	null
2024-02-26	MoZIP: A Multilingual Benchmark to Evaluate Large Language Models in Intellectual Property	Shiwen Ni et.al.	2402.16389	link
2024-02-26	Immunization against harmful fine-tuning attacks	Domenic Rosati et.al.	2402.16382	null
2024-02-23	AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning	Jianguo Zhang et.al.	2402.15506	null
2024-02-23	API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs	Kinjal Basu et.al.	2402.15491	null
2024-02-23	Prejudice and Caprice: A Statistical Framework for Measuring Social Discrimination in Large Language Models	Yiran Liu et.al.	2402.15481	null
2024-02-23	Repetition Improves Language Model Embeddings	Jacob Mitchell Springer et.al.	2402.15449	link
2024-02-23	A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models	Stefan Hegselmann et.al.	2402.15422	link
2024-02-23	PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning	Simon Holk et.al.	2402.15420	null
2024-02-23	Explorations of Self-Repair in Language Models	Cody Rushing et.al.	2402.15390	link
2024-02-23	Safe Task Planning for Language-Instructed Multi-Robot Systems using Conformal Prediction	Jun Wang et.al.	2402.15368	null
2024-02-23	Farsight: Fostering Responsible AI Awareness During AI Application Prototyping	Zijie J. Wang et.al.	2402.15350	link
2024-02-23	NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data	Sergei Bogdanov et.al.	2402.15343	null
2024-02-22	PALO: A Polyglot Large Multimodal Model for 5B People	Muhammad Maaz et.al.	2402.14818	link
2024-02-22	CriticBench: Benchmarking LLMs for Critique-Correct Reasoning	Zicheng Lin et.al.	2402.14809	link
2024-02-22	RelayAttention for Efficient Large Language Model Serving with Long System Prompts	Lei Zhu et.al.	2402.14808	null
2024-02-22	A Decision-Language Model (DLM) for Dynamic Restless Multi-Armed Bandit Tasks in Public Health	Nikhil Behari et.al.	2402.14807	null
2024-02-22	Identifying Multiple Personalities in Large Language Models with External Evaluation	Xiaoyang Song et.al.	2402.14805	null
2024-02-22	Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models	Xudong Lu et.al.	2402.14800	link
2024-02-22	Zero-shot cross-lingual transfer in instruction tuning of large language model	Nadezhda Chirkova et.al.	2402.14778	null
2024-02-22	DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models	Yuhang Cao et.al.	2402.14767	link
2024-02-22	MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues	Ge Bai et.al.	2402.14762	null
2024-02-22	Generalizing Reward Modeling for Out-of-Distribution Preference Learning	Chen Jia et.al.	2402.14760	null
2024-02-21	Coercing LLMs to do and reveal (almost) anything	Jonas Geiping et.al.	2402.14020	link
2024-02-21	Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment	Vyas Raina et.al.	2402.14016	null
2024-02-21	OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems	Chaoqun He et.al.	2402.14008	null
2024-02-21	Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models	Zhiwei He et.al.	2402.14007	null
2024-02-21	Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models	Aline Ioste et.al.	2402.14002	null
2024-02-21	Towards Building Multilingual Language Model for Medicine	Pengcheng Qiu et.al.	2402.13963	link
2024-02-21	Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning	Debjit Paul et.al.	2402.13950	null
2024-02-21	Do Efficient Transformers Really Save Computation?	Kai Yang et.al.	2402.13934	null
2024-02-21	Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content	Federico Bianchi et.al.	2402.13926	null
2024-02-21	SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization	Prakamya Mishra et.al.	2402.13919	null
2024-02-20	Unlocking Insights: Semantic Search in Jupyter Notebooks	Lan Li et.al.	2402.13234	null
2024-02-20	Investigating Cultural Alignment of Large Language Models	Badr AlKhamissi et.al.	2402.13231	link
2024-02-20	Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive	Arka Pal et.al.	2402.13228	null
2024-02-20	AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning	Qiao Jin et.al.	2402.13225	null
2024-02-20	RoCode: A Dataset for Measuring Code Intelligence from Problem Definitions in Romanian	Adrian Cosma et.al.	2402.13222	link
2024-02-20	How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts	Yusu Qian et.al.	2402.13220	null
2024-02-20	Softmax Probabilities (Mostly) Predict Large Language Model Correctness on Multiple-Choice Q&A	Benjamin Plaut et.al.	2402.13213	link
2024-02-20	Soft Self-Consistency Improves Language Model Agents	Han Wang et.al.	2402.13212	link
2024-02-20	Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation	Dongjin Kang et.al.	2402.13211	null
2024-02-20	Bayesian Reward Models for LLM Alignment	Adam X. Yang et.al.	2402.13210	null
2024-02-19	Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding	Zhuoming Chen et.al.	2402.12374	null
2024-02-19	A Critical Evaluation of AI Feedback for Aligning Large Language Models	Archit Sharma et.al.	2402.12366	link
2024-02-19	Nonlinear Discrete-Time Observers with Physics-Informed Neural Networks	Hector Vargas Alvarez et.al.	2402.12360	null
2024-02-19	Graph-Based Retriever Captures the Long Tail of Biomedical Knowledge	Julien Delile et.al.	2402.12352	null
2024-02-19	GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations	Jinhao Duan et.al.	2402.12348	link
2024-02-19	Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!	Zhanhui Zhou et.al.	2402.12343	null
2024-02-19	Shall We Talk: Exploring Spontaneous Collaborations of Competing LLM Agents	Zengqing Wu et.al.	2402.12327	link
2024-02-19	ARKS: Active Retrieval in Knowledge Soup for Code Generation	Hongjin Su et.al.	2402.12317	null
2024-02-19	Is Open-Source There Yet? A Comparative Study on Commercial and Open-Source LLMs in Their Ability to Label Chest X-Ray Reports	Felix J. Dorfner et.al.	2402.12298	null
2024-02-19	Adaptive Skeleton Graph Decoding	Shuowei Jin et.al.	2402.12280	null
2024-02-16	PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter	Junfei Xiao et.al.	2402.10896	null
2024-02-16	RLVF: Learning from Verbal Feedback without Overgeneralization	Moritz Stephan et.al.	2402.10893	null
2024-02-16	Instruction Diversity Drives Generalization To Unseen Tasks	Dylan Zhang et.al.	2402.10891	null
2024-02-16	When is Tree Search Useful for LLM Planning? It Depends on the Discriminator	Ziru Chen et.al.	2402.10890	null
2024-02-16	Multi-modal preference alignment remedies regression of visual instruction tuning on language model	Shengzhi Li et.al.	2402.10884	null
2024-02-16	EcoRank: Budget-Constrained Text Re-ranking Using Large Language Models	Muhammad Shihab Rashid et.al.	2402.10866	null
2024-02-16	Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities	Mingyu Jin et.al.	2402.10835	null
2024-02-16	RAG-Driver: Generalisable Driving Explanations with Retrieval-Augmented In-Context Learning in Multi-Modal Large Language Model	Jianhao Yuan et.al.	2402.10828	null
2024-02-16	Quantifying the Persona Effect in LLM Simulations	Tiancheng Hu et.al.	2402.10811	null
2024-02-16	Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond	Yongqi Li et.al.	2402.10805	null
2024-02-15	Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling	Raunaq Bhirangi et.al.	2402.10211	null
2024-02-15	Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation	Huizhuo Yuan et.al.	2402.10210	null
2024-02-15	Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment	Rui Yang et.al.	2402.10207	null
2024-02-15	Chain-of-Thought Reasoning Without Prompting	Xuezhi Wang et.al.	2402.10200	null
2024-02-15	A Trembling House of Cards? Mapping Adversarial Attacks against Language Agents	Lingbo Mo et.al.	2402.10196	link
2024-02-15	BitDelta: Your Fine-Tune May Only Be Worth One Bit	James Liu et.al.	2402.10193	link
2024-02-15	Uncertainty Decomposition and Quantification for In-Context Learning of Large Language Models	Chen Ling et.al.	2402.10189	link
2024-02-15	Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective	Tianyi Qiu et.al.	2402.10184	null
2024-02-15	TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation	Yaoxiang Wang et.al.	2402.10178	null
2024-02-15	OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset	Shubham Toshniwal et.al.	2402.10176	link
2024-02-14	AQA-Bench: An Interactive Benchmark for Evaluating LLMs' Sequential Reasoning Ability	Siwei Yang et.al.	2402.09404	link
2024-02-14	Reinforcement Learning from Human Feedback with Active Queries	Kaixuan Ji et.al.	2402.09401	null
2024-02-14	Get More with LESS: Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference	Harry Dong et.al.	2402.09398	null
2024-02-14	LlaSMol: Advancing Large Language Models for Chemistry with a Large-Scale, Comprehensive, High-Quality Instruction Tuning Dataset	Botao Yu et.al.	2402.09391	link
2024-02-14	HGOT: Hierarchical Graph of Thoughts for Retrieval-Augmented In-Context Learning in Factuality Evaluation	Yihao Fang et.al.	2402.09390	null
2024-02-14	Massively Multi-Cultural Knowledge Acquisition & LM Benchmarking	Yi Fung et.al.	2402.09369	null
2024-02-14	Copyright Traps for Large Language Models	Matthieu Meeus et.al.	2402.09363	null
2024-02-14	HiRE: High Recall Approximate Top- $k$ Estimation for Efficient LLM Inference	Yashas Samaga B L et.al.	2402.09360	null
2024-02-14	Developing a Framework for Auditing Large Language Models Using Human-in-the-Loop	Maryam Amirizaniani et.al.	2402.09346	null
2024-02-14	AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach	Maryam Amirizaniani et.al.	2402.09334	null
2024-02-13	COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability	Xingang Guo et.al.	2402.08679	link
2024-02-13	Human Curriculum Effects Emerge with In-Context Learning in Neural Networks	Jacob Russin et.al.	2402.08674	null
2024-02-13	Improving Generalization in Semantic Parsing by Increasing Natural Language Variation	Irina Saparina et.al.	2402.08666	null
2024-02-13	The Last JITAI? The Unreasonable Effectiveness of Large Language Models in Issuing Just-in-Time Adaptive Interventions: Fostering Physical Activity in a Prospective Cardiac Rehabilitation Setting	David Haag et.al.	2402.08658	null
2024-02-13	PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs	Michael Dorkenwald et.al.	2402.08657	null
2024-02-13	Tandem Transformers for Inference Efficient LLMs	Aishwarya P S et.al.	2402.08644	null
2024-02-13	SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages	Nedjma Ousidhoum et.al.	2402.08638	null
2024-02-13	Knowledge Editing on Black-box Large Language Models	Xiaoshuai Song et.al.	2402.08631	null
2024-02-13	Test-Time Backdoor Attacks on Multimodal Large Language Models	Dong Lu et.al.	2402.08577	link
2024-02-13	Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast	Xiangming Gu et.al.	2402.08567	link
2024-02-12	WildfireGPT: Tailored Large Language Model for Wildfire Analysis	Yangxinyu Xie et.al.	2402.07877	null
2024-02-12	Policy Improvement using Language Feedback Models	Victor Zhong et.al.	2402.07876	null
2024-02-12	Scaling Laws for Fine-Grained Mixture of Experts	Jakub Krajewski et.al.	2402.07871	null
2024-02-12	PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models	Wei Zou et.al.	2402.07867	link
2024-02-12	AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy	Philipp Schoenegger et.al.	2402.07862	null
2024-02-12	Lissard: Long and Simple Sequential Reasoning Datasets	Mirelle Bueno et.al.	2402.07859	null
2024-02-12	Mercury: An Efficiency Benchmark for LLM Code Synthesis	Mingzhe Du et.al.	2402.07844	null
2024-02-12	Do Membership Inference Attacks Work on Large Language Models?	Michael Duan et.al.	2402.07841	null
2024-02-12	Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model	Ahmet Üstün et.al.	2402.07827	null
2024-02-12	Differentially Private Zeroth-Order Methods for Scalable Large Language Model Finetuning	Z Liu et.al.	2402.07818	null
2024-02-09	Understanding the Effects of Iterative Prompting on Truthfulness	Satyapriya Krishna et.al.	2402.06625	null
2024-02-09	Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning	Shivalika Singh et.al.	2402.06619	null
2024-02-09	On the Out-Of-Distribution Generalization of Multimodal Large Language Models	Xingxuan Zhang et.al.	2402.06599	null
2024-02-09	CigaR: Cost-efficient Program Repair with LLMs	Dávid Hidvégi et.al.	2402.06598	null
2024-02-09	Understanding the Weakness of Large Language Model Agents within a Complex Android Environment	Mingzhe Xing et.al.	2402.06596	link
2024-02-09	G-SciEdBERT: A Contextualized LLM for Science Assessment Tasks in German	Ehsan Latif et.al.	2402.06584	null
2024-02-09	The Quantified Boolean Bayesian Network: Theory and Experiments with a Logical Graphical Model	Gregory Coppola et.al.	2402.06557	null
2024-02-09	Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA	Marek Šuppa et.al.	2402.06549	null
2024-02-09	Calibrating Long-form Generations from Large Language Models	Yukun Huang et.al.	2402.06544	null
2024-02-09	Introspective Planning: Guiding Language-Enabled Agents to Refine Their Own Uncertainty	Kaiqu Liang et.al.	2402.06529	null
2024-02-08	SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models	Peng Gao et.al.	2402.05935	link
2024-02-08	Driving Everywhere with Large Language Model Policy Adaptation	Boyi Li et.al.	2402.05932	null
2024-02-08	WebLINX: Real-World Website Navigation with Multi-Turn Dialogue	Xing Han Lù et.al.	2402.05930	null
2024-02-08	On the Convergence of Zeroth-Order Federated Tuning in Large Language Models	Zhenqing Ling et.al.	2402.05926	null
2024-02-08	Efficient Stagewise Pretraining via Progressive Subnetworks	Abhishek Panigrahi et.al.	2402.05913	null
2024-02-08	FACT-GPT: Fact-Checking Augmentation via Claim Matching with LLMs	Eun Cheol Choi et.al.	2402.05904	null
2024-02-08	Large Language Model Meets Graph Neural Network in Knowledge Distillation	Shengxiang Hu et.al.	2402.05894	null
2024-02-08	Generative Echo Chamber? Effects of LLM-Powered Search Systems on Diverse Information Seeking	Nikhil Sharma et.al.	2402.05880	null
2024-02-08	PromptCrypt: Prompt Encryption for Secure Communication with Large Language Models	Guo Lin et.al.	2402.05868	link
2024-02-08	How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis	Federico Bianchi et.al.	2402.05863	link
2024-02-07	Opening the AI black box: program synthesis via mechanistic interpretability	Eric J. Michaud et.al.	2402.05110	null
2024-02-07	You Can REST Now: Automated Specification Inference and Black-Box Testing of RESTful APIs with Large Language Models	Alix Decrop et.al.	2402.05102	null
2024-02-07	Hydragen: High-Throughput LLM Inference with Shared Prefixes	Jordan Juravsky et.al.	2402.05099	null
2024-02-07	Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation	Ziyang Wang et.al.	2402.05079	link
2024-02-07	SALAD-Bench: A Hierarchical and Comprehensive Safety Benchmark for Large Language Models	Lijun Li et.al.	2402.05044	link
2024-02-07	A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules?	Agustinus Kristiadi et.al.	2402.05015	link
2024-02-07	Pedagogical Alignment of Large Language Models	Shashank Sonkar et.al.	2402.05000	null
2024-02-07	An Enhanced Prompt-Based LLM Reasoning Scheme via Knowledge Graph-Integrated Collaboration	Yihao Li et.al.	2402.04978	null
2024-02-07	ChatScratch: An AI-Augmented System Toward Autonomous Visual Programming Learning for Children Aged 6-12	Liuqing Chen et.al.	2402.04975	null
2024-02-07	Reconfidencing LLMs from the Grouping Loss Perspective	Lihu Chen et.al.	2402.04957	null
2024-02-06	AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls	Yu Du et.al.	2402.04253	null
2024-02-06	HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal	Mantas Mazeika et.al.	2402.04249	link
2024-02-06	Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science	Xiangru Tang et.al.	2402.04247	null
2024-02-06	Can Generative Agents Predict Emotion?	Ciaran Regan et.al.	2402.04232	null
2024-02-06	Explaining Autonomy: Enhancing Human-Robot Interaction through Explanation Generation with Large Language Models	David Sobrín-Hidalgo et.al.	2402.04206	null
2024-02-06	SHIELD : An Evaluation Benchmark for Face Spoofing and Forgery Detection with Multimodal Large Language Models	Yichen Shi et.al.	2402.04178	link
2024-02-06	Scaling Laws for Downstream Task Performance of Large Language Models	Berivan Isik et.al.	2402.04177	null
2024-02-06	Multi-line AI-assisted Code Authoring	Omer Dunay et.al.	2402.04141	null
2024-02-06	U-shaped Vision Mamba for Single Image Dehazing	Zhuoran Zheng et.al.	2402.04139	null
2024-02-06	Scientific Language Modeling: A Quantitative Review of Large Language Models in Molecular Science	Pengfei Liu et.al.	2402.04119	link
2024-02-05	Nevermind: Instruction Override and Moderation in Large Language Models	Edward Kim et.al.	2402.03303	null
2024-02-05	Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining	Jiarun Liu et.al.	2402.03302	link
2024-02-05	GUARD: Role-playing to Generate Natural-language Jailbreakings to Test Guideline Adherence of Large Language Models	Haibo Jin et.al.	2402.03299	null
2024-02-05	Make Every Move Count: LLM-based High-Quality RTL Code Generation Using MCTS	Matthew DeLorenzo et.al.	2402.03289	null
2024-02-05	Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models	Anthony Sicilia et.al.	2402.03284	null
2024-02-05	Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models	Zhiyuan Hu et.al.	2402.03271	link
2024-02-05	Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills	Kolby Nottingham et.al.	2402.03244	null
2024-02-05	JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching	Antoine Magron et.al.	2402.03242	link
2024-02-05	English Prompts are Better for NLI-based Zero-Shot Emotion Classification than Target-Language Prompts	Patrick Barreiß et.al.	2402.03223	null
2024-02-05	Unified Hallucination Detection for Multimodal Large Language Models	Xiang Chen et.al.	2402.03190	link
2024-02-02	TravelPlanner: A Benchmark for Real-World Planning with Language Agents	Jian Xie et.al.	2402.01622	null
2024-02-02	Stochastic Two Points Method for Deep Model Zeroth-order Optimization	Yijiang Pang et.al.	2402.01621	null
2024-02-02	MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models	Justin Chih-Yao Chen et.al.	2402.01620	link
2024-02-02	KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases	Jiajie Zhang et.al.	2402.01619	link
2024-02-02	Style Vectors for Steering Generative Large Language Model	Kai Konen et.al.	2402.01618	link
2024-02-02	Foundation Model Sherpas: Guiding Foundation Models through Knowledge and Reasoning	Debarun Bhattacharjya et.al.	2402.01602	null
2024-02-02	BAT: Learning to Reason about Spatial Sounds with Large Language Models	Zhisheng Zheng et.al.	2402.01591	null
2024-02-02	Homogenization Effects of Large Language Models on Human Creative Ideation	Barrett R. Anderson et.al.	2402.01536	null
2024-02-02	Decoding Speculative Decoding	Minghao Yan et.al.	2402.01528	null
2024-02-02	K-Level Reasoning with Large Language Models	Yadong Zhang et.al.	2402.01521	null
2024-02-01	Evaluating Large Language Models for Generalization and Robustness via Data Compression	Yucheng Li et.al.	2402.00861	null
2024-02-01	Can Large Language Models Understand Context?	Yilun Zhu et.al.	2402.00858	null
2024-02-01	SymbolicAI: A framework for logic-based approaches combining generative models and solvers	Marius-Constantin Dinu et.al.	2402.00854	link
2024-02-01	Score-based Causal Representation Learning: Linear and General Transformations	Burak Varıcı et.al.	2402.00849	null
2024-02-01	Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?	Xue-Yong Fu et.al.	2402.00841	null
2024-02-01	Common errors in Generative AI systems used for knowledge extraction in the climate action domain	Denis Havlik et.al.	2402.00830	null
2024-02-01	Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents	Zelong Li et.al.	2402.00798	link
2024-02-01	LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law	Toni J. B. Liu et.al.	2402.00795	null
2024-02-01	CroissantLLM: A Truly Bilingual French-English Language Model	Manuel Faysse et.al.	2402.00786	link
2024-02-01	Dense Reward for Free in Reinforcement Learning from Human Feedback	Alex J. Chan et.al.	2402.00782	link
2024-01-31	Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?	Andreas Opedal et.al.	2401.18070	null
2024-01-31	LongAlign: A Recipe for Long Context Alignment of Large Language Models	Yushi Bai et.al.	2401.18058	link
2024-01-31	Paramanu: A Family of Novel Efficient Indic Generative Foundation Language Models	Mitodru Niyogi et.al.	2401.18034	null
2024-01-31	Supporting Anticipatory Governance using LLMs: Evaluating and Aligning Large Language Models with the News Media to Anticipate the Negative Impacts of AI	Mowafak Allaham et.al.	2401.18028	null
2024-01-31	Prompt-Driven LLM Safeguarding via Directed Representation Optimization	Chujie Zheng et.al.	2401.18018	link
2024-01-31	EEG-GPT: Exploring Capabilities of Large Language Models for EEG Classification and Interpretation	Jonathan W. Kim et.al.	2401.18006	null
2024-01-31	Evaluating the Effectiveness of GPT-4 Turbo in Creating Defeaters for Assurance Cases	Kimya Khakzad Shahandashti et.al.	2401.17991	null
2024-01-31	Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study	Qirui Jiao et.al.	2401.17981	null
2024-01-31	HyperZ $\cdot$Z$\cdot$ W Operator Connects Slow-Fast Networks for Full Context Interaction	Harvie Zhang et.al.	2401.17948	null
2024-01-31	LOCOST: State-Space Models for Long Document Abstractive Summarization	Florian Le Bronnec et.al.	2401.17919	link
2024-01-30	Weaver: Foundation Models for Creative Writing	Tiannan Wang et.al.	2401.17268	null
2024-01-30	Weak-to-Strong Jailbreaking on Large Language Models	Xuandong Zhao et.al.	2401.17256	link
2024-01-30	LLaMP: Large Language Model Made Powerful for High-fidelity Materials Knowledge Retrieval and Distillation	Yuan Chiang et.al.	2401.17244	null
2024-01-31	GazeGPT: Augmenting Human Capabilities using Gaze-contingent Contextual AI for Smart Eyewear	Robert Konrad et.al.	2401.17217	null
2024-01-30	Data-efficient Fine-tuning for LLM-based Recommendation	Xinyu Lin et.al.	2401.17197	null
2024-01-30	Transfer Learning for Text Diffusion Models	Kehang Han et.al.	2401.17181	null
2024-01-30	Conditional and Modal Reasoning in Large Language Models	Wesley H. Holliday et.al.	2401.17169	null
2024-01-30	Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios	Shijue Huang et.al.	2401.17167	link
2024-01-30	Learning Agent-based Modeling with LLM Companions: Experiences of Novices and Experts Using ChatGPT & NetLogo Chat	John Chen et.al.	2401.17163	null
2024-01-30	Large Language Model Evaluation via Matrix Entropy	Lai Wei et.al.	2401.17139	link
2024-01-29	Scaling Sparse Fine-Tuning to Large Language Models	Alan Ansell et.al.	2401.16405	null
2024-01-29	Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling	Pratyush Maini et.al.	2401.16380	null
2024-01-29	The role of library versions in Developer-ChatGPT conversations	Rachna Raj et.al.	2401.16340	null
2024-01-29	Machine Translation Meta Evaluation through Translation Accuracy Challenge Sets	Nikita Moghe et.al.	2401.16313	null
2024-01-29	Security Code Review by LLMs: A Deep Dive into Responses	Jiaxin Yu et.al.	2401.16310	null
2024-01-29	CO2: Efficient Distributed Training with Full Communication-Computation Overlap	Weigao Sun et.al.	2401.16265	null
2024-01-29	An Empirical Study on Usage and Perceptions of LLMs in a Software Engineering Project	Sanka Rasnayaka et.al.	2401.16186	null
2024-01-29	LLM4Vuln: A Unified Evaluation Framework for Decoupling and Enhancing LLMs' Vulnerability Reasoning	Yuqiang Sun et.al.	2401.16185	null
2024-01-29	LLaMandement: Large Language Models for Summarization of French Legislative Proposals	Joseph Gesnouin et.al.	2401.16182	null
2024-01-29	On Decentralized Linearly Separable Computation With the Minimum Computation Cost	Haoning Chen et.al.	2401.16181	null
2024-01-26	EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty	Yuhui Li et.al.	2401.15077	null
2024-01-26	From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities	Chaochao Lu et.al.	2401.15071	null
2024-01-26	Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement Learning	Md Mushfiqur Rahman et.al.	2401.15043	null
2024-01-26	PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models	Haochen Tan et.al.	2401.15042	null
2024-01-26	On the generalization capacity of neural networks during generic multimodal reasoning	Takuya Ito et.al.	2401.15030	null
2024-01-26	SliceGPT: Compress Large Language Models by Deleting Rows and Columns	Saleh Ashkboos et.al.	2401.15024	null
2024-01-26	Reassessing Java Code Readability Models with a Human-Centered Approach	Agnia Sergeyuk et.al.	2401.14936	null
2024-01-26	Appropriateness of LLM-equipped Robotic Well-being Coach Language in the Workplace: A Qualitative Evaluation	Micol Spitale et.al.	2401.14935	null
2024-01-26	Do LLMs Dream of Ontologies?	Marco Bombieri et.al.	2401.14931	null
2024-01-26	The Power of Noise: Redefining Retrieval for RAG Systems	Florin Cuconasu et.al.	2401.14887	null
2024-01-25	The Typing Cure: Experiences with Large Language Model Chatbots for Mental Health Support	Inhwa Song et.al.	2401.14362	null
2024-01-25	ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models	Yao Fu et.al.	2401.14351	null
2024-01-25	Topologies of Reasoning: Demystifying Chains, Trees, and Graphs of Thoughts	Maciej Besta et.al.	2401.14295	null
2024-01-25	RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models models via Romanization	Jaavid Aktar Husain et.al.	2401.14280	null
2024-01-25	ZS4C: Zero-Shot Synthesis of Compilable Code for Incomplete Code Snippets using ChatGPT	Azmain Kabir et.al.	2401.14279	null
2024-01-25	GPTVoiceTasker: LLM-Powered Virtual Assistant for Smartphone	Minh Duc Vu et.al.	2401.14268	null
2024-01-25	Transformers and Cortical Waves: Encoders for Pulling In Context Across Time	Lyle Muller et.al.	2401.14267	null
2024-01-25	Improving Natural Language Capability of Code Large Language Model	Wei Li et.al.	2401.14242	link
2024-01-25	DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence	Daya Guo et.al.	2401.14196	link
2024-01-25	How Can Large Language Models Understand Spatial-Temporal Data?	Lei Liu et.al.	2401.14192	null
2024-01-24	How Good is ChatGPT at Face Biometrics? A First Look into Recognition, Soft Biometrics, and Explainability	Ivan DeAndres-Tame et.al.	2401.13641	null
2024-01-25	MM-LLMs: Recent Advances in MultiModal Large Language Models	Duzhen Zhang et.al.	2401.13601	null
2024-01-24	Consistency Guided Knowledge Retrieval and Denoising in LLMs for Zero-shot Document-level Relation Triplet Extraction	Qi Sun et.al.	2401.13598	null
2024-01-24	Graph Guided Question Answer Generation for Procedural Question-Answering	Hai X. Pham et.al.	2401.13594	null
2024-01-24	Evaluation of General Large Language Models in Contextually Assessing Semantic Concepts Extracted from Adult Critical Care Electronic Health Record Notes	Darren Liu et.al.	2401.13588	null
2024-01-24	Fine-grained Contract NER using instruction based model	Hiranmai Sri Adibhatla et.al.	2401.13545	null
2024-01-24	SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation	Dong Zhang et.al.	2401.13527	link
2024-01-24	Research about the Ability of LLM in the Tamper-Detection Area	Xinyu Yang et.al.	2401.13504	null
2024-01-24	How AI Ideas Affect the Creativity, Diversity, and Evolution of Human Ideas: Evidence From a Large, Dynamic Experiment	Joshua Ashkinaze et.al.	2401.13481	null
2024-01-24	Clue-Guided Path Exploration: An Efficient Knowledge Base Question-Answering Framework with Low Computational Resource Consumption	Dehao Tao et.al.	2401.13444	null
2024-01-23	HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments	Qinhong Zhou et.al.	2401.12975	link
2024-01-23	Raidar: geneRative AI Detection viA Rewriting	Chengzhi Mao et.al.	2401.12970	null
2024-01-23	AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents	Michael Ahn et.al.	2401.12963	null
2024-01-23	Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion	Dylan Zhang et.al.	2401.12947	null
2024-01-23	Red Teaming Visual Language Models	Mukai Li et.al.	2401.12915	null
2024-01-23	From Understanding to Utilization: A Survey on Explainability for Large Language Models	Haoyan Luo et.al.	2401.12874	null
2024-01-23	KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning	Debjyoti Mondal et.al.	2401.12863	null
2024-01-23	How well can large language models explain business processes?	Dirk Fahland et.al.	2401.12846	null
2024-01-23	Benchmarking LLMs via Uncertainty Quantification	Fanghua Ye et.al.	2401.12794	null
2024-01-23	Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study	W. Ronny Huang et.al.	2401.12789	null
2024-01-22	Less Could Be Better: Parameter-efficient Fine-tuning Advances Medical Vision Foundation Models	Chenyu Lian et.al.	2401.12215	link
2024-01-22	CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation	Zhihong Chen et.al.	2401.12208	null
2024-01-22	APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference	Bowen Zhao et.al.	2401.12200	null
2024-01-22	Text Embedding Inversion Attacks on Multilingual Language Models	Yiyi Chen et.al.	2401.12192	null
2024-01-22	WARM: On the Benefits of Weight Averaged Reward Models	Alexandre Ramé et.al.	2401.12187	null
2024-01-22	CodeTailor: Personalized Parsons Puzzles are Preferred Over AI-Generated Solutions to Support Learning	Xinying Hou et.al.	2401.12125	null
2024-01-22	The Curious Case of Nonverbal Abstract Reasoning with Multi-Modal Large Language Models	Kian Ahrabian et.al.	2401.12117	null
2024-01-22	An Empirical Analysis of In-context Learning Abilities of LLMs for MT	Pranjal A. Chitale et.al.	2401.12097	null
2024-01-22	Revisiting Demonstration Selection Strategies in In-Context Learning	Keqin Peng et.al.	2401.12087	null
2024-01-22	Temporal Blind Spots in Large Language Models	Jonas Wallat et.al.	2401.12078	link
2024-01-19	Reinforcement learning for question answering in programming domain using public community scoring as a human feedback	Alexey Gorbatovski et.al.	2401.10882	null
2024-01-19	Pruning for Protection: Increasing Jailbreak Resistance in Aligned LLMs Without Fine-Tuning	Adib Hasan et.al.	2401.10862	null
2024-01-19	Using LLMs to discover emerging coded antisemitic hate-speech emergence in extremist social media	Dhanush Kikkisetti et.al.	2401.10841	null
2024-01-19	A survey on recent advances in named entity recognition	Imed Keraghel et.al.	2401.10825	null
2024-01-19	Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads	Tianle Cai et.al.	2401.10774	link
2024-01-19	Mitigating Hallucinations of Large Language Models via Knowledge Consistent Alignment	Fanqi Wan et.al.	2401.10768	null
2024-01-19	Interactions with Prompt Problems: A New Way to Teach Programming with Large Language Models	James Prather et.al.	2401.10759	null
2024-01-19	FinLLMs: A Framework for Financial Reasoning Dataset Generation with Large Language Models	Ziqiang Yuan et.al.	2401.10744	null
2024-01-19	In-IDE Human-AI Experience in the Era of Large Language Models; A Literature Review	Agnia Sergeyuk et.al.	2401.10739	null
2024-01-19	Dynamic Q&A of Clinical Documents with Large Language Models	Ran Elgedawy et.al.	2401.10733	null
2024-01-18	Towards Language-Driven Video Inpainting via Multimodal Large Language Models	Jianzong Wu et.al.	2401.10226	null
2024-01-18	ChatQA: Building GPT-4 Level Conversational QA Models	Zihan Liu et.al.	2401.10225	null
2024-01-18	Beyond Reference-Based Metrics: Analyzing Behaviors of Open LLMs on Data-to-Text Generation	Zdeněk Kasner et.al.	2401.10186	null
2024-01-18	Comparing Traditional and LLM-based Search for Image Geolocation	Albatool Wazzan et.al.	2401.10184	null
2024-01-18	Spatial-Temporal Large Language Model for Traffic Prediction	Chenxi Liu et.al.	2401.10134	null
2024-01-18	Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs	Haritz Puerto et.al.	2401.10065	link
2024-01-18	DiffusionGPT: LLM-Driven Text-to-Image Generation System	Jie Qin et.al.	2401.10061	null
2024-01-18	Large Language Models for Scientific Information Extraction: An Empirical Study for Virology	Mahsa Shamsabadi et.al.	2401.10040	null
2024-01-18	LOCALINTEL: Generating Organizational Threat Intelligence from Global and Local Cyber Knowledge	Shaswata Mitra et.al.	2401.10036	null
2024-01-18	Evolutionary Computation in the Era of Large Language Model: Survey and Roadmap	Xingyu Wu et.al.	2401.10034	null
2024-01-17	Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model	Lianghui Zhu et.al.	2401.09417	link
2024-01-17	Vlogger: Make Your Dream A Vlog	Shaobin Zhuang et.al.	2401.09414	link
2024-01-17	Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text	Mazal Bethany et.al.	2401.09407	null
2024-01-17	Stuck in the Quicksand of Numeracy, Far from AGI Summit: Evaluating LLMs' Mathematical Competency through Ontology-guided Perturbations	Pengfei Hong et.al.	2401.09395	null
2024-01-17	Large Language Models Are Neurosymbolic Reasoners	Meng Fang et.al.	2401.09334	null
2024-01-17	Material Informatics through Neural Networks on Ab-Initio Electron Charge Densities: the Role of Transfer Learning	Dario Massa et.al.	2401.09301	null
2024-01-17	Beyond Anti-Forgetting: Multimodal Continual Instruction Tuning with Positive Forward Transfer	Junhao Zheng et.al.	2401.09181	null
2024-01-17	InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding	Qiaoling Chen et.al.	2401.09149	null
2024-01-17	BibSonomy Meets ChatLLMs for Publication Management: From Chat to Publication Management: Organizing your related work using BibSonomy & LLMs	Tom Völker et.al.	2401.09092	null
2024-01-17	Understanding the concerns and choices of public when using large language models for healthcare	Yunpeng Xiao et.al.	2401.09090	null
2024-01-16	RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture	Aman Gupta et.al.	2401.08406	null
2024-01-16	DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models	Zongxin Yang et.al.	2401.08392	link
2024-01-16	Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference	Jinghan Yao et.al.	2401.08383	link
2024-01-16	Hallucination Detection and Hallucination Mitigation: An Investigation	Junliang Luo et.al.	2401.08358	null
2024-01-16	Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models	Jianhui Pang et.al.	2401.08350	null
2024-01-16	Understanding User Experience in Large Language Model Interactions	Jiayin Wang et.al.	2401.08329	null
2024-01-16	RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning	Junjie Ye et.al.	2401.08326	null
2024-01-16	Application of LLM Agents in Recruitment: A Novel Framework for Resume Screening	Chengguang Gan et.al.	2401.08315	null
2024-01-16	Anchor function: a type of benchmark functions for studying language models	Zhongwang Zhang et.al.	2401.08309	null
2024-01-16	DAPT: A Dual Attention Framework for Parameter-Efficient Continual Learning of Large Language Models	Weixiang Zhao et.al.	2401.08295	null
2024-01-12	Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements	Anton Voronov et.al.	2401.06766	null
2024-01-12	APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding	Mingdao Liu et.al.	2401.06761	null
2024-01-12	Few-Shot Detection of Machine-Generated Text using Style Representations	Rafael Rivera Soto et.al.	2401.06712	null
2024-01-12	Multi-Candidate Speculative Decoding	Sen Yang et.al.	2401.06706	link
2024-01-12	An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models	Gantavya Bhatt et.al.	2401.06692	null
2024-01-12	Don't Rank, Combine! Combining Machine Translation Hypotheses Using Quality Estimation	Giorgos Vernikos et.al.	2401.06688	null
2024-01-12	LLMRS: Unlocking Potentials of LLM-Based Recommender Systems for Software Purchase	Angela John et.al.	2401.06676	null
2024-01-12	Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation	Jan Cegin et.al.	2401.06643	link
2024-01-12	OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models	Shuai Wang et.al.	2401.06628	null
2024-01-12	How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs	Yi Zeng et.al.	2401.06373	null
2024-01-11	TOFU: A Task of Fictitious Unlearning for LLMs	Pratyush Maini et.al.	2401.06121	null
2024-01-11	Extreme Compression of Large Language Models via Additive Quantization	Vage Egiazarian et.al.	2401.06118	link
2024-01-11	Patchscope: A Unifying Framework for Inspecting Hidden Representations of Language Models	Asma Ghandeharioun et.al.	2401.06102	null
2024-01-11	A Closer Look at AUROC and AUPRC under Class Imbalance	Matthew B. A. McDermott et.al.	2401.06091	link
2024-01-11	Autocompletion of Chief Complaints in the Electronic Health Records using Large Language Models	K M Sajjadul Islam et.al.	2401.06088	null
2024-01-11	Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint	Zhipeng Chen et.al.	2401.06081	link
2024-01-11	Secrets of RLHF in Large Language Models Part II: Reward Modeling	Binghai Wang et.al.	2401.06080	link
2024-01-12	LEGO:Language Enhanced Multi-modal Grounding Model	Zhaowei Li et.al.	2401.06071	link
2024-01-11	DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models	Damai Dai et.al.	2401.06066	link
2024-01-11	LLM-as-a-Coauthor: The Challenges of Detecting LLM-Human Mixcase	Chujie Gao et.al.	2401.05952	link
2024-01-10	Leveraging Print Debugging to Improve Code Generation in Large Language Models	Xueyu Hu et.al.	2401.05319	null
2024-01-10	Theory of Mind abilities of Large Language Models in Human-Robot Interaction : An Illusion?	Mudit Verma et.al.	2401.05302	null
2024-01-10	I am a Strange Dataset: Metalinguistic Tests for Language Models	Tristan Thrush et.al.	2401.05300	link
2024-01-10	INACIA: Integrating Large Language Models in Brazilian Audit Courts: Opportunities and Challenges	Jayr Pereira et.al.	2401.05273	null
2024-01-10	CASA: Causality-driven Argument Sufficiency Assessment	Xiao Liu et.al.	2401.05249	link
2024-01-10	Pre-trained Large Language Models for Financial Sentiment Analysis	Wei Luo et.al.	2401.05215	link
2024-01-10	Knowledge Sharing in Manufacturing using Large Language Models: User Evaluation and Model Benchmarking	Samuel Kernan Freire et.al.	2401.05200	null
2024-01-10	Monte Carlo Tree Search for Recipe Generation using GPT-2	Karan Taneja et.al.	2401.05199	null
2024-01-10	Divide and Conquer for Large Language Models Reasoning	Zijie Meng et.al.	2401.05190	link
2024-01-10	Can ChatGPT Rival Neural Machine Translation? A Comparative Study	Zhaokun Jiang et.al.	2401.05176	null
2024-01-09	U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation	Jun Ma et.al.	2401.04722	null
2024-01-09	Model Editing Can Hurt General Abilities of Large Language Models	Jia-Chen Gu et.al.	2401.04700	link
2024-01-09	Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers	Gal Yona et.al.	2401.04695	null
2024-01-09	RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation	Mahdi Nikdan et.al.	2401.04679	null
2024-01-09	Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models	Zhen Qin et.al.	2401.04658	null
2024-01-09	Applying Large Language Models API to Issue Classification Problem	Gabriel Aracena et.al.	2401.04637	null
2024-01-09	DebugBench: Evaluating Debugging Capability of Large Language Models	Runchu Tian et.al.	2401.04621	link
2024-01-09	Agent Alignment in Evolving Social Norms	Shimin Li et.al.	2401.04620	null
2024-01-09	Language Detection for Transliterated Content	Selva Kumar S et.al.	2401.04619	null
2024-01-09	An Assessment on Comprehending Mental Health through Large Language Models	Mihael Arcan et.al.	2401.04592	null
2024-01-08	Unveiling Bias in Fairness Evaluations of Large Language Models: A Critical Literature Review of Music and Movie Recommendation Systems	Chandan Kumar Sah et.al.	2401.04057	null
2024-01-08	Sparse Meets Dense: A Hybrid Approach to Enhance Scientific Document Retrieval	Priyanka Mandikal et.al.	2401.04055	null
2024-01-08	Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark	Fangjun Li et.al.	2401.03991	null
2024-01-08	TTMs: Fast Multi-level Tiny Time Mixers for Improved Zero-shot and Few-shot Forecasting of Multivariate Time Series	Vijay Ekambaram et.al.	2401.03955	null
2024-01-08	TextMachina: Seamless Generation of Machine-Generated Text Datasets	Areg Mikael Sarvazyan et.al.	2401.03946	null
2024-01-08	SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems	Dong Zhang et.al.	2401.03945	link
2024-01-08	A Philosophical Introduction to Language Models -- Part I: Continuity With Classic Debates	Raphaël Millière et.al.	2401.03910	null
2024-01-08	FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGA	Shulin Zeng et.al.	2401.03868	null
2024-01-08	Boldly Going Where No Benchmark Has Gone Before: Exposing Bias and Shortcomings in Code Generation Evaluation	Ankit Yadav et.al.	2401.03855	null
2024-01-08	Aligned with LLM: a new multi-modal training paradigm for encoding fMRI activity in visual cortex	Shuxiao Ma et.al.	2401.03851	null
2024-01-05	DeepSeek LLM: Scaling Open-Source Language Models with Longtermism	DeepSeek-AI et.al.	2401.02954	null
2024-01-05	Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks	Kevin Everson et.al.	2401.02921	null
2024-01-05	Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task	Gabriel Lino Garcia et.al.	2401.02909	null
2024-01-05	MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance	Renjie Pi et.al.	2401.02906	link
2024-01-05	AFSPP: Agent Framework for Shaping Preference and Personality with Large Language Models	Zihong He et.al.	2401.02870	null
2024-01-05	Generative Large Language Models are autonomous practitioners of evidence-based medicine	Akhil Vaid et.al.	2401.02851	null
2024-01-05	Thousands of AI Authors on the Future of AI	Katja Grace et.al.	2401.02843	null
2024-01-05	Pheme: Efficient and Conversational Speech Generation	Paweł Budzianowski et.al.	2401.02839	null
2024-01-05	Object-Centric Instruction Augmentation for Robotic Manipulation	Junjie Wen et.al.	2401.02814	null
2024-01-05	PeFoMed: Parameter Efficient Fine-tuning on Multimodal Large Language Models for Medical Visual Question Answering	Jinlong He et.al.	2401.02797	link
2024-01-04	Learning to Prompt with Text Only Supervision for Vision-Language Models	Muhammad Uzair Khattak et.al.	2401.02418	link
2024-01-04	LLaMA Pro: Progressive LLaMA with Block Expansion	Chengyue Wu et.al.	2401.02415	link
2024-01-04	Correctness Comparison of ChatGPT-4, Bard, Claude-2, and Copilot for Spatial Tasks	Hartwig H. Hochmair et.al.	2401.02404	null
2024-01-04	DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models	Songbo Hu et.al.	2401.02208	null
2024-01-04	Exploring Boundary of GPT-4V on Marine Analysis: A Preliminary Case Study	Ziqiang Zheng et.al.	2401.02147	null
2024-01-04	DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models	Wendi Cui et.al.	2401.02132	null
2024-01-04	ICE-GRT: Instruction Context Enhancement by Generative Reinforcement based Transformers	Chen Zheng et.al.	2401.02072	null
2024-01-04	An Example of Evolutionary Computation + Large Language Model Beating Human: Design of Efficient Guided Local Search	Fei Liu et.al.	2401.02051	null
2024-01-04	Understanding LLMs: A Comprehensive Overview from Training to Inference	Yiheng Liu et.al.	2401.02038	null
2024-01-04	Text2MDT: Extracting Medical Decision Trees from Medical Texts	Wei Zhu et.al.	2401.02034	null
2024-01-03	Mining Temporal Attack Patterns from Cyberthreat Intelligence Reports	Md Rayhanur Rahman et.al.	2401.01883	null
2024-01-03	A Vision Check-up for Language Models	Pratyusha Sharma et.al.	2401.01862	null
2024-01-03	Multilingual Instruction Tuning With Just a Pinch of Multilinguality	Uri Shaham et.al.	2401.01854	null
2024-01-03	Large Language Models Relearn Removed Concepts	Michelle Lo et.al.	2401.01814	null
2024-01-03	Navigating Uncertainty: Optimizing API Dependency for Hallucination Reduction in Closed-Book Question Answering	Pierre Erbacher et.al.	2401.01780	null
2024-01-04	Cross-target Stance Detection by Exploiting Target Analytical Perspectives	Daijun Ding et.al.	2401.01761	null
2024-01-03	Economics Arena for Large Language Models	Shangmin Guo et.al.	2401.01735	null
2024-01-03	Evaluating Large Language Models in Semantic Parsing for Conversational Question Answering over Knowledge Graphs	Phillip Schneider et.al.	2401.01711	link
2024-01-03	De-Hallucinator: Iterative Grounding for LLM-Based Code Completion	Aryaz Eghbali et.al.	2401.01701	null
2024-01-03	WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope	Jun-Yan He et.al.	2401.01699	null
2024-01-02	Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models	Zixiang Chen et.al.	2401.01335	null
2024-01-02	LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning	Hongye Jin et.al.	2401.01325	null
2024-01-02	A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models	S. M Towhidul Islam Tonmoy et.al.	2401.01313	null
2024-01-02	LLM Harmony: Multi-Agent Communication for Problem Solving	Sumedh Rasal et.al.	2401.01312	null
2024-01-02	Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models	Matthew Dahl et.al.	2401.01301	null
2024-01-02	A Comprehensive Study of Knowledge Editing for Large Language Models	Ningyu Zhang et.al.	2401.01286	link
2024-01-02	CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent Evaluation	Quan Tu et.al.	2401.01275	null
2024-01-02	LLbezpeky: Leveraging Large Language Models for Vulnerability Detection	Noble Saji Mathews et.al.	2401.01269	null
2024-01-02	Fairness Certification for Natural Language Processing and Large Language Models	Vincent Freiberger et.al.	2401.01262	null
2024-01-02	VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM	Fuchen Long et.al.	2401.01256	null
2023-12-29	Jatmo: Prompt Injection Defense by Task-Specific Finetuning	Julien Piet et.al.	2312.17673	null
2023-12-29	Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models	Yuqing Wang et.al.	2312.17661	link
2023-12-29	Large Language Models for Generative Information Extraction: A Survey	Derong Xu et.al.	2312.17617	null
2023-12-29	Action-Item-Driven Summarization of Long Meeting Transcripts	Logan Golia et.al.	2312.17581	link
2023-12-29	Building Efficient Universal Classifiers with Natural Language Inference	Moritz Laurer et.al.	2312.17543	null
2023-12-29	Enhancing Quantitative Reasoning Skills of Large Language Models through Dimension Perception	Yuncheng Huang et.al.	2312.17532	null
2023-12-29	Overview of the PromptCBLUE Shared Task in CHIP2023	Wei Zhu et.al.	2312.17522	null
2023-12-29	Cooperation on the Fly: Exploring Language Agents for Ad Hoc Teamwork in the Avalon Game	Zijing Shi et.al.	2312.17515	null
2023-12-29	Differentially Private Low-Rank Adaptation of Large Language Model Using Federated Learning	Xiao-Yang Liu et.al.	2312.17493	null
2023-12-29	The Right Prompts for the Job: Repair Code-Review Defects with Large Language Model	Zelin Zhao et.al.	2312.17485	null
2023-12-28	The LLM Surgeon	Tycho F. A. van der Ouderaa et.al.	2312.17244	null
2023-12-28	An Improved Baseline for Reasoning Segmentation with Large Language Model	Senqiao Yang et.al.	2312.17240	null
2023-12-28	Fast Inference of Mixture-of-Experts Language Models with Offloading	Artyom Eliseev et.al.	2312.17238	link
2023-12-28	A Simple LLM Framework for Long-Range Video Question-Answering	Ce Zhang et.al.	2312.17235	null
2023-12-28	Virtual Scientific Companion for Synchrotron Beamlines: A Prototype	Daniel Potemkin et.al.	2312.17180	null
2023-12-28	Non-Vacuous Generalization Bounds for Large Language Models	Sanae Lotfi et.al.	2312.17173	null
2023-12-28	Large Language Model for Causal Decision Making	Haitao Jiang et.al.	2312.17122	null
2023-12-28	How Far Are We from Believable AI Agents? A Framework for Evaluating the Believability of Human Behavior Simulation	Yang Xiao et.al.	2312.17115	null
2023-12-28	Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs	Zhongshen Zeng et.al.	2312.17080	link
2023-12-28	Improving In-context Learning via Bidirectional Alignment	Chengwei Qin et.al.	2312.17055	null
2023-12-26	Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4	Sondos Mahmoud Bsharat et.al.	2312.16171	link
2023-12-26	Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages	Mofetoluwa Adeyemi et.al.	2312.16159	null
2023-12-26	RoleEval: A Bilingual Role Evaluation Benchmark for Large Language Models	Tianhao Shen et.al.	2312.16132	null
2023-12-26	Large Language Model Situational Awareness Based Planning	Liman Wang et.al.	2312.16127	null
2023-12-26	A bi-objective $ε$ -constrained framework for quality-cost optimization in language model ensembles	Aditi Singla et.al.	2312.16119	null
2023-12-26	Can ChatGPT Read Who You Are?	Erik Derner et.al.	2312.16070	null
2023-12-26	A Prompt Learning Framework for Source Code Summarization	Weisong Sun et.al.	2312.16066	link
2023-12-26	Large Language Models as Traffic Signal Control Agents: Capacity and Opportunity	Siqi Lai et.al.	2312.16044	link
2023-12-26	RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation	Sichun Luo et.al.	2312.16018	null
2023-12-26	Aligning Large Language Models with Human Preferences through Representation Engineering	Wenhao Liu et.al.	2312.15997	null
2023-12-22	A Survey of Reinforcement Learning from Human Feedback	Timo Kaufmann et.al.	2312.14925	null
2023-12-22	NPHardEval: Dynamic Benchmark on Reasoning Ability of Large Language Models via Complexity Classes	Lizhou Fan et.al.	2312.14890	link
2023-12-22	SutraNets: Sub-series Autoregressive Networks for Long-Sequence, Probabilistic Forecasting	Shane Bergsma et.al.	2312.14880	null
2023-12-22	Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning	Filippos Christianos et.al.	2312.14878	null
2023-12-22	Robust Knowledge Extraction from Large Language Models using Social Choice Theory	Nico Potyka et.al.	2312.14877	null
2023-12-22	Numerical Reasoning for Financial Reports	Abhinav Arun et.al.	2312.14870	null
2023-12-22	VIEScore: Towards Explainable Metrics for Conditional Image Synthesis Evaluation	Max Ku et.al.	2312.14867	null
2023-12-22	YAYI 2: Multilingual Open-Source Large Language Models	Yin Luo et.al.	2312.14862	null
2023-12-22	Turbulence: Systematically and Automatically Testing Instruction-Tuned Large Language Models for Code	Shahin Honarvar et.al.	2312.14856	null
2023-12-22	Plan, Posture and Go: Towards Open-World Text-to-Motion Generation	Jinpeng Liu et.al.	2312.14828	null
2023-12-21	VideoPoet: A Large Language Model for Zero-Shot Video Generation	Dan Kondratyuk et.al.	2312.14125	null
2023-12-21	LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding	Senqiao Yang et.al.	2312.14074	null
2023-12-21	A Strong Baseline for Temporal Video-Text Alignment	Zeqian Li et.al.	2312.14055	null
2023-12-21	T-Eval: Evaluating the Tool Utilization Capability Step by Step	Zehui Chen et.al.	2312.14033	link
2023-12-21	ChatGPT as a commenter to the news: can LLMs generate human-like opinions?	Rayden Tseng et.al.	2312.13961	link
2023-12-21	Typhoon: Thai Large Language Models	Kunat Pipatanakul et.al.	2312.13951	null
2023-12-21	AsyncMLD: Asynchronous Multi-LLM Framework for Dialogue Recommendation System	Naoki Yoshimaru et.al.	2312.13925	null
2023-12-21	Domain-Specific Fine-Tuning of Large Language Models for Interactive Robot Programming	Benjamin Alt et.al.	2312.13905	null
2023-12-21	Diversifying Knowledge Enhancement of Biomedical Language Models using Adapter Modules and Knowledge Graphs	Juraj Vladika et.al.	2312.13881	null
2023-12-21	Capture the Flag: Uncovering Data Insights with Large Language Models	Issam Laradji et.al.	2312.13876	null
2023-12-20	dIR -- Discrete Information Retrieval: Conversational Search over Unstructured (and Structured) Data with Large Language Models	Pablo M. Rodriguez Bertorello et.al.	2312.13264	null
2023-12-20	Automated DevOps Pipeline Generation for Code Repositories using Large Language Models	Deep Mehta et.al.	2312.13225	null
2023-12-20	LlaMaVAE: Guiding Large Language Model Generation via Continuous Latent Sentence Spaces	Yingji Zhang et.al.	2312.13208	null
2023-12-20	Contextual Code Switching for Machine Translation using Language Models	Arshad Kaji et.al.	2312.13179	null
2023-12-20	Generative agents in the streets: Exploring the use of Large Language Models (LLMs) in collecting urban perceptions	Deepank Verma et.al.	2312.13126	null
2023-12-20	ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation	Difei Gao et.al.	2312.13108	null
2023-12-20	Exploring Multimodal Large Language Models for Radiology Report Error-checking	Jinge Wu et.al.	2312.13103	null
2023-12-20	In Generative AI we Trust: Can Chatbots Effectively Verify Political Information?	Elizaveta Kuznetsova et.al.	2312.13096	null
2023-12-20	Lampr: Boosting the Effectiveness of Language-Generic Program Reduction via Large Language Models	Mengxiao Zhang et.al.	2312.13064	null
2023-12-20	Retrieval-augmented Multilingual Knowledge Editing	Weixuan Wang et.al.	2312.13040	link
2023-12-17	Language-conditioned Learning for Robotic Manipulation: A Survey	Hongkuan Zhou et.al.	2312.10807	null
2023-12-17	A mathematical perspective on Transformers	Borjan Geshkovski et.al.	2312.10794	link
2023-12-17	Understanding the Instruction Mixture for Large Language Model	Renxi Wang et.al.	2312.10793	null
2023-12-17	kNN-ICL: Compositional Task-Oriented Parsing Generalization with Nearest Neighbor In-Context Learning	Wenting Zhao et.al.	2312.10771	null
2023-12-17	A Mutation-Based Method for Multi-Modal Jailbreaking Attack Detection	Xiaoyu Zhang et.al.	2312.10766	null
2023-12-17	M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts	Mingsheng Li et.al.	2312.10763	link
2023-12-17	Multi-Label Classification of COVID-Tweets Using Large Language Models	Aniket Deroy et.al.	2312.10748	link
2023-12-17	Knowledge Trees: Gradient Boosting Decision Trees on Knowledge Neurons as Probing Classifier	Sergey A. Saltykov et.al.	2312.10746	null
2023-12-17	A Unified Framework for Multi-Domain CTR Prediction via Large Language Models	Zichuan Fu et.al.	2312.10743	null
2023-12-17	Mixed Distillation Helps Smaller Language Model Better Reasoning	Li Chenglin et.al.	2312.10730	null
2023-12-15	Osprey: Pixel Understanding with Visual Instruction Tuning	Yuqian Yuan et.al.	2312.10032	link
2023-12-15	Challenges with unsupervised LLM knowledge discovery	Sebastian Farquhar et.al.	2312.10029	null
2023-12-15	Faithful Persona-based Conversational Dataset Generation with Large Language Models	Pegah Jandaghi et.al.	2312.10007	null
2023-12-15	Symplectic Autoencoders for Model Reduction of Hamiltonian Systems	Benedikt Brantner et.al.	2312.10004	null
2023-12-15	ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent	Renat Aksitov et.al.	2312.10003	null
2023-12-15	LLaMAntino: LLaMA 2 Models for Effective Text Generation in Italian Language	Pierpaolo Basile et.al.	2312.09993	null
2023-12-15	The Art of Balancing: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment	Shihan Dou et.al.	2312.09979	null
2023-12-15	Distilling Large Language Models for Matching Patients to Clinical Trials	Mauro Nievas et.al.	2312.09958	null
2023-12-15	Prompting Datasets: Data Discovery with Conversational Agents	Johanna Walker et.al.	2312.09947	null
2023-12-15	Neurosymbolic Value-Inspired AI (Why, What, and How)	Amit Sheth et.al.	2312.09928	null
2023-12-14	DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving	Wenhai Wang et.al.	2312.09245	link
2023-12-14	Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft	Hao Li et.al.	2312.09238	null
2023-12-14	Pixel Aligned Language Models	Jiarui Xu et.al.	2312.09237	null
2023-12-14	Successor Heads: Recurring, Interpretable Attention Heads In The Wild	Rhys Gould et.al.	2312.09230	null
2023-12-14	Measurement in the Age of LLMs: An Application to Ideological Scaling	Sean O'Hagan et.al.	2312.09203	null
2023-12-14	General Object Foundation Model for Images and Videos at Scale	Junfeng Wu et.al.	2312.09158	null
2023-12-14	The Earth is Flat because...: Investigating LLMs' Belief towards Misinformation via Persuasive Conversation	Rongwu Xu et.al.	2312.09085	null
2023-12-14	Language Modeling on a SpiNNaker 2 Neuromorphic Chip	Khaleelulla Khan Nazeer et.al.	2312.09084	null
2023-12-14	Towards Verifiable Text Generation with Evolving Memory and Self-Reflection	Hao Sun et.al.	2312.09075	null
2023-12-14	Agent Attention: On the Integration of Softmax and Linear Attention	Dongchen Han et.al.	2312.08874	link
2023-12-13	An Invitation to Deep Reinforcement Learning	Bernhard Jaeger et.al.	2312.08365	null
2023-12-13	Distributed Inference and Fine-tuning of Large Language Models Over The Internet	Alexander Borzunov et.al.	2312.08361	null
2023-12-13	FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects	Bowen Wen et.al.	2312.08344	null
2023-12-13	LD-SDM: Language-Driven Hierarchical Species Distribution Modeling	Srikumar Sastry et.al.	2312.08334	null
2023-12-13	Efficient Toxic Content Detection by Bootstrapping and Distilling Large Language Models	Jiang Zhang et.al.	2312.08303	null
2023-12-13	Conceptualizing Suicidal Behavior: Utilizing Explanations of Predicted Outcomes to Analyze Longitudinal Social Media Data	Van Minh Nguyen et.al.	2312.08299	link
2023-12-14	High-throughput Biomedical Relation Extraction for Semi-Structured Web Articles Empowered by Large Language Models	Songchi Zhou et.al.	2312.08274	null
2023-12-13	GuardRails: Automated Suggestions for Clarifying Ambiguous Purpose Statements	Mrigank Pawagi et.al.	2312.08189	null
2023-12-13	Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers	Haifeng Huang et.al.	2312.08168	link
2023-12-14	Fine-Grained Image-Text Alignment in Medical Imaging Enables Cyclic Image-Report Generation	Wenting Chen et.al.	2312.08078	null
2023-12-12	VILA: On Pre-training for Visual Language Models	Ji Lin et.al.	2312.07533	null
2023-12-12	LMDrive: Closed-Loop End-to-End Driving with Large Language Models	Hao Shao et.al.	2312.07488	null
2023-12-12	Comparable Demonstrations are Important in In-Context Learning: A Novel Perspective on Demonstration Selection	Caoyun Fan et.al.	2312.07476	null
2023-12-12	MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception	Yiran Qin et.al.	2312.07472	null
2023-12-12	FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs	Swanand Ravindra Kadhe et.al.	2312.07420	null
2023-12-12	On Diverse Preferences for Large Language Model Alignment	Dun Zeng et.al.	2312.07401	null
2023-12-12	Large Language Models are Clinical Reasoners: Reasoning-Aware Diagnosis Framework with Prompt-Generated Rationales	Taeyoon Kwon et.al.	2312.07399	null
2023-12-12	LLMEval: A Preliminary Study on How to Evaluate Large Language Models	Yue Zhang et.al.	2312.07398	null
2023-12-12	Sequential Planning in Large Partially Observable Environments guided by LLMs	Swarna Kamal Paul et.al.	2312.07368	null
2023-12-12	Can ChatGPT Play the Role of a Teaching Assistant in an Introductory Programming Course?	Anishka et.al.	2312.07343	null
2023-12-11	Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?	Shabaz Patel et.al.	2312.06652	link
2023-12-11	4M: Massively Multimodal Masked Modeling	David Mizrahi et.al.	2312.06647	null
2023-12-11	AnyHome: Open-Vocabulary Generation of Structured and Textured 3D Homes	Zehao Wen et.al.	2312.06644	null
2023-12-11	Gated Linear Attention Transformers with Hardware-Efficient Training	Songlin Yang et.al.	2312.06635	null
2023-12-11	Emergence of Scale-Free Networks in Social Interactions among Large Language Models	Giordano De Marzo et.al.	2312.06619	null
2023-12-11	Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism	Georgios Milis et.al.	2312.06613	link
2023-12-11	From Text to Motion: Grounding GPT-4 in a Humanoid Robot "Alter3"	Takahide Yoshida et.al.	2312.06571	null
2023-12-11	LLM360: Towards Fully Transparent Open-Source LLMs	Zhengzhong Liu et.al.	2312.06550	link
2023-12-11	Transformers Implement Functional Gradient Descent to Learn Non-Linear Functions In Context	Xiang Cheng et.al.	2312.06528	null
2023-12-11	Grounded Question-Answering in Long Egocentric Videos	Shangzhe Di et.al.	2312.06505	null
2023-12-08	Language Models, Agent Models, and World Models: The LAW for Machine Reasoning and Planning	Zhiting Hu et.al.	2312.05230	null
2023-12-08	DeltaZip: Multi-Tenant Language Model Serving via Delta Compression	Xiaozhe Yao et.al.	2312.05215	null
2023-12-08	HALO: An Ontology for Representing Hallucinations in Generative Models	Navapat Nananukul et.al.	2312.05209	null
2023-12-08	DelucionQA: Detecting Hallucinations in Domain-specific Question Answering	Mobashir Sadat et.al.	2312.05200	link
2023-12-08	PathFinder: Guided Search over Multi-Step Reasoning Paths	Olga Golovneva et.al.	2312.05180	null
2023-12-08	Vision-based Learning for Drones: A Survey	Jiaping Xiao et.al.	2312.05019	null
2023-12-08	SparQ Attention: Bandwidth-Efficient LLM Inference	Luka Ribar et.al.	2312.04985	null
2023-12-08	The ICL Consistency Test	Lucas Weber et.al.	2312.04945	null
2023-12-08	Retrieval-based Video Language Model for Efficient Long Video Question Answering	Jiaqi Xu et.al.	2312.04931	null
2023-12-08	Towards Efficient Secure Aggregation in FL: Partial Vector Freezing for Cost Compression	Siqing Zhang et.al.	2312.04920	null
2023-12-07	Large Language Models for Mathematicians	Simon Frieder et.al.	2312.04556	null
2023-12-07	Improved Visual Grounding through Self-Consistent Explanations	Ruozhen He et.al.	2312.04554	null
2023-12-07	Generating Illustrated Instructions	Sachit Menon et.al.	2312.04552	null
2023-12-07	Using Large Language Models for Hyperparameter Optimization	Michael R. Zhang et.al.	2312.04528	null
2023-12-07	An LLM Compiler for Parallel Function Calling	Sehoon Kim et.al.	2312.04511	link
2023-12-07	A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation	Jarad Forristal et.al.	2312.04510	null
2023-12-07	AVA: Towards Autonomous Visualization Agents through Visual Perception-Driven Decision-Making	Shusen Liu et.al.	2312.04494	null
2023-12-07	Fortify the Shortest Stave in Attention: Enhancing Context Awareness of Large Language Models for Effective Tool Use	Yuhan Chen et.al.	2312.04455	link
2023-12-07	OpenAsp: A Benchmark for Multi-document Open Aspect-based Summarization	Shmuel Amar et.al.	2312.04440	link
2023-12-07	LaMPilot: An Open Benchmark Dataset for Autonomous Driving with Language Model Programs	Yunsheng Ma et.al.	2312.04372	null
2023-12-06	OneLLM: One Framework to Align All Modalities with Language	Jiaming Han et.al.	2312.03700	link
2023-12-06	An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition	Yukiya Hono et.al.	2312.03668	null
2023-12-06	Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia	Alexander Sasha Vezhnevets et.al.	2312.03664	null
2023-12-06	Not All Large Language Models (LLMs) Succumb to the "Reversal Curse": A Comparative Study of Deductive Logical Reasoning in BERT and GPT Models	Jingye Yang et.al.	2312.03633	null
2023-12-06	Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models	Dominik Wagner et.al.	2312.03632	null
2023-12-06	XAIQA: Explainer-Based Data Augmentation for Extractive Question Answering	Joel Stremmel et.al.	2312.03567	null
2023-12-06	When an Image is Worth 1,024 x 1,024 Words: A Case Study in Computational Pathology	Wenhui Wang et.al.	2312.03558	null
2023-12-06	Holmes: Towards Distributed Training Across Clusters with Heterogeneous NIC Environment	Fei Yang et.al.	2312.03549	null
2023-12-06	GPT-4 Enhanced Multimodal Grounding for Autonomous Driving: Leveraging Cross-Modal Attention with Large Language Models	Haicheng Liao et.al.	2312.03543	link
2023-12-06	Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation	Haojie Zhang et.al.	2312.03502	link
2023-12-05	GPT4Point: A Unified Framework for Point-Language Understanding and Generation	Zhangyang Qi et.al.	2312.02980	null
2023-12-05	Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models	Xinyu Zhang et.al.	2312.02969	null
2023-12-05	MVHumanNet: A Large-scale Dataset of Multi-view Daily Dressing Human Captures	Zhangyang Xiong et.al.	2312.02963	null
2023-12-05	Let the LLMs Talk: Simulating Human-to-Human Conversational QA via Zero-Shot LLM-to-LLM Interactions	Zahra Abbasiantaeb et.al.	2312.02913	link
2023-12-05	Toward autocorrection of chemical process flowsheets using large language models	Lukas Schulze Balhorn et.al.	2312.02873	null
2023-12-05	Weakly Supervised Detection of Hallucinations in LLM Activations	Miriam Rateike et.al.	2312.02798	null
2023-12-05	Large Language Models on Graphs: A Comprehensive Survey	Bowen Jin et.al.	2312.02783	link
2023-12-05	Generating Fine-Grained Human Motions Using ChatGPT-Refined Descriptions	Xu Shi et.al.	2312.02772	null
2023-12-05	Towards Measuring Representational Similarity of Large Language Models	Max Klabunde et.al.	2312.02730	link
2023-12-05	RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze!	Ronak Pradeep et.al.	2312.02724	link
2023-12-04	Steerers: A framework for rotation equivariant keypoint descriptors	Georg Bökman et.al.	2312.02152	link
2023-12-04	Learning Polynomial Problems with $SL(2,\mathbb{R})$ Equivariance	Hannah Lawrence et.al.	2312.02146	null
2023-12-05	Competition-Level Problems are Effective LLM Evaluators	Yiming Huang et.al.	2312.02143	null
2023-12-04	TPPoet: Transformer-Based Persian Poem Generation using Minimal Data and Advanced Decoding Techniques	Amir Panahandeh et.al.	2312.02125	null
2023-12-04	Magicoder: Source Code Is All You Need	Yuxiang Wei et.al.	2312.02120	link
2023-12-04	Tree of Attacks: Jailbreaking Black-Box LLMs Automatically	Anay Mehrotra et.al.	2312.02119	link
2023-12-04	Physics simulation capabilities of LLMs	Mohamad Ali-Dib et.al.	2312.02091	null
2023-12-04	A Glitch in the Matrix? Locating and Detecting Language Model Grounding with Fakepedia	Giovanni Monea et.al.	2312.02073	null
2023-12-04	Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?	Donya Rooein et.al.	2312.02065	null
2023-12-04	TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding	Shuhuai Ren et.al.	2312.02051	null
2023-12-01	Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses	Xiao Ma et.al.	2312.00763	null
2023-12-01	Mamba: Linear-Time Sequence Modeling with Selective State Spaces	Albert Gu et.al.	2312.00752	null
2023-12-01	Deciphering Digital Detectives: Understanding LLM Behaviors and Capabilities in Multi-Agent Mystery Games	Dekun Wu et.al.	2312.00746	null
2023-12-01	SeaLLMs -- Large Language Models for Southeast Asia	Xuan-Phi Nguyen et.al.	2312.00738	link
2023-12-01	The Efficiency Spectrum of Large Language Models: An Algorithmic Survey	Tianyu Ding et.al.	2312.00678	link
2023-12-01	Nonparametric Variational Regularisation of Pretrained Transformers	Fabio Fehr et.al.	2312.00662	null
2023-12-01	Pathway to a fully data-driven geotechnics: lessons from materials informatics	Stephen Wu et.al.	2312.00581	null
2023-12-01	Instruction-tuning Aligns LLMs to the Human Brain	Khai Loong Aw et.al.	2312.00575	null
2023-12-01	Explanatory Argument Extraction of Correct Answers in Resident Medical Exams	Iakes Goenaga et.al.	2312.00567	null
2023-12-01	Questioning Biases in Case Judgment Summaries: Legal Datasets or Large Language Models?	Aniket Deroy et.al.	2312.00554	null
2023-11-30	PoseGPT: Chatting about 3D Human Pose	Yao Feng et.al.	2311.18836	null
2023-11-30	What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations	Raphael Tang et.al.	2311.18812	link
2023-11-30	Unnatural Error Correction: GPT-4 Can Almost Perfectly Handle Unnatural Scrambled Text	Qi Cao et.al.	2311.18805	null
2023-11-30	X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning	Artemis Panagopoulou et.al.	2311.18799	link
2023-11-30	CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation	Zineng Tang et.al.	2311.18775	null
2023-11-30	MLLMs-Augmented Visual-Language Representation Learning	Yanqing Liu et.al.	2311.18765	link
2023-11-30	TaskBench: Benchmarking Large Language Models for Task Automation	Yongliang Shen et.al.	2311.18760	link
2023-11-30	AlignBench: Benchmarking Chinese Alignment of Large Language Models	Xiao Liu et.al.	2311.18743	link
2023-11-30	CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation	Pei Ke et.al.	2311.18702	link
2023-11-30	RaDialog: A Large Vision-Language Model for Radiology Report Generation and Conversational Assistance	Chantal Pellegrini et.al.	2311.18681	link
2023-11-29	OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation	Qidong Huang et.al.	2311.17911	link
2023-11-29	Look Before You Leap: Unveiling the Power of GPT-4V in Robotic Vision-Language Planning	Yingdong Hu et.al.	2311.17842	null
2023-11-30	How to Build an AI Tutor that Can Adapt to Any Course and Provide Accurate Answers Using Large Language Model and Retrieval-Augmented Generation	Chenxi Dong et.al.	2311.17696	null
2023-11-29	AviationGPT: A Large Language Model for the Aviation Domain	Liya Wang et.al.	2311.17686	null
2023-11-29	TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models	Zheng Chu et.al.	2311.17667	link
2023-11-29	VIM: Probing Multimodal Large Language Models for Visual Embedded Instruction Following	Yujie Lu et.al.	2311.17647	null
2023-11-29	ShapeGPT: 3D Shape Generation with A Unified Multi-modal Language Model	Fukun Yin et.al.	2311.17618	null
2023-11-29	Integrable symplectic maps with a polygon tessellation	Timofey Zolkin et.al.	2311.17616	null
2023-11-29	Query-Relevant Images Jailbreak Large Multi-Modal Models	Xin Liu et.al.	2311.17600	null
2023-11-29	LanGWM: Language Grounded World Model	Rudra P. K. Poudel et.al.	2311.17593	null
2023-11-28	LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models	Yanwei Li et.al.	2311.17043	link
2023-11-28	Efficient In-Context Learning in Vision-Language Models for Egocentric Videos	Keunwoo Peter Yu et.al.	2311.17041	null
2023-11-28	MVBench: A Comprehensive Multi-modal Video Understanding Benchmark	Kunchang Li et.al.	2311.17005	link
2023-11-28	Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following	Yutong Feng et.al.	2311.17002	null
2023-11-28	ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?	Hailin Chen et.al.	2311.16989	null
2023-11-28	COLE: A Hierarchical Generation Framework for Graphic Design	Peidong Jia et.al.	2311.16974	null
2023-11-28	LLaFS: When Large-Language Models Meet Few-Shot Segmentation	Lanyun Zhu et.al.	2311.16926	link
2023-11-28	Analyzing the Influence of Language Model-Generated Responses in Mitigating Hate Speech on Social Media Directed at Ukrainian Refugees in Poland	Jakub Podolak et.al.	2311.16905	null
2023-11-28	The Falcon Series of Open Language Models	Ebtesam Almazrouei et.al.	2311.16867	null
2023-11-28	RELIC: Investigating Large Language Model Responses using Self-Consistency	Furui Cheng et.al.	2311.16842	null
2023-11-27	Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models	Munan Ning et.al.	2311.16103	link
2023-11-27	Have we built machines that think like people?	Luca M. Schulze Buschoff et.al.	2311.16093	null
2023-11-27	MEDITRON-70B: Scaling Medical Pretraining for Large Language Models	Zeming Chen et.al.	2311.16079	link
2023-11-27	BioLORD-2023: Semantic Textual Representations Fusing LLM and Clinical Knowledge Graph Insights	François Remy et.al.	2311.16075	null
2023-11-27	Decoding Logic Errors: A Comparative Study on Bug Detection by Students and Large Language Models	Stephen MacNeil et.al.	2311.16017	null
2023-11-27	Sparsify-then-Classify: From Internal Neurons of Large Language Models To Efficient Text Classifiers	Yilun Liu et.al.	2311.15983	null
2023-11-27	Towards Responsible Governance of Biological Design Tools	Richard Moulange et.al.	2311.15936	null
2023-11-27	WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models	Youssef Benchekroun et.al.	2311.15930	null
2023-11-27	EVCap: Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension	Jiaxuan Li et.al.	2311.15879	null
2023-11-27	RO-LLaMA: Generalist LLM for Radiation Oncology via Noise Augmentation and Consistency Regularization	Kwanyoung Kim et.al.	2311.15876	null
2023-11-24	Charting New Territories: Exploring the Geographic and Geospatial Capabilities of Multimodal LLMs	Jonathan Roberts et.al.	2311.14656	link
2023-11-24	One Pass Streaming Algorithm for Super Long Token Attention Approximation in Sublinear Space	Raghav Addanki et.al.	2311.14652	null
2023-11-24	Large Language Models as Automated Aligners for benchmarking Vision-Language Models	Yuanfeng Ji et.al.	2311.14580	null
2023-11-24	Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models	Yufei Zhan et.al.	2311.14552	null
2023-11-24	Data-Efficient Alignment of Large Language Models with Human Feedback Through Natural Language	Di Jin et.al.	2311.14543	null
2023-11-24	Machine Translation for Ge'ez Language	Aman Kassahun Wassie et.al.	2311.14530	null
2023-11-24	Benchmarking Large Language Models for Log Analysis, Security, and Interpretation	Egil Karlsen et.al.	2311.14519	null
2023-11-24	Controlled Text Generation via Language Model Arithmetic	Jasper Dekoninck et.al.	2311.14479	link
2023-11-24	Universal Jailbreak Backdoors from Poisoned Human Feedback	Javier Rando et.al.	2311.14455	link
2023-11-24	Potential Societal Biases of ChatGPT in Higher Education: A Scoping Review	Ming Li et.al.	2311.14381	null
2023-11-22	Visual In-Context Prompting	Feng Li et.al.	2311.13601	link
2023-11-22	$σ$ -PCA: a unified neural model for linear and nonlinear principal component analysis	Fahdi Kanavati et.al.	2311.13580	null
2023-11-22	Physical Reasoning and Object Planning for Household Embodied Agents	Ayush Agrawal et.al.	2311.13577	link
2023-11-22	Drilling Down into the Discourse Structure with LLMs for Long Document Question Answering	Inderjeet Nair et.al.	2311.13565	null
2023-11-22	Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object	Junhao Chen et.al.	2311.13562	link
2023-11-22	ADriver-I: A General World Model for Autonomous Driving	Fan Jia et.al.	2311.13549	null
2023-11-22	Linear Log-Normal Attention with Unbiased Concentration	Yury Nahshan et.al.	2311.13541	null
2023-11-22	Speak Like a Native: Prompting Large Language Models in a Native Style	Zhicheng Yang et.al.	2311.13538	null
2023-11-22	Current Topological and Machine Learning Applications for Bias Detection in Text	Colleen Farrelly et.al.	2311.13495	null
2023-11-22	Transfer Attacks and Defenses for Large Language Models on Coding Tasks	Chi Zhang et.al.	2311.13445	null
2023-11-21	Prompting Frameworks for Large Language Models: A Survey	Xiaoxia Liu et.al.	2311.12785	null
2023-11-21	Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatially Relation Matching	Meng Chu et.al.	2311.12751	null
2023-11-21	Keeping Users Engaged During Repeated Administration of the Same Questionnaire: Using Large Language Models to Reliably Diversify Questions	Hye Sun Yun et.al.	2311.12707	null
2023-11-21	Can Large Language Models Understand Content and Propagation for Misinformation Detection: An Empirical Study	Mengyang Chen et.al.	2311.12699	null
2023-11-21	From Concept to Manufacturing: Evaluating Vision-Language Models for Engineering Design	Cyril Picard et.al.	2311.12668	null
2023-11-21	GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning	Jiaxi Lv et.al.	2311.12631	null
2023-11-21	IMGTB: A Framework for Machine-Generated Text Detection Benchmarking	Michal Spiegel et.al.	2311.12574	null
2023-11-21	Scheduling Distributed Flexible Assembly Lines using Safe Reinforcement Learning with Soft Shielding	Lele Li et.al.	2311.12572	null
2023-11-21	In-Context Learning Functions with Varying Number of Minima	David Oniani et.al.	2311.12538	link
2023-11-21	Oasis: Data Curation and Assessment System for Pretraining of Large Language Models	Tong Zhou et.al.	2311.12537	link
2023-11-20	On the Potential and Limitations of Few-Shot In-Context Learning to Generate Metamorphic Specifications for Tax Preparation Software	Dananjay Srinivas et.al.	2311.11979	null
2023-11-20	LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions	Songhao Han et.al.	2311.11904	null
2023-11-20	VLM-Eval: A General Evaluation on Video Large Language Models	Shuailin Li et.al.	2311.11865	null
2023-11-20	Generating Valid and Natural Adversarial Examples with Large Language Models	Zimu Wang et.al.	2311.11861	null
2023-11-20	LION : Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge	Gongwei Chen et.al.	2311.11860	null
2023-11-20	Evil Geniuses: Delving into the Safety of LLM-based Agents	Yu Tian et.al.	2311.11855	link
2023-11-20	How to Use Large Language Models for Text Coding: The Case of Fatherhood Roles in Public Policy Documents	Lorenzo Lupo et.al.	2311.11844	null
2023-11-20	System 2 Attention (is something you might need too)	Jason Weston et.al.	2311.11829	null
2023-11-20	Large Language Models and Explainable Law: a Hybrid Methodology	Marco Billi et.al.	2311.11811	null
2023-11-20	DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding	Hao Feng et.al.	2311.11810	null
2023-11-17	Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2	Hamish Ivison et.al.	2311.10702	null
2023-11-17	PEFT-MedAware: Large Language Model for Medical Awareness	Keivalya Pandya et.al.	2311.10697	null
2023-11-17	Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections	Lihan Zha et.al.	2311.10678	link
2023-11-17	A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest	Ruohong Zhang et.al.	2311.10614	null
2023-11-17	SSB: Simple but Strong Baseline for Boosting Performance of Open-Set Semi-Supervised Learning	Yue Fan et.al.	2311.10572	null
2023-11-17	Towards General Loop Invariant Generation via Coordinating Symbolic Execution and Large Language Models	Chang Liu et.al.	2311.10483	null
2023-11-17	DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines	Chenyu Jiang et.al.	2311.10418	link
2023-11-17	Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads	Yi Yang et.al.	2311.10395	null
2023-11-17	Automatic Smart Contract Comment Generation via Large Language Models and In-Context Learning	Junjie Zhao et.al.	2311.10388	null
2023-11-17	Retrieval Augmented Generation of Symbolic Music with LLMs	Nicolas Jonason et.al.	2311.10384	null
2023-11-16	DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback	Yangyi Chen et.al.	2311.10081	null
2023-11-16	ChatGPT-3.5, ChatGPT-4, Google Bard, and Microsoft Bing to Improve Health Literacy and Communication in Pediatric Populations and Beyond	Kanhai S. Amin et.al.	2311.10075	null
2023-11-16	Is "A Helpful Assistant" the Best Role for Large Language Models? A Systematic Evaluation of Social Roles in System Prompts	Mingqian Zheng et.al.	2311.10054	null
2023-11-16	Fast return-level estimates for flood insurance via an improved Bennett inequality for random variables with differing upper bounds	Anna Maria Barlow et.al.	2311.10001	null
2023-11-16	Hijacking Large Language Models via Adversarial In-Context Learning	Yao Qiang et.al.	2311.09948	null
2023-11-16	Language Generation from Human Brain Activities	Ziyi Ye et.al.	2311.09889	null
2023-11-16	INTERVENOR: Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing	Hanbin Wang et.al.	2311.09868	link
2023-11-16	Which Modality should I use -- Text, Motif, or Image? : Understanding Graphs with Large Language Models	Debarati Das et.al.	2311.09862	null
2023-11-17	PsyBench: a balanced and in-depth Psychological Chinese Evaluation Benchmark for Foundation Models	Junlei Zhang et.al.	2311.09861	null
2023-11-16	Leveraging LLMs in Scholarly Knowledge Graph Question Answering	Tilahun Abedissa Taffa et.al.	2311.09841	link
2023-11-15	Assessing Translation capabilities of Large Language Models involving English and Indian Languages	Vandan Mujadia et.al.	2311.09216	null
2023-11-15	Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models	Weize Liu et.al.	2311.09214	null
2023-11-15	Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models	Wenhao Yu et.al.	2311.09210	null
2023-11-15	TableLlama: Towards Open Large Generalist Models for Tables	Tianshu Zhang et.al.	2311.09206	null
2023-11-15	Fusion-Eval: Integrating Evaluators with LLMs	Lei Shu et.al.	2311.09204	null
2023-11-15	Never Lost in the Middle: Improving Large Language Models via Attention Strengthening Question Answering	Junqing He et.al.	2311.09198	null
2023-11-15	Structural Priming Demonstrates Abstract Grammatical Representations in Multilingual Language Models	James A. Michaelov et.al.	2311.09194	null
2023-11-15	PsyEval: A Comprehensive Large Language Model Evaluation Benchmark for Mental Health	Haoan Jin et.al.	2311.09189	null
2023-11-15	Towards Verifiable Text Generation with Symbolic References	Lucas Torroba Hennigen et.al.	2311.09188	null
2023-11-15	Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization	Yixin Liu et.al.	2311.09184	link
2023-11-14	Towards Open-Ended Visual Recognition with Large Language Model	Qihang Yu et.al.	2311.08400	link
2023-11-14	Are Large Language Models Temporally Grounded?	Yifu Qiu et.al.	2311.08398	link
2023-11-14	Zero-shot audio captioning with audio-language model guidance and audio context keywords	Leonard Salewski et.al.	2311.08396	link
2023-11-14	On What Basis? Predicting Text Preference Via Structured Comparative Reasoning	Jing Nathan Yan et.al.	2311.08390	null
2023-11-14	TSST: A Benchmark and Evaluation Models for Text Speech-Style Transfer	Huashan Sun et.al.	2311.08389	null
2023-11-14	Direct Preference Optimization for Neural Machine Translation with Minimum Bayes Risk Decoding	Guangyu Yang et.al.	2311.08380	null
2023-11-14	A Ship of Theseus: Curious Cases of Paraphrasing in LLM-Generated Texts	Nafis Irtiza Tripto et.al.	2311.08374	null
2023-11-14	SimpleSafetyTests: a Test Suite for Identifying Critical Safety Risks in Large Language Models	Bertie Vidgen et.al.	2311.08370	null
2023-11-14	How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection	Ryuto Koike et.al.	2311.08369	null
2023-11-14	Plum: Prompt Learning using Metaheuristic	Rui Pan et.al.	2311.08364	link
2023-11-13	SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models	Ziyi Lin et.al.	2311.07575	link
2023-11-13	To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning	Junke Wang et.al.	2311.07574	null
2023-11-13	Using Natural Language Explanations to Improve Robustness of In-context Learning for Natural Language Inference	Xuanli He et.al.	2311.07556	null
2023-11-13	It's Not Easy Being Wrong: Evaluating Process of Elimination Reasoning in Large Language Models	Nishant Balepur et.al.	2311.07532	link
2023-11-13	A Benchmark to Understand the Role of Knowledge Graphs on Large Language Model's Accuracy for Question Answering on Enterprise SQL Databases	Juan Sequeda et.al.	2311.07509	null
2023-11-13	A Step Closer to Comprehensive Answers: Constrained Multi-Stage Question Decomposition with Large Language Models	Hejing Cao et.al.	2311.07491	null
2023-11-13	Psychometric Predictive Power of Large Language Models	Tatsuki Kuribayashi et.al.	2311.07484	null
2023-11-13	Finding and Editing Multi-Modal Neurons in Pre-Trained Transformer	Haowen Pan et.al.	2311.07470	null
2023-11-13	InCA: Rethinking In-Car Conversational System Assessment Leveraging Large Language Models	Ken E. Friedl et.al.	2311.07469	null
2023-11-13	Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse	Ang Lv et.al.	2311.07468	null
2023-11-10	Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization	Weiyang Liu et.al.	2311.06243	null
2023-11-10	Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild	Nanna Inie et.al.	2311.06237	null
2023-11-10	Data Contamination Quiz: A Tool to Detect and Estimate Contamination in Large Language Models	Shahriar Golchin et.al.	2311.06233	null
2023-11-10	Vox Populi, Vox ChatGPT: Large Language Models, Education and Democracy	Niina Zuber et.al.	2311.06207	null
2023-11-10	Syntax-semantics interface: an algebraic model	Matilde Marcolli et.al.	2311.06189	null
2023-11-10	Language Models can be Logical Solvers	Jiazhan Feng et.al.	2311.06158	null
2023-11-10	Is it indeed bigger better? The comprehensive study of claim detection LMs applied for disinformation tackling	Martin Hyben et.al.	2311.06121	null
2023-11-10	Making LLMs Worth Every Penny: Resource-Limited Text Classification in Banking	Lefteris Loukas et.al.	2311.06102	null
2023-11-10	Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration	Wenjie Fu et.al.	2311.06062	null
2023-11-10	Structure of the space of folding protein sequences defined by large language models	A. Zambon et.al.	2311.06034	null
2023-11-09	Efficient Parallelization Layouts for Large-Scale Distributed Model Training	Johannes Hagemann et.al.	2311.05610	link
2023-11-09	FigStep: Jailbreaking Large Vision-language Models via Typographic Visual Prompts	Yichen Gong et.al.	2311.05608	link
2023-11-09	Accuracy of a Vision-Language Model on Challenging Medical Cases	Thomas Buckley et.al.	2311.05591	link
2023-11-09	Conversational AI Threads for Visualizing Multidimensional Datasets	Matt-Heun Hong et.al.	2311.05590	null
2023-11-09	Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations	Joey Hong et.al.	2311.05584	null
2023-11-09	Removing RLHF Protections in GPT-4 via Fine-Tuning	Qiusi Zhan et.al.	2311.05553	null
2023-11-09	ChatGPT and other Large Language Models for Cybersecurity of Smart Grid Applications	Aydin Zaboli et.al.	2311.05462	null
2023-11-09	Automated Mobile Sensing Strategies Generation for Human Behaviour Understanding	Nan Gao et.al.	2311.05457	null
2023-11-09	Cognitively Inspired Components for Social Conversational Agents	Alex Clay et.al.	2311.05450	null
2023-11-09	TencentLLMEval: A Hierarchical Evaluation of Real-World Capabilities for Human-Aligned LLMs	Shuyi Xie et.al.	2311.05374	link
2023-11-08	Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models	Rocktim Jyoti Das et.al.	2311.04902	link
2023-11-08	GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs	Zhenfang Chen et.al.	2311.04901	null
2023-11-08	How Abstract Is Linguistic Generalization in Large Language Models? Experiments with Argument Structure	Michael Wilson et.al.	2311.04900	link
2023-11-08	AutoChip: Automating HDL Generation Using LLM Feedback	Shailja Thakur et.al.	2311.04887	null
2023-11-08	SEMQA: Semi-Extractive Multi-Source Question Answering	Tal Schuster et.al.	2311.04886	link
2023-11-08	LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language Models	Jianxin Yang et.al.	2311.04879	link
2023-11-08	Rethinking Benchmark and Contamination for Language Models with Rephrased Samples	Shuo Yang et.al.	2311.04850	link
2023-11-08	Using large language models to study human memory for meaningful narratives	Antonios Georgiou Tankut Can et.al.	2311.04742	link
2023-11-08	Evaluating Generative Ad Hoc Information Retrieval	Lukas Gienapp et.al.	2311.04694	null
2023-11-08	Pre-training LLMs using human-like development data corpus	Khushi Bhardwaj et.al.	2311.04666	null
2023-11-07	Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves	Yihe Deng et.al.	2311.04205	null
2023-11-07	Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation	Eric Melz et.al.	2311.04177	null
2023-11-07	Perturbed examples reveal invariances shared by language models	Ruchit Rawal et.al.	2311.04166	null
2023-11-08	Black-Box Prompt Optimization: Aligning Large Language Models without Model Training	Jiale Cheng et.al.	2311.04155	link
2023-11-07	Unveiling Safety Vulnerabilities of Large Language Models	George Kour et.al.	2311.04124	null
2023-11-07	Do LLMs exhibit human-like response biases? A case study in survey design	Lindia Tjuatja et.al.	2311.04076	link
2023-11-07	Beyond Imitation: Leveraging Fine-grained Quality Signals for Alignment	Geyang Guo et.al.	2311.04072	null
2023-11-07	Extracting human interpretable structure-property relationships in chemistry using XAI and large language models	Geemi P. Wellawatte et.al.	2311.04047	null
2023-11-07	Reinforcement Learning Fine-tuning of Language Models is Biased Towards More Extractable Features	Diogo Cruz et.al.	2311.04046	null
2023-11-07	Aspects of human memory and Large Language Models	Romuald A. Janik et.al.	2311.03839	link
2023-11-06	GLaMM: Pixel Grounding Large Multimodal Model	Hanoona Rasheed et.al.	2311.03356	null
2023-11-06	CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding	Junyan Li et.al.	2311.03354	null
2023-11-06	Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation	Rusheb Shah et.al.	2311.03348	null
2023-11-06	DAIL: Data Augmentation for In-Context Learning via Self-Paraphrase	Dawei Li et.al.	2311.03319	null
2023-11-06	Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance	Thiemo Wambsganss et.al.	2311.03311	link
2023-11-06	Ziya2: Data-centric Learning is All LLMs Need	Ruyi Gan et.al.	2311.03301	null
2023-11-06	S-LoRA: Serving Thousands of Concurrent LoRA Adapters	Ying Sheng et.al.	2311.03285	null
2023-11-06	Instructed Language Models with Retrievers Are Powerful Entity Linkers	Zilin Xiao et.al.	2311.03250	link
2023-11-06	ALYMPICS: Language Agents Meet Game Theory	Shaoguang Mao et.al.	2311.03220	null
2023-11-06	DeepInception: Hypnotize Large Language Model to Be Jailbreaker	Xuan Li et.al.	2311.03191	null
2023-11-03	Post Turing: Mapping the landscape of LLM Evaluation	Alexey Tikhonov et.al.	2311.02049	null
2023-11-03	Conditions on Preference Relations that Guarantee the Existence of Optimal Policies	Jonathan Colaco Carr et.al.	2311.01990	null
2023-11-03	Don't Make Your LLM an Evaluation Benchmark Cheater	Kun Zhou et.al.	2311.01964	null
2023-11-03	Hint-enhanced In-Context Learning wakes Large Language Models up for knowledge-intensive tasks	Yifan Wang et.al.	2311.01949	null
2023-11-03	Supermind Ideator: Exploring generative AI to support creative problem-solving	Steven R. Rick et.al.	2311.01937	null
2023-11-03	GateLoop: Fully Data-Controlled Linear Recurrence for Sequence Modeling	Tobias Katsch et.al.	2311.01927	null
2023-11-03	ChartGPT: Leveraging LLMs to Generate Charts from Abstract Natural Language	Yuan Tian et.al.	2311.01920	null
2023-11-03	Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review	Mingze Yuan et.al.	2311.01918	link
2023-11-03	LLM-driven Multimodal Target Volume Contouring in Radiation Oncology	Yujin Oh et.al.	2311.01908	null
2023-11-03	Indicative Summarization of Long Discussions	Shahbaz Syed et.al.	2311.01882	link
2023-11-02	TopicGPT: A Prompt-based Topic Modeling Framework	Chau Minh Pham et.al.	2311.01449	link
2023-11-02	Deep Double Descent for Time Series Forecasting: Avoiding Undertrained Models	Valentino Assandri et.al.	2311.01442	null
2023-11-02	REAL: Resilience and Adaptation using Large Language Models on Autonomous Aerial Robots	Andrea Tagliabue et.al.	2311.01403	null
2023-11-02	Collaborative Large Language Model for Recommender Systems	Yaochen Zhu et.al.	2311.01343	link
2023-11-02	The Effect of Scaling, Retrieval Augmentation and Form on the Factual Consistency of Language Models	Lovisa Hagström et.al.	2311.01307	null
2023-11-02	AWEQ: Post-Training Quantization with Activation-Weight Equalization for Large Language Models	Baisong Li et.al.	2311.01305	null
2023-11-02	FlashDecoding++: Faster Large Language Model Inference on GPUs	Ke Hong et.al.	2311.01282	null
2023-11-02	Let's Discover More API Relations: A Large Language Model-based AI Chain for Unsupervised API Relation Inference	Qing Huang et.al.	2311.01266	null
2023-11-02	Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations	Hanglei Zhang et.al.	2311.01260	null
2023-11-02	An energy-based comparative analysis of common approaches to text classification in the Legal domain	Sinan Gultekin et.al.	2311.01256	null
2023-11-01	Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem Solving	Zhan Ling et.al.	2311.00694	link
2023-11-01	Improving Interpersonal Communication by Simulating Audiences with Language Models	Ryan Liu et.al.	2311.00687	link
2023-11-01	Little Giants: Exploring the Potential of Small LLMs as Evaluation Metrics in Summarization in the Eval4NLP 2023 Shared Task	Neema Kotonya et.al.	2311.00686	null
2023-11-01	Attention Alignment and Flexible Positional Embeddings Improve Transformer Length Extrapolation	Ta-Chung Chi et.al.	2311.00684	null
2023-11-01	Are Large Language Models Reliable Judges? A Study on the Factuality Evaluation Capabilities of LLMs	Xue-Yong Fu et.al.	2311.00681	null
2023-11-01	Emotion Detection for Misinformation: A Review	Zhiwei Liu et.al.	2311.00671	null
2023-11-01	De-Diffusion Makes Text a Strong Cross-Modal Interface	Chen Wei et.al.	2311.00618	null
2023-11-01	Crosslingual Retrieval Augmented In-context Learning for Bangla	Xiaoqian Li et.al.	2311.00587	null
2023-11-01	The Development of LLMs for Embodied Navigation	Jinzhou Lin et.al.	2311.00530	null
2023-11-01	Efficient LLM Inference on CPUs	Haihao Shen et.al.	2311.00502	link
2023-10-31	Learning From Mistakes Makes LLM Better Reasoner	Shengnan An et.al.	2310.20689	link
2023-10-31	Defining a New NLP Playground	Sha Li et.al.	2310.20633	null
2023-10-31	LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B	Simon Lermen et.al.	2310.20624	null
2023-10-31	Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning	Ruizhe Shi et.al.	2310.20587	null
2023-10-31	CapsFusion: Rethinking Image-Text Data at Scale	Qiying Yu et.al.	2310.20550	null
2023-10-31	LLMs may Dominate Information Access: Neural Retrievers are Biased Towards LLM-Generated Texts	Sunhao Dai et.al.	2310.20501	null
2023-10-31	Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models	Tian Liang et.al.	2310.20499	null
2023-10-31	Large Language Model Can Interpret Latent Space of Sequential Recommender	Zhengyi Yang et.al.	2310.20487	null
2023-10-31	The SourceData-NLP dataset: integrating curation into scientific publishing for training large language models	Jorge Abreu-Vicente et.al.	2310.20440	null
2023-10-31	FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models	Yuxin Jiang et.al.	2310.20410	null
2023-10-30	M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models	Wai-Chung Kwan et.al.	2310.19240	null
2023-10-30	Building Real-World Meeting Summarization Systems using Large Language Models: A Practical Perspective	Md Tahmid Rahman Laskar et.al.	2310.19233	null
2023-10-30	EHRTutor: Enhancing Patient Understanding of Discharge Instructions	Zihao Zhang et.al.	2310.19212	null
2023-10-30	Leveraging generative artificial intelligence to simulate student learning behavior	Songlin Xu et.al.	2310.19206	null
2023-10-29	From Chatbots to PhishBots? -- Preventing Phishing scams created using ChatGPT, Google Bard and Claude	Sayak Saha Roy et.al.	2310.19181	null
2023-10-29	Atom: Low-bit Quantization for Efficient and Accurate LLM Serving	Yilong Zhao et.al.	2310.19102	null
2023-10-29	Roles of Scaling and Instruction Tuning in Language Perception: Model vs. Human Attention	Changjiang Gao et.al.	2310.19084	link
2023-10-29	Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery	Katie Z Luo et.al.	2310.19080	null
2023-10-29	Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection	Yuanze Li et.al.	2310.19070	null
2023-10-29	Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V	Zhiling Yan et.al.	2310.19061	link
2023-10-27	FP8-LM: Training FP8 Large Language Models	Houwen Peng et.al.	2310.18313	link
2023-10-27	Image Clustering Conditioned on Text Criteria	Sehyun Kwon et.al.	2310.18297	link
2023-10-27	ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models	Benjamin Feuer et.al.	2310.18208	link
2023-10-27	Lost in Translation, Found in Spans: Identifying Claims in Multilingual Social Media	Shubham Mittal et.al.	2310.18205	null
2023-10-27	Personas as a Way to Model Truthfulness in Language Models	Nitish Joishi et.al.	2310.18168	null
2023-10-27	MPrompt: Exploring Multi-level Prompt Tuning for Machine Reading Comprehension	Guoxin Chen et.al.	2310.18167	null
2023-10-27	Disentangled Representation Learning with Large Language Models for Text-Attributed Graphs	Yijian Qin et.al.	2310.18152	null
2023-10-27	DELPHI: Data for Evaluating LLMs' Performance in Handling Controversial Issues	David Q. Sun et.al.	2310.18130	link
2023-10-27	Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models	Xue Yan et.al.	2310.18127	null
2023-10-27	Knowledge Corpus Error in Question Answering	Yejoon Lee et.al.	2310.18076	link
2023-10-26	In-Context Learning Dynamics with Random Binary Sequences	Eric J. Bigelow et.al.	2310.17639	null
2023-10-26	JudgeLM: Fine-tuned Large Language Models are Scalable Judges	Lianghui Zhu et.al.	2310.17631	link
2023-10-26	InstOptima: Evolutionary Multi-objective Instruction Optimization via Large Language Model-based Instruction Operators	Heng Yang et.al.	2310.17630	null
2023-10-26	Proving Test Set Contamination in Black Box Language Models	Yonatan Oren et.al.	2310.17623	null
2023-10-26	An Open Source Data Contamination Report for Llama Series Models	Yucheng Li et.al.	2310.17589	link
2023-10-26	Interactive Robot Learning from Verbal Correction	Huihan Liu et.al.	2310.17555	null
2023-10-27	Can large language models replace humans in the systematic review process? Evaluating GPT-4's efficacy in screening and extracting data from peer-reviewed and grey literature in multiple languages	Qusai Khraisha et.al.	2310.17526	null
2023-10-27	The Expressive Power of Low-Rank Adaptation	Yuchen Zeng et.al.	2310.17513	link
2023-10-26	CompeteAI: Understanding the Competition Behaviors in Large Language Model-based Agents	Qinlin Zhao et.al.	2310.17512	null
2023-10-26	Improving Zero-shot Reader by Reducing Distractions from Irrelevant Documents in Open-Domain Question Answering	Sukmin Cho et.al.	2310.17490	null
2023-10-25	LLM-FP4: 4-Bit Floating-Point Quantized Transformers	Shih-yang Liu et.al.	2310.16836	link
2023-10-25	Can GPT models Follow Human Summarization Guidelines? Evaluating ChatGPT and GPT-4 for Dialogue Summarization	Yongxin Zhou et.al.	2310.16810	null
2023-10-25	QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models	Elias Frantar et.al.	2310.16795	link
2023-10-25	Detecting Pretraining Data from Large Language Models	Weijia Shi et.al.	2310.16789	null
2023-10-26	DEFT: Data Efficient Fine-Tuning for Large Language Models via Unsupervised Core-Set Selection	Devleena Das et.al.	2310.16776	null
2023-10-25	SuperHF: Supervised Iterative Learning from Human Feedback	Gabriel Mukobi et.al.	2310.16763	link
2023-10-25	HI-TOM: A Benchmark for Evaluating Higher-Order Theory of Mind Reasoning in Large Language Models	Yinghui He et.al.	2310.16755	link
2023-10-25	HANSEN: Human and AI Spoken Text Benchmark for Authorship Analysis	Nafis Irtiza Tripto et.al.	2310.16746	null
2023-10-25	Disentangling Extraction and Reasoning in Multi-hop Spatial Reasoning	Roshanak Mirzaee et.al.	2310.16731	null
2023-10-26	SkyMath: Technical Report	Liu Yang et.al.	2310.16713	null
2023-10-24	MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning	Zayne Sprague et.al.	2310.16049	link
2023-10-24	AI Alignment and Social Choice: Fundamental Limitations and Policy Implications	Abhilash Mishra et.al.	2310.16048	null
2023-10-24	Woodpecker: Hallucination Correction for Multimodal Large Language Models	Shukang Yin et.al.	2310.16045	link
2023-10-25	WebWISE: Web Interface Control and Sequential Exploration with Large Language Models	Heyi Tao et.al.	2310.16042	null
2023-10-24	Instruct and Extract: Instruction Tuning for On-Demand Information Extraction	Yizhu Jiao et.al.	2310.16040	null
2023-10-24	What's Left? Concept Grounding with Logic-Enhanced Foundation Models	Joy Hsu et.al.	2310.16035	link
2023-10-24	Visual Cropping Improves Zero-Shot Question Answering of Multimodal Large Language Models	Jiarui Zhang et.al.	2310.16033	null
2023-10-24	What Algorithms can Transformers Learn? A Study in Length Generalization	Hattie Zhou et.al.	2310.16028	null
2023-10-24	White-box Compiler Fuzzing Empowered by Large Language Models	Chenyuan Yang et.al.	2310.15991	null
2023-10-24	Dissecting In-Context Learning of Translations in GPTs	Vikas Raunak et.al.	2310.15987	null
2023-10-23	Large Language Models are Visual Reasoning Coordinators	Liangyu Chen et.al.	2310.15166	link
2023-10-23	LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers	Theo X. Olausson et.al.	2310.15164	link
2023-10-23	Linear Representations of Sentiment in Large Language Models	Curt Tigges et.al.	2310.15154	null
2023-10-23	S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models	Fangyu Lei et.al.	2310.15147	link
2023-10-23	SpecTr: Fast Speculative Decoding via Optimal Transport	Ziteng Sun et.al.	2310.15141	null
2023-10-23	AutoDAN: Automatic and Interpretable Adversarial Attacks on Large Language Models	Sicheng Zhu et.al.	2310.15140	null
2023-10-23	Quantifying the Dialect Gap and its Correlates Across Languages	Anjali Kantharuban et.al.	2310.15135	null
2023-10-23	Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models	Gabriel Sarch et.al.	2310.15127	null
2023-10-23	Branch-Solve-Merge Improves Large Language Model Evaluation and Generation	Swarnadeep Saha et.al.	2310.15123	null
2023-10-23	Causal Inference Using LLM-Guided Discovery	Aniket Vashishtha et.al.	2310.15117	null
2023-10-20	Improving Long-form Speech Translation through Segmentation with Large Language Models and Finite State Decoding Constraints	Arya D. McCarthy et.al.	2310.13678	null
2023-10-20	StereoMap: Quantifying the Awareness of Human-like Stereotypes in Large Language Models	Sullam Jeoung et.al.	2310.13673	link
2023-10-20	Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models	Ruida Wang et.al.	2310.13671	link
2023-10-20	BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues	Haodong Duan et.al.	2310.13650	link
2023-10-20	Contrastive Prefence Learning: Learning from Human Feedback without RL	Joey Hejna et.al.	2310.13639	link
2023-10-20	Three Questions Concerning the Use of Large Language Models to Facilitate Mathematics Learning	An-Zi Yen et.al.	2310.13615	null
2023-10-20	MarineGPT: Unlocking Secrets of Ocean to the Public	Ziqiang Zheng et.al.	2310.13596	link
2023-10-20	Entangled Preferences: The History and Risks of Reinforcement Learning and Human Feedback	Nathan Lambert et.al.	2310.13595	null
2023-10-20	Why Can Large Language Models Generate Correct Chain-of-Thoughts?	Rasul Tutunov et.al.	2310.13571	null
2023-10-20	Cache & Distil: Optimising API Calls to Large Language Models	Guillem Ramírez et.al.	2310.13561	null
2023-10-19	Frozen Transformers in Language Models Are Effective Visual Encoder Layers	Ziqi Pang et.al.	2310.12973	link
2023-10-19	CLAIR: Evaluating Image Captions with Large Language Models	David Chan et.al.	2310.12971	null
2023-10-19	AutoMix: Automatically Mixing Language Models	Aman Madaan et.al.	2310.12963	link
2023-10-19	An Emulator for Fine-Tuning Large Language Models using Small Language Models	Eric Mitchell et.al.	2310.12962	null
2023-10-19	SEGO: Sequential Subgoal Optimization for Mathematical Problem-Solving	Xueliang Zhao et.al.	2310.12960	null
2023-10-19	Structured Generation and Exploration of Design Space with Large Language Models for Human-AI Co-Creation	Sangho Suh et.al.	2310.12953	null
2023-10-19	3D-GPT: Procedural 3D Modeling with Large Language Models	Chunyi Sun et.al.	2310.12945	null
2023-10-19	Eureka: Human-Level Reward Design via Coding Large Language Models	Yecheng Jason Ma et.al.	2310.12931	null
2023-10-19	Experimental Narratives: A Comparison of Human Crowdsourced Storytelling and AI Storytelling	Nina Begus et.al.	2310.12902	null
2023-10-19	StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding	Cheng Jiayang et.al.	2310.12874	null
2023-10-18	Pseudointelligence: A Unifying Framework for Language Model Evaluation	Shikhar Murty et.al.	2310.12135	null
2023-10-18	Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture	Daniel Y. Fu et.al.	2310.12109	null
2023-10-18	Non-Intrusive Adaptation: Input-Centric Parameter-efficient Fine-Tuning for Versatile Multimodal Modeling	Yaqing Wang et.al.	2310.12100	null
2023-10-18	Unveiling the Siren's Song: Towards Reliable Fact-Conflicting Hallucination Detection	Xiang Chen et.al.	2310.12086	link
2023-10-18	On the Benefit of Generative Foundation Models for Human Activity Recognition	Zikang Leng et.al.	2310.12085	null
2023-10-18	SPEED: Speculative Pipelined Execution for Efficient Decoding	Coleman Hooper et.al.	2310.12072	null
2023-10-18	Evaluating the Symbol Binding Ability of Large Language Models for Multiple-Choice Questions in Vietnamese General Education	Duc-Vu Nguyen et.al.	2310.12059	null
2023-10-18	Concept-Guided Chain-of-Thought Prompting for Pairwise Comparison Scaling of Texts with Large Language Models	Patrick Y. Wu et.al.	2310.12049	null
2023-10-18	LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation	Shengqiang Zhang et.al.	2310.12020	null
2023-10-18	Fast Multipole Attention: A Divide-and-Conquer Attention Mechanism for Long Sequences	Yanming Kang et.al.	2310.11960	null
2023-10-17	VeRA: Vector-based Random Matrix Adaptation	Dawid Jan Kopiczko et.al.	2310.11454	null
2023-10-17	BitNet: Scaling 1-bit Transformers for Large Language Models	Hongyu Wang et.al.	2310.11453	null
2023-10-17	Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective	Ming Zhong et.al.	2310.11451	null
2023-10-18	EvalCrafter: Benchmarking and Evaluating Large Video Generation Models	Yaofang Liu et.al.	2310.11440	null
2023-10-17	An Empirical Study of Translation Hypothesis Ensembling with Large Language Models	António Farinhas et.al.	2310.11430	link
2023-10-17	Neural Attention: Enhancing QKV Calculation in Self-Attention Mechanism with Neural Networks	Muhan Zhang et.al.	2310.11398	link
2023-10-17	Last One Standing: A Comparative Analysis of Security and Privacy of Soft Prompt Tuning, LoRA, and In-Context Learning	Rui Wen et.al.	2310.11397	null
2023-10-17	Towards Automatic Satellite Images Captions Generation Using Large Language Models	Yingxu He et.al.	2310.11392	null
2023-10-17	DialogueLLM: Context and Emotion Knowledge-Tuned LLaMA Models for Emotion Recognition in Conversations	Yazhou Zhang et.al.	2310.11374	null
2023-10-17	Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting	Melanie Sclar et.al.	2310.11324	null
2023-10-13	Vision-by-Language for Training-Free Compositional Image Retrieval	Shyamgopal Karthik et.al.	2310.09291	null
2023-10-13	User Inference Attacks on Large Language Models	Nikhil Kandpal et.al.	2310.09266	null
2023-10-13	**PromptRE: Weakly-Supervised Document-Level Relation Extraction via Prompting-Based Data Progr

Name		Name	Last commit message	Last commit date
Latest commit History 2,090 Commits
.github/workflows		.github/workflows
assets		assets
docs		docs
.gitignore		.gitignore
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Updated on 2024.11.17

Computer Vision

LLM

About

Releases

Packages

Languages

WailordHe/cv-arxiv-daily-wailord

Folders and files

Latest commit

History

Repository files navigation

Updated on 2024.11.17

Computer Vision

LLM

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages