cs.CL updates on arXiv.org

Showing 50 articles in the latest snapshot of channel 78446695
Channel Details:

Title: cs.CL updates on arXiv.org
Channel Number: 78446695
Language: English
Registered On: February 13, 2024, 7:19 pm
Number of Articles: 155
Latest Snapshot: February 13, 2024, 7:19 pm
RSS URL: https://arxiv.org/rss/cs.cl
Publisher: https://rss.arxiv.org/rss
Description: cs.CL updates on the arXiv.org e-print archive.
Catalog: //arxiv1538.rssing.com/catalog.php?indx=78446695

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

February 12, 2024, 9:00 pm

Diffusion models have gained attention in text processing, offering many potential advantages over traditional autoregressive models. This work explores the integration of diffusion models and...

Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search

February 12, 2024, 9:00 pm

In mixed-initiative conversational search systems, clarifying questions are used to help users who struggle to express their intentions in a single query. These questions aim to uncover user's...

Unsupervised Sign Language Translation and Generation

February 12, 2024, 9:00 pm

Motivated by the success of unsupervised neural machine translation (UNMT), we introduce an unsupervised sign language translation and generation network (USLNet), which learns from abundant...

OrderBkd: Textual backdoor attack through repositioning

February 12, 2024, 9:00 pm

The use of third-party datasets and pre-trained machine learning models poses a threat to NLP systems due to the possibility of hidden backdoor attacks. Existing attacks involve poisoning the data samples...

Auxiliary Tasks to Boost Biaffine Semantic Dependency Parsing

February 12, 2024, 9:00 pm

The biaffine parser of Dozat and Manning (2017) was successfully extended to semantic dependency parsing (SDP) (Dozat and Manning, 2018). Its performance on graphs is surprisingly high given that,...

Large Language Models "Ad Referendum": How Good Are They at Machine...

February 12, 2024, 9:00 pm

This study evaluates the machine translation (MT) quality of two state-of-the-art large language models (LLMs) against a traditional neural machine translation (NMT) system across four language pairs...

The Sound of Healthcare: Improving Medical Transcription ASR Accuracy with...

February 12, 2024, 9:00 pm

In the rapidly evolving landscape of medical documentation, transcribing clinical dialogues accurately is increasingly paramount. This study explores the potential of Large Language Models (LLMs) to...

Detecting the Clinical Features of Difficult-to-Treat Depression using...

February 12, 2024, 9:00 pm

Difficult-to-treat depression (DTD) has been proposed as a broader and more clinically comprehensive perspective on a person's depressive disorder where despite treatment, they continue to experience...

AutoMathText: Autonomous Data Selection with Language Models for Mathematical...

February 12, 2024, 9:00 pm

To improve language models' proficiency in mathematical reasoning via continual pretraining, we introduce a novel strategy that leverages base language models for autonomous data selection. Departing...

Anchor-based Large Language Models

February 12, 2024, 9:00 pm

Large language models (LLMs) predominantly employ decoder-only transformer architectures, necessitating the retention of keys/values information for historical tokens to provide contextual information...

Step-On-Feet Tuning: Scaling Self-Alignment of LLMs via Bootstrapping

February 12, 2024, 9:00 pm

Self-alignment is an effective way to reduce the cost of human annotation while ensuring promising model capability. However, most current methods complete the data collection and training steps in a...

Topic Modeling as Multi-Objective Contrastive Optimization

February 12, 2024, 9:00 pm

Recent representation learning approaches enhance neural topic models by optimizing the weighted linear combination of the evidence lower bound (ELBO) of the log-likelihood and the contrastive learning...

Show Me How It's Done: The Role of Explanations in Fine-Tuning Language Models

February 12, 2024, 9:00 pm

Our research demonstrates the significant benefits of using fine-tuning with explanations to enhance the performance of language models. Unlike prompting, which maintains the model's parameters,...

MAFIA: Multi-Adapter Fused Inclusive LanguAge Models

February 12, 2024, 9:00 pm

Pretrained Language Models (PLMs) are widely used in NLP for various tasks. Recent studies have identified various biases that such models exhibit and have proposed methods to correct these biases....

The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese

February 12, 2024, 9:00 pm

In the field of spoken language understanding, systems like Whisper and Multilingual Massive Speech (MMS) have shown state-of-the-art performances. This study is dedicated to a comprehensive...

Pushing The Limit of LLM Capacity for Text Classification

February 12, 2024, 9:00 pm

The value of text classification's future research has encountered challenges and uncertainties, due to the extraordinary efficacy demonstrated by large language models (LLMs) across numerous...

AraSpider: Democratizing Arabic-to-SQL

February 12, 2024, 9:00 pm

This study presents AraSpider, the first Arabic version of the Spider dataset, aimed at improving natural language processing (NLP) in the Arabic-speaking community. Four multilingual translation...

Quality Does Matter: A Detailed Look at the Quality and Utility of Web-Mined...

February 12, 2024, 9:00 pm

We conducted a detailed analysis on the quality of web-mined corpora for two low-resource languages (making three language pairs, English-Sinhala, English-Tamil and Sinhala-Tamil). We ranked each...

Intrinsic Task-based Evaluation for Referring Expression Generation

February 12, 2024, 9:00 pm

Recently, a human evaluation study of Referring Expression Generation (REG) models had an unexpected conclusion: on WebNLG, Referring Expressions (REs) generated by the state-of-the-art neural...

SALAD: Smart AI Language Assistant Daily

February 12, 2024, 9:00 pm

SALAD is an AI-driven language-learning application designed to help foreigners learn Japanese. It offers translations in Kanji-Kana-Romaji, speech recognition, translated audio, vocabulary tracking,...

Dólares or Dollars? Unraveling the Bilingual Prowess of Financial LLMs...

February 12, 2024, 9:00 pm

Despite Spanish's pivotal role in the global finance industry, a pronounced gap exists in Spanish financial natural language processing (NLP) and application studies compared to English, especially in...

Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful...

February 12, 2024, 9:00 pm

Fact-checking research has extensively explored verification but less so the generation of natural-language explanations, crucial for user trust. While Large Language Models (LLMs) excel in text...

Chain-of-Layer: Iteratively Prompting Large Language Models for Taxonomy...

February 12, 2024, 9:00 pm

Automatic taxonomy induction is crucial for web search, recommendation systems, and question answering. Manual curation of taxonomies is expensive in terms of human effort, making automatic taxonomy...

How do Large Language Models Navigate Conflicts between Honesty and Helpfulness?

February 12, 2024, 9:00 pm

In day-to-day communication, people often approximate the truth - for example, rounding the time or omitting details - in order to be maximally helpful to the listener. How do large language models...

Previously on the Stories: Recap Snippet Identification for Story Reading

February 12, 2024, 9:00 pm

Similar to the "previously-on" scenes in TV shows, recaps can help book reading by recalling the readers' memory about the important elements in previous texts to better understand the ongoing plot....

Low-Resource Counterspeech Generation for Indic Languages: The Case of...

February 12, 2024, 9:00 pm

With the rise of online abuse, the NLP community has begun investigating the use of neural architectures to generate counterspeech that can "counter" the vicious tone of such abusive speech and...

American Sign Language Video to Text Translation

February 12, 2024, 9:00 pm

Sign language to text is a crucial technology that can break down communication barriers for individuals with hearing difficulties. We replicate and try to improve on a recently published study. We...

TransGPT: Multi-modal Generative Pre-trained Transformer for Transportation

February 12, 2024, 9:00 pm

Natural language processing (NLP) is a key component of intelligent transportation systems (ITS), but it faces many challenges in the transportation domain, such as domain-specific knowledge and data,...

Through the Lens of Split Vote: Exploring Disagreement, Difficulty and...

February 12, 2024, 9:00 pm

In legal decisions, split votes (SV) occur when judges cannot reach a unanimous decision, posing a difficulty for lawyers who must navigate diverse legal arguments and opinions. In high-stakes domains,...

Prompt Perturbation in Retrieval-Augmented Generation based Large Language...

February 12, 2024, 9:00 pm

The robustness of large language models (LLMs) becomes increasingly important as their use rapidly grows in a wide range of domains. Retrieval-Augmented Generation (RAG) is considered as a means to...

Natural Language Reinforcement Learning

February 12, 2024, 9:00 pm

Reinforcement Learning (RL) has shown remarkable abilities in learning policies for decision-making tasks. However, RL is often hindered by issues such as low sample efficiency, lack of...

Generalizing Conversational Dense Retrieval via LLM-Cognition Data Augmentation

February 12, 2024, 9:00 pm

Conversational search utilizes multi-turn natural language contexts to retrieve relevant passages. Existing conversational dense retrieval models mostly view a conversation as a fixed sequence of...

Using Large Language Models for Student-Code Guided Test Case Generation in...

February 12, 2024, 9:00 pm

In computer science education, test cases are an integral part of programming assignments since they can be used as assessment items to test students' programming knowledge and provide personalized...

Semi-Supervised Learning for Bilingual Lexicon Induction

February 12, 2024, 9:00 pm

We consider the problem of aligning two sets of continuous word representations, corresponding to languages, to a common space in order to infer a bilingual lexicon. It was recently shown that it is...

Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large...

February 12, 2024, 9:00 pm

Large language models have the potential to be valuable in the healthcare industry, but it's crucial to verify their safety and effectiveness through rigorous evaluation. For this purpose, we...

Event-Keyed Summarization

February 12, 2024, 9:00 pm

We introduce event-keyed summarization (EKS), a novel task that marries traditional summarization and document-level event extraction, with the goal of generating a contextualized summary for a...

Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning...

February 12, 2024, 9:00 pm

Tuning pretrained language models for dialogue generation has been a prevalent paradigm for building capable dialogue agents. Yet, traditional tuning narrowly views dialogue generation as resembling...

NLP for Knowledge Discovery and Information Extraction from Energetics Corpora

February 12, 2024, 9:00 pm

We present a demonstration of the utility of NLP for aiding research into energetic materials and associated systems. The NLP method enables machine understanding of textual data, offering an automated...

SpeechCLIP+: Self-supervised multi-task representation learning for speech...

February 12, 2024, 9:00 pm

The recently proposed visually grounded speech model SpeechCLIP is an innovative framework that bridges speech and text through images via CLIP without relying on text transcription. On this basis,...

Should I try multiple optimizers when fine-tuning pre-trained Transformers...

February 12, 2024, 9:00 pm

NLP research has explored different neural model architectures and sizes, datasets, training objectives, and transfer learning techniques. However, the choice of optimizer during training has not been...

LiFi: Lightweight Controlled Text Generation with Fine-Grained Control Codes

February 12, 2024, 9:00 pm

In the rapidly evolving field of text generation, the demand for more precise control mechanisms has become increasingly apparent. To address this need, we present a novel methodology, LIFI, which...

A Thorough Examination of Decoding Methods in the Era of LLMs

February 12, 2024, 9:00 pm

Decoding methods play an indispensable role in converting language models from next-token predictors into practical task solvers. Prior research on decoding methods, primarily focusing on task-specific...

TL;DR Progress: Multi-faceted Literature Exploration in Text Summarization

February 12, 2024, 9:00 pm

This paper presents TL;DR Progress, a new tool for exploring the literature on neural text summarization. It organizes 514 papers based on a comprehensive annotation scheme for text summarization...

Investigating Consistency in Query-Based Meeting Summarization: A Comparative...

February 12, 2024, 9:00 pm

With more and more advanced data analysis techniques emerging, people will expect these techniques to be applied in more complex tasks and solve problems in our daily lives. Text Summarization is one...

Can LLMs Recognize Toxicity? Structured Toxicity Investigation Framework and...

February 12, 2024, 9:00 pm

In the pursuit of developing Large Language Models (LLMs) that adhere to societal standards, it is imperative to discern the existence of toxicity in the generated text. The majority of existing...

GenTranslate: Large Language Models are Generative Multilingual Speech and...

February 12, 2024, 9:00 pm

Recent advances in large language models (LLMs) have stepped forward the development of multilingual speech and machine translation by its reduced representation errors and incorporated external...

History, Development, and Principles of Large Language Models-An Introductory...

February 12, 2024, 9:00 pm

Language models serve as a cornerstone in natural language processing (NLP), utilizing mathematical methods to generalize language laws and knowledge for prediction and generation. Over extensive...

Evaluation Metrics for Text Data Augmentation in NLP

February 12, 2024, 9:00 pm

Recent surveys on data augmentation for natural language processing have reported different techniques and advancements in the field. Several frameworks, tools, and repositories promote the...

EntGPT: Linking Generative Large Language Models with Knowledge Bases

February 12, 2024, 9:00 pm

The ability of Large Language Models (LLMs) to generate factually correct output remains relatively unexplored due to the lack of fact-checking and knowledge grounding during training and inference. In...

NICE: To Optimize In-Context Examples or Not?

February 12, 2024, 9:00 pm

Recent works have shown that large language models (LLMs) work remarkably well on a wide range of tasks through in-context learning and optimization of in-context examples (ICE). However, most of these...
