Repositories
Showing 10 of 114 repositories
- Multi-expert-Prompting Public Forked from dxlong2000/Multi-expert-Prompting
[EMNLP 2024] Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models
- SSNLP-2024 Public
- FormatEval Public Forked from dxlong2000/FormatBiasEval
[Preprint' 24] LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
- FormatBiasEval Public Forked from dxlong2000/FormatBiasEval
[Preprint' 24] LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
- DiSQ-Score Public Forked from YisongMiao/DiSQ-Score
The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations> @ ACL 2024
- Decompose-and-Aggregate Public