Loading...

Building an Evaluation Scale using Item Response Theory

Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards to difficulty and discriminating power. We propose Item Res...

Full description

Saved in:
Bibliographic Details
Published in:Proc Conf Empir Methods Nat Lang Process
Main Authors: Lalor, John P., Wu, Hao, Yu, Hong
Format: Artigo
Language:Inglês
Published: 2016
Subjects:
Online Access:https://ncbi.nlm.nih.gov/pmc/articles/PMC5167538/
https://ncbi.nlm.nih.gov/pubmed/28004039
Tags: Add Tag
No Tags, Be the first to tag this record!