Matches in Nanopublications for { <https://w3id.org/np/RA5onfai3TcQTXxloau--mcY6JKg8yXNeMmqo29rFn4Qc#assertion> ?p ?o ?g. }
Showing items 1 to 15 of
15
with 100 items per page.
- assertion comment " To reduce evaluation contamination @XuanmingZhang07 @Zhou_Yu_AI @columbianlp et al. convert dataset examples into templates(Fig.) https://arxiv.org/abs/2406.17681 EWOK datasets are built to have this trait https://x.com/neuranna/status/1791465842632454184 Interesting trend will it last? solve contamination? https://twitter.com/LChoshen/status/1806396147281637645/photo/1 @XuanmingZhang07 @Zhou_Yu_AI @columbianlp If you ask me, a nice step, but it only solves the worst contamination (clear training on the test set). Not on just training on similar formats, synthetic data etc. to improve. So it is a good approach that should last, but we need more. (@deliprao you had similar claim right?) " assertion.
- assertion quotesPost 1791465842632454184 assertion.
- assertion wasAttributedTo 0000-0002-0085-6496 provenance.
- assertion wasAttributedTo RAoSadUw99CeqDlR2400018nqTzR_38fT86OrTzk16Vts provenance.
- assertion creator RAoSadUw99CeqDlR2400018nqTzR_38fT86OrTzk16Vts assertion.
- assertion wasGeneratedBy activity provenance.
- assertion wasAssociatedWith LChoshen provenance.
- assertion keywords "dataset-templates" assertion.
- assertion keywords "evaluation-contamination" assertion.
- assertion keywords "ewok-datasets" assertion.
- assertion keywords "language-model-benchmarking" assertion.
- assertion keywords "varbench" assertion.
- assertion includesQuotationFrom 1791465842632454184 assertion.
- assertion linksTo 2406.17681 assertion.
- assertion linksTo 1806396147281637645 provenance.