37 - On Statistical Significance, Training Variance, and Why Reporting Score Distributions Matters

NLP Highlights