▽LDC - Linguistic Data Consortium ●04/26 00:47 How LDC Data Inspires Research: The New York Times Annotated Corpus illustrates how data published in LDC’s Catalog can become an important resource for the community. The New York Times is one of LDC’s earliest data providers; the billions of words of news text it has provided for language resources since the 1990s continue to be used today for research and technology development. Its contribution
▽Official Google Research Blog ●04/26 00:23 GoEmotions: A Dataset for Fine-Grained Emotion Classification. Thursday, October 28, 2021. Posted by Dana Alon and Jeongwoo Ko, Software Engineers, Google Research. Emotions are a key aspect of social interactions, influencing the way people behave and shaping relationships. This is especially true with language: with only a few words, we’re able to express a wide variety of subtle and complex
▽natural language processing blog ●04/25 22:07 Daniel Lemire’s blog | In C++, is empty() faster than comparing the size with zero? | An urgent puzzle | Data Wrangling | Introducing Trifacta’s integration with dbt Core on Google BigQuery