The CoRal Project - Danish Speech Dataset
Speech tech in general is a growing industry, with an estimated annual compound growth of 16-19% in the next five years. But the majority of speech tech development is targeted towards high resource languages such as English. Danish, a small and low-resource language, risks falling behind and missing revenue from this technology. Modern speech tech is based on ML algorithms, which require large amounts of data. Lack of Danish data is the main problem for moving the industry forward.