Microsoft Releases Indian Language 'Speech Corpus' For Researchers
The Indian language "Speech Corpus" is provided through the Microsoft Research Open Data initiative, a collection of free datasets from Microsoft Research intended to advance research in areas such as natural language processing, computer vision, and domain-specific sciences.
To help researchers and academia build Indian language speech recognition for all applications where speech is used, Microsoft India on Thursday launched its Indian language "Speech Corpus", offering speech training and test data for Telugu, Tamil and Gujarati. Microsoft said in a statement that this is the largest publicly available Indian language speech dataset that includes both audio and corresponding transcripts.
"Microsoft Indian Language Speech Corpus is an extension of our on-going efforts to reduce language barriers and empower Indians to harness the full potential of the Internet," said Sundar Srinivasan, General Manager, Artificial Intelligence and Research, Microsoft India.
"Using our technology expertise, we want to accelerate innovation in voice based computing for India by supporting researchers and academia," Srinivasan said.
Microsoft's Indian Language Speech Corpus was tested at the Interspeech 2018 conference in Hyderabad this month. In a Low Resource Speech Recognition Challenge, participants used data from the Microsoft Indian Language Speech Corpus to build Automatic Speech Recognition (ASR) systems.
Participants were able to create high-quality speech recognition models using this data, thus validating the efficacy of the Corpus, Microsoft said. Microsoft has been working with Indian languages for over two decades, since the launch of Project Bhasha in 1998, and its Indian Language Input tool lets users input localised text easily and quickly.