Hindi Intent Classification
Supervisor
Dr. Deepak K T
Department of Electronics and Communication Engineering
Indian Institute of Information Technology Dharwad
Overview
Introduction
Research Problem Formulation
Literature Review
Methodology
Dataset
Conclusion
References
Introduction
Intent Classification
Literature Review
Larson et al. (2019) classified around 23,000 sentences over 150 intents.
Among the models they compared, BERT yielded the best in-scope accuracy [2].
Xia et al. (2018) performed zero-shot user intent detection using capsule
neural networks [3].
Methodology
• Reviewing standard datasets and building a new Hindi dataset for intent
classification.
• Applying machine learning techniques to classify intents and comparing the
results with those obtained on the standard datasets.
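The classification step can be illustrated with a minimal sketch. The slides do not name a specific model, so this example assumes a bag-of-words multinomial Naive Bayes classifier with add-one smoothing, written in plain Python. The utterances, the intent labels (check_balance, get_weather, play_music), and the function names are hypothetical and are not taken from the dataset described here.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy examples (NOT from the actual dataset): (utterance, intent)
TRAIN = [
    ("मेरा खाता बैलेंस बताओ", "check_balance"),
    ("खाते में कितना पैसा है", "check_balance"),
    ("आज मौसम कैसा है", "get_weather"),
    ("कल बारिश होगी क्या", "get_weather"),
    ("एक गाना बजाओ", "play_music"),
    ("कोई पुराना गाना सुनाओ", "play_music"),
]


def tokenize(text):
    # Whitespace tokenization; a real system would likely need more care.
    return text.split()


def train(examples):
    """Count intents and per-intent word frequencies."""
    intent_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, intent in examples:
        intent_counts[intent] += 1
        for tok in tokenize(text):
            word_counts[intent][tok] += 1
            vocab.add(tok)
    return intent_counts, word_counts, vocab


def predict(model, text):
    """Return the intent with the highest log-posterior score."""
    intent_counts, word_counts, vocab = model
    total = sum(intent_counts.values())
    best_intent, best_score = None, -math.inf
    for intent, n in intent_counts.items():
        score = math.log(n / total)  # log prior
        denom = sum(word_counts[intent].values()) + len(vocab)
        for tok in tokenize(text):
            if tok in vocab:  # skip words never seen in training
                score += math.log((word_counts[intent][tok] + 1) / denom)
        if score > best_score:
            best_intent, best_score = intent, score
    return best_intent


model = train(TRAIN)
print(predict(model, "गाना बजाओ"))
```

The same train/predict interface could be reused to compare results on the new Hindi dataset and on a standard dataset, swapping in a stronger model such as BERT where appropriate.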
Dataset
The dataset we introduce has 3 domains with 94 intents. It was created by
brainstorming among the team members and by drawing on existing datasets
available on the internet.
Future Work
Conclusion
References
[1] Hwang, E. J., Ahn, B. K., Macdonald, B. A., & Ahn, H. S. (2020, May).
Demonstration of hospital receptionist robot with extended hybrid code network
to select responses and gestures. In 2020 IEEE International Conference on
Robotics and Automation (ICRA) (pp. 8013-8018). IEEE.
[2] Larson, S., Mahendran, A., Peper, J. J., Clarke, C., Lee, A., Hill, P., ... Mars,
J. (2019). An evaluation dataset for intent classification and out-of-scope
prediction. arXiv preprint arXiv:1909.02027.
[3] Xia, C., Zhang, C., Yan, X., Chang, Y., & Yu, P. S. (2018). Zero-shot user
intent detection via capsule neural networks. arXiv preprint arXiv:1809.00385.
[4] Alammar, J. (n.d.). The Illustrated BERT, ELMo, and co. (How NLP Cracked
Transfer Learning). https://jalammar.github.io/illustrated-bert/