Updated 2nd Synopsis
Updated 2nd Synopsis
Synopsis
on
of
Bachelor of Technology
In
COMPUTER SCIENCE AND ENGINEERING
(Artificial Intelligence & Machine Learning)
Submitted by
Prasoon Bhatt (2101331530095)
Shashank Shekhar (2101331530118)
Supervisor Sign:
(Ms. Meghali Das)
Introduction
Model Evaluation: The trained model is tested on unseen data to assess its
accuracy. Performance metrics such as accuracy, precision, recall, and
confusion matrix are used for evaluation.
CNNs are particularly powerful for digit recognition because they preserve
spatial relationships within the image and can recognize digits even when
they vary in size, rotation, or handwriting style.
a) Template Matching
Template Matching is one of the earliest approaches used for handwritten
digit recognition. It works by comparing an input image with a set of
predefined templates representing each digit. The closest match is selected
as the predicted digit.
Advantages:
Simple and easy to implement.
Works well when the handwriting style is uniform.
Disadvantages:
Fails when there are variations in handwriting.
Computationally expensive for large datasets.
Advantages:
Effective for recognizing sequences of handwritten characters or
words.
Can learn temporal dependencies in handwriting.
Disadvantages:
Not ideal for single-digit classification.
Requires high computational resources.
Among all the existing approaches, CNNs provide the best results for
handwritten digit recognition due to the following reasons:
Hence, CNNs have become the industry standard for tasks like handwritten
digit recognition, license plate recognition, and even facial recognition.
Problem Statement
Dataset Selection: The model will be trained on the MNIST dataset, which
contains 60,000 training images and 10,000 test images of handwritten
digits (0-9).
Data Normalization: The pixel values (0-255) are normalized to a range
of 0 to 1 to improve the training efficiency.
Reshaping: The dataset images (28x28 pixels) are reshaped to fit the CNN
input layer.
Augmentation (if needed): Techniques like rotation, shifting, and
zooming may be applied to increase data diversity and improve model
generalization.
3. Model Training
4. Model Evaluation
6. Future Enhancements
1. Software Requirements
2. Hardware Requirements
This setup ensures efficient model training, testing, and deployment for
handwritten digit recognition.
Proposed System
A. Functional Requirements
The functional requirements define the key operations and features that the
Handwritten Digit Recognition System must support.
The system should classify input images into digits (0-9) based on
trained features.
It should return a confidence score (probability) for each prediction.
Real-time classification should be supported for fast processing of new
inputs.
4. Performance Evaluation
B. Non-Functional Requirements
1. Performance Requirements
3. Security
4. Usability
The user interface (UI) should be simple and intuitive, allowing easy
digit uploads and recognition.
The system should provide clear output with confidence scores for
better interpretability.
6. Compatibility
8. Efficiency
The system should comply with data privacy laws (GDPR, CCPA)
when handling user data.
It should ensure fairness and bias mitigation in digit recognition to
avoid discrimination against specific handwriting styles.
Class Diagram
DFD (level 0)
[2] Deng, L. (2012). "The MNIST database of handwritten digit images for machine learning
research." IEEE Signal Processing Magazine, 29(6), 141-142.
[3] Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). "ImageNet classification with deep
convolutional neural networks." Advances in Neural Information Processing Systems (NeurIPS).
[4] Patel, M., & Goyal, S. (2020). "A comparative study on handwritten digit recognition using
machine learning and deep learning techniques." International Journal of Machine Learning and
Computing, 10(2), 75-82.
[5] Zhang, X., Zhao, J., & LeCun, Y. (2015). "Character-level convolutional networks for text
classification." Advances in Neural Information Processing Systems (NeurIPS).
[6] Kottakota, A., Teja, P. S., Chander, R. K., & Hema, D. D. (2023). "NeuroWrite: Predictive
Handwritten Digit Classification using Deep Neural Networks." arXiv preprint arXiv:2311.01022.
[7] Hochuli, A. G., Britto Jr, A. S., Saji, D. A., Saavedra, J. M., Sabourin, R., & Oliveira, L. S.
(2020). "A Comprehensive Comparison of End-to-End Approaches for Handwritten Digit String
Recognition." arXiv preprint arXiv:2010.15904.
[8] Gondere, M. S., Schmidt-Thieme, L., Sharma, D. P., & Scholz, R. (2021). "Multi-script
Handwritten Digit Recognition Using Multi-task Learning." arXiv preprint arXiv:2106.08267.
[9] Bonyani, M., Jahangard, S., & Daneshmand, M. (2020). "Persian Handwritten Digit, Character
and Word Recognition Using Deep Learning." arXiv preprint arXiv:2010.12880.
[10] Hochuli, A. G., Britto Jr, A. S., Saji, D. A., Saavedra, J. M., Sabourin, R., & Oliveira, L. S.
(2020). "A Comprehensive Comparison of End-to-End Approaches for Handwritten Digit String
Recognition." arXiv preprint arXiv:2010.15904.
[11] Gondere, M. S., Schmidt-Thieme, L., Sharma, D. P., & Scholz, R. (2021). "Multi-script
Handwritten Digit Recognition Using Multi-task Learning." arXiv preprint arXiv:2106.08267.
[12] Bonyani, M., Jahangard, S., & Daneshmand, M. (2020). "Persian Handwritten Digit, Character
and Word Recognition Using Deep Learning." arXiv preprint arXiv:2010.12880.
[13] Hochuli, A. G., Britto Jr, A. S., Saji, D. A., Saavedra, J. M., Sabourin, R., & Oliveira, L. S.
(2020). "A Comprehensive Comparison of End-to-End Approaches for Handwritten Digit String
Recognition." arXiv preprint arXiv:2010.15904.