0% found this document useful (0 votes)

15 views10 pages

10.1007@978 981 13 6577 535

Uploaded by

PRAVIN SAVARIDASS M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views10 pages

10.1007@978 981 13 6577 535

Uploaded by

PRAVIN SAVARIDASS M

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Automation of Business Cards

Shreya Srivastava, Suryanshu Sahay, Deepti Mehrotra and Vikas Deep

Abstract Business card is shared as hardcopy, the data present in business card will
be highly useful if it is available in digital format. The task of manually entering
the details of all business cards is laborious and time-consuming. Document image
analysis is used in this paper for automating this process. This will be accomplished by
performing OCR and then using the text to extract the Meta data. One more important
component of business card is the logo of the organization. The text extraction OCR
will be done using the Tesseract API. After conversion of the image to text, the data
will be saved in the database. The raw data will be saved in the database, which
will later be segregated and stored in the appropriate fields. It is generally ignored
in the process of saving text information, in this paper it is extracted and stored
in database. For logo detection various techniques like Gabor Filter, Harris edge
detection technique, MSER, etc., are compared to determine the best technique for
acquiring the most accurate logo extraction. Gabor filter gives the best result is used
for the extracting logo and storing in database. Java language on NetBeans IDE
platform which use the Spring MVC framework is used to implement this work.

Keywords Business card · OCR · Tesseract · Logo detection

S. Srivastava (B) · S. Sahay · D. Mehrotra · V. Deep

Amity University Uttar Pradesh, Noida, Uttar Pradesh, India
e-mail: shreya96srivastava@gmail.com
S. Sahay
e-mail: suryanshusahay19@gmail.com
D. Mehrotra
e-mail: mehdeepti@gmail.com
V. Deep
e-mail: Vikasdeep8@gmail.com

© Springer Nature Singapore Pte Ltd. 2019 371

M. Kumar et al. (eds.), Advances in Interdisciplinary Engineering, Lecture Notes in
Mechanical Engineering, https://doi.org/10.1007/978-981-13-6577-5_35
372 S. Srivastava et al.

1 Introduction

Business card is an important document which requires preservation and efficient data
handling. The limitation of the card being offline and as a result it requires manual
work for preserving the data digitally [1]. The greatest advantage of a business card,
its tangibility has somehow also become its greatest disadvantage. These are easily
accessible yet fragile. This problem has been tried to be solved by automating this
process. The automation of business cards will be done by extracting all the data
and saving each field into the database using different algorithms and logic using
document image analysis [2–5]. This data will be saved to different fields based on
the field analysis [6].
The data field extraction will be done to extract Meta data like Name, Company,
Phone, Fax, Email, and Address. The rest of the information will be saved in the
also be saved. The main contribution to this work was to filter the text by reducing
the noise of the obtained text and then obtain the information which is necessary,
i.e., categorizing the data into mapped and unmapped data. Mapped data will then be
used to segregate the information and classifying it on the basis of various algorithms.
Regular Expressions were used for pattern matching which will be used to extract
information like email address, phone numbers, etc., which follow a certain pattern.
However there will be exceptions like ‘@’ will also be for the twitter handle which
might be interpreted as an email. Core-NER-NLP (Name Entity Recognizer-Natural
Language Processing) which uses Stanford Dictionary for various predefined field
extraction like name of a person, organization, location, etc. The data extracted
from the text will then be saved in separate fields in the data base. As the tesseract
OCR is not 100% efficient, there will be a modification option where the user can
manually correct the text before the data extraction takes place [7]. The home page
will contain upload, modify, list, search, and delete links. Upload tab will be used to
upload an image of the business card. Modify will be used to change any text which
has been wrongly read by OCR. List will display all the business cards in a single
page. Any query based on the business card will be done using the Search tab. For
omitting/removing any uploaded business card, an option of Delete will be present.
This database will be available on the browser using the JDBC and Spring MVC
framework. This will be a service available for various platforms. The database will
be tried for being converted to contacts which will be directly saved in the mobile
device.

2 Methodology

This work uses following APIs and software’s for feature extraction: Stanford Core
NLP, Tesseract, Oracle Glassfish Server, NetBeans IDE, RegEx, and MATLAB.
Stanford Core NLP uses algorithms that easies the way computers are programmed
in order to fulfill the humanistic requirements. Tesseract API uses optical character
Automation of Business Cards 373

Fig. 1 Data flow diagram

recognition for extracting text from scanned images. Oracle Glassfish Server serves
the purpose of handling servlets and JSP. The following diagram describe the com-
plete work flow of the research work.
Figure 1 as shown explains how the scanned business card will be classified into
two parts: image and text. The image, if detected will be extracted and the classified
as the logo of the respective institution. Else the text will be mapped and classified
under the respective database fields. For instance, email ids, ‘@’ will be used for
identification and 10 digit numbers will be used for mobile numbers. There are
six fields in which the extracted text will be saved and classified. They are Name,
Company, Phone, Fax, Email, and Address.
This work has been created on the NetBeans IDE platform using Java language.
Since this is a web application, Glass Fish server was used. Glass Fish is an open-
source application server built for the java platform. For the architecture, Spring
Web MVC was used. The sole purpose of this is to handle all the HTTP requests
as well as responses. The Maven Repository was used for this work as it helped
in automatically building the dependencies for any number of times. The Tesseract
OCR API was used for reading the text from the images. The acquired text was then
stored in the database. It then further processed and filtered into the respective fields.
This was done by using RegEx, NER, and Stanford Core NLP.
374 S. Srivastava et al.

Fig. 2 Home page of the application

RegEx stands for regular expression. It is used for searching strings which follow
a definite pattern. For instance an email id will always have ‘@’ in between the string
or a mobile number will have a ‘+’ followed by ISD code and then the remaining 10
digits. There are exceptions in this as well, like for instance ‘@’ might also be present
in a twitter handle so.com was used for further classification. Phone numbers are also
present in various formats like XXX-XXX-XXXX or + XX-XXXXXXXXXX, etc.,
which were also dealt with different user cases [8]. NER or Name Entity Recogni-
tion is generally used to extract information like names, addresses, percentages, or
various quantities. Here it was used to identify the Name of the contact mentioned
in the business card. Stanford Core NLP (Natural Language Processing) is an imple-
mentation of NER, it is a predefined dictionary which is used to identify and extract
information having three major classes (Location, Person, and Organization) this
was used to identify the address of the office or institution as duly mentioned in the
business card.
The MySQL database was administered and managed by phpMyAdmin. Xampp
was used to manage cross-platform operations of majorly database and php.
The class HomeController was created to manage the map the browser requests.
The DAO class is used to as a logical interface between the database and the MVC
model. The POJO files manage uploading of the business card as well as creating
contacts in the database. Utility files include all the text extraction techniques for
each field, viz., email, fax, and phone number.
Figure 2 shows the home page of the application. As mentioned, there are options
to list all the uploaded cards from the database, manually edit the stored information
(see Fig. 3) as the OCRed text is not 100% efficient, search for the business cards
using the primary key from a database, and also delete any card.
Dependencies were built separately are done in MVC framework. The main advan-
tage of building dependencies is to eliminate the need to the work of identifying and
specifying them. These dependencies are added automatically. It is particularly use-
ful when a large dependency tree is formed as it then becomes difficult to keep a track
Automation of Business Cards 375

Fig. 3 Manually editing the fields

on each one of them. The few dependencies that have been built include JAR files like
tess4j for OCR extraction, jdbc for java and database connectivity, Stanford-corenlp
for the predefined dictionary, spring expression which is used for querying, spring-tx
which manages transactions.
For the logo part, MATLAB was used. MATLAB was preferred over Java because
the scanned image requires to be processed and since Java Image processing slows
down the application, therefore it is best to use the MATLAB in place of Java for
image processing. Java Image Processing is useful when the image is only of a few
bytes (few hundred dpis), but when larger sized images are being considered for
processing, it is best to use platforms which are best suited to process the images
as per the requirements. Many different techniques were used in order to identify
the logo from business cards. The techniques involved were edge detection, Gabor
filter, SURF and SIFT algorithms [9, 10]. All of these techniques are based on the
phenomenon of feature extraction. The feature extraction is a method of extracting
the important features from the image. These are useful in various fields like object
recognition or identifying a particular set of textures or for image retrieval [11, 12].
Edge detection: it is a technique to identify the image boundaries. This will be
useful in detecting the logos as it will extract the image present in the image itself.
Boundary detection is done by comparing the image brightness and checking its
discontinuities [13–15].
SIFT: Scale-Invariant Feature Transformation is one of the basic algorithms used
to detect logos. It is very useful as it is invariant to mostly all types of features like
scaling, translation, rotation, or even the illumination. It basically converts the image
into vectors which are then used for image description.
SURF: Speeded-Up robust Features is an algorithm derived through SIFT for
image detection it is better than SIFT in terms of computability and distinctiveness
(Fig. 4a, b).
Harris features: This technique is used to detect the corners and identify them
(Fig. 5).
376 S. Srivastava et al.

Fig. 4 a Implementation of SURF. b Implementation of SURF

Fig. 5 Harris corner detection

MSER: Maximally Stable External Regions is used for image detection. It deals
with the correspondences among the different image components (Fig. 6).
Gabor Filter: It is a filter used to identify the textures present. It serves the purpose
of identifying the regions where a certain frequency is present. The main disadvantage
of this filter is that it is over reliant on the orientation of the image Fig. 7. The following
code snippet was used for implementing the Gabor filter:
Comparing all the features and results, it was Gabor Filter which was found to be
most accurate and result-oriented. Others techniques like MSER and Harris corner
detection identified the texts also. Thus making it difficult to extract the logo from
them.
Automation of Business Cards 377

Fig. 6 Implementation of SURF

Fig. 7 Implementation of Gabor Filter

3 Results and Discussions

The business card scanned was used for text as well as logo extraction. As explained
below the following results were obtained:
The OCR performed text was retrieved in the database because data other than
the structured data might also be useful and hence can be of some purpose as shown
in Fig. 8. Figure 9 shows how information has been mapped to the respective fields
and thus the data has been retrieved in the database as well. Data has been stored in
each category differently.
378 S. Srivastava et al.

Fig. 8 The OCR performed text in the database

Fig. 9 Business card list

As the scanned cards may vary, they will have different formats like .jpg, .pdf, etc.
Hence, ghost jar as well as imagio jar was used for different formats of the uploaded
image. This application was run on the glassfish server as all the connectors and
backend APIs would work properly and efficiently on this server, therefore, making
user-friendly GUI. Thus, the application successfully returns the relevant extraction
of data using tesseract and other APIs and techniques.
The logo extraction was done based on the comparison of different techniques;
following results were obtained in the end (Fig. 10).
Automation of Business Cards 379

Fig. 10 Logo extraction using Gabor Filter

4 Conclusion

The work is tried and tested on business cards of different types but there was noise
which was prevalent in every business card. This was reduced by specifically applying
algorithms. As a result only important data was extracted in the database. Since
every business card is different therefore it is difficult to extract the information from
each business card with absolute efficiency. This can be improved with increase
in the dictionary and refinement of algorithms of tesseract, so that the noise can
be reduced and other important information can also be retrieved example some
business cards may have websites as well as their designation and branches of their
company. Therefore, such card’s OCR text field extraction would filter out such
important details as noise whereas these can also be extracted and saved under more
information tab so that correct and detailed data is provided to the user. Along with
the important data, the logo was also extracted which being an added feature will
help in simpler and easier classification of the contacts. The Gabor Filter was the
most relevant technique and as a result successful implementation was done.

References

1. LaForge L, Carlson D, Korver K, System for creating and reading digital business cards, forms,
and stationery. U.S. Patent Application No. 10/055,011
2. Carton C, Lemaitre A, Coüasnon B (2015) Automatic and interactive rule inference without
ground truth. In: 2015 13th international conference on document analysis and recognition
(ICDAR). IEEE
3. Karatzas D et al (2016) Human-document interaction systems—a new frontier for document
image analysis. In: 2016 12th IAPR workshop on document analysis systems (DAS). IEEE
4. Niyogi D, Srihari SN, Govindaraju V (1996) Analysis of printed forms. In: Handbook on
optical character recognition and document image analysis. World Scientific Publishing Co.,
Singapore
5. Harvey R, Oliver G (2016) Digital curation. ALA Neal-Schuman
6. Rusinol M, Benkhelfallah T, Poulaind’Andecy V, Field extraction from administrative docu-
ments by incremental structural templates
7. Mithe R, Indalkar S, Divekar N (2013) Optical character recognition. Int J Recent Technol Eng
2.1:72–75
380 S. Srivastava et al.

8. Zhu G, Bethea TJ, Krishna V (2007) Extracting relevant named entities for automated expense
reimbursement. In: Proceedings of the 13th ACM SIGKDD international conference on knowl-
edge discovery and data mining. ACM
9. Wang H, Chen Y (2009) Log detection in document images based on boundary extension of
feature rectangles. IEEE
10. Park JH, Jang IH, Kim NC, Skew correction of business card images acquired in PDA
11. Aksoy S, Haralick RM (1998) Textural features for image database retrieval. Content-based
access of image and video libraries. IEEE
12. Conners RW, Harlow CA (1976) Some theoretical considerations concerning texture analysis of
radiographic images. In: Proceedings of IEEE conference on decision and control, pp 162–167
13. Flickner M (1993) The QBIC project: querying images by content using color texture and
shape. SPIE storage and retrieval of image and video databases
14. Ma WY, Manjunath BS (1997) NETRA: a toolbox for navigating large image databases. In:
Proceedings of ICIP
15. Pentland A (1994) Photobook: content-based manipulation of image databases. SPIE storage
and retrieval of image and video databases II, pp 34–47, 1994 Feb

IEEE Research Paper For Drowsiness Detection System Project
No ratings yet
IEEE Research Paper For Drowsiness Detection System Project
8 pages
Automatic License Plate Recognition (ALPR) : A State-of-the-Art Review
No ratings yet
Automatic License Plate Recognition (ALPR) : A State-of-the-Art Review
15 pages
Sample Resume of Waqar Baig
No ratings yet
Sample Resume of Waqar Baig
3 pages
EECS 442: Prof. David Fouhey Winter 2019, University of Michigan
No ratings yet
EECS 442: Prof. David Fouhey Winter 2019, University of Michigan
64 pages
Bag of Words
No ratings yet
Bag of Words
72 pages
Msc. Research Proposal Master Program
No ratings yet
Msc. Research Proposal Master Program
8 pages
Video Summarization Techniques and Applications
No ratings yet
Video Summarization Techniques and Applications
6 pages
Facial Landmark Detection: A Literature Survey
No ratings yet
Facial Landmark Detection: A Literature Survey
28 pages
Structure From Motion Photogrammetry in Physical Geography: M.W. Smith J.L. Carrivick D.J. Quincey
No ratings yet
Structure From Motion Photogrammetry in Physical Geography: M.W. Smith J.L. Carrivick D.J. Quincey
29 pages
Construction Project Monitoring
No ratings yet
Construction Project Monitoring
9 pages
Scale Invariant Feature Transform Plus Hue Feature
No ratings yet
Scale Invariant Feature Transform Plus Hue Feature
6 pages
BizCardX - Extracting Business Card Data With OCR
No ratings yet
BizCardX - Extracting Business Card Data With OCR
3 pages
selectiveSearchDraft PDF
No ratings yet
selectiveSearchDraft PDF
14 pages
Kortli 2020
No ratings yet
Kortli 2020
6 pages
Descriptor Matching With Convolutional Neural Networks: A Comparison To SIFT
No ratings yet
Descriptor Matching With Convolutional Neural Networks: A Comparison To SIFT
10 pages
Dog Breed Identification: Whitney Larow Brian Mittl Vijay Singh
No ratings yet
Dog Breed Identification: Whitney Larow Brian Mittl Vijay Singh
7 pages
A Review On Image Feature Detection and Description
No ratings yet
A Review On Image Feature Detection and Description
4 pages
Zhang VAIS A Dataset 2015 CVPR Paper
No ratings yet
Zhang VAIS A Dataset 2015 CVPR Paper
7 pages
A Business Card Reader Application For iOS Devices Based On Tesseract
No ratings yet
A Business Card Reader Application For iOS Devices Based On Tesseract
4 pages
Missing Child Identification Using Deep Learning
No ratings yet
Missing Child Identification Using Deep Learning
16 pages
ETE-DIP Solution
No ratings yet
ETE-DIP Solution
15 pages
Liu 2020
No ratings yet
Liu 2020
9 pages
Review 3
No ratings yet
Review 3
17 pages
Chương 7 - Trắc nghiệm kiến thức - Attempt review
No ratings yet
Chương 7 - Trắc nghiệm kiến thức - Attempt review
13 pages
Lecture 03
No ratings yet
Lecture 03
82 pages
Chương 7 - Trắc nghiệm kiến thức - Attempt review
No ratings yet
Chương 7 - Trắc nghiệm kiến thức - Attempt review
12 pages
An Image Dataset of Bishnupur Terracotta Temples For Digital Heritage Research
No ratings yet
An Image Dataset of Bishnupur Terracotta Temples For Digital Heritage Research
23 pages
Batch 20
No ratings yet
Batch 20
17 pages
Sunspot Identification and Tracking With OpenCV
No ratings yet
Sunspot Identification and Tracking With OpenCV
6 pages
Yolo11 Car
No ratings yet
Yolo11 Car
16 pages
Unit - 3
No ratings yet
Unit - 3
42 pages
ICDAR2021-Information Extraction From Invoices
No ratings yet
ICDAR2021-Information Extraction From Invoices
17 pages
Localization of Autonomous Drone For Telecommunication Tower Inspection and Monitoring Using Computer Vision
No ratings yet
Localization of Autonomous Drone For Telecommunication Tower Inspection and Monitoring Using Computer Vision
5 pages
Data Science Document Processing & Structuring Project
No ratings yet
Data Science Document Processing & Structuring Project
6 pages
Minor 2
No ratings yet
Minor 2
4 pages
Deep Learning in Data Analytics Recent Techniques Practices and Applications 1st Edition Debi Prasanna Acharjya PDF Download
No ratings yet
Deep Learning in Data Analytics Recent Techniques Practices and Applications 1st Edition Debi Prasanna Acharjya PDF Download
76 pages
Ontotext GraphDB in Practice: The Complete Guide for Developers and Engineers
From Everand
Ontotext GraphDB in Practice: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Apache Arrow Dataset in Practice: The Complete Guide for Developers and Engineers
From Everand
Apache Arrow Dataset in Practice: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Learn SAP Basis in 24 Hours
From Everand
Learn SAP Basis in 24 Hours
Alex Nordeen
4.5/5 (2)
Apex Programming Solutions: Definitive Reference for Developers and Engineers
From Everand
Apex Programming Solutions: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Data Lakes & Pipelines: A Modern Azure Guide
From Everand
Data Lakes & Pipelines: A Modern Azure Guide
Kameron Hussain
No ratings yet
Tesseract OCR Essentials: Definitive Reference for Developers and Engineers
From Everand
Tesseract OCR Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
T-SQL Techniques and Best Practices: Definitive Reference for Developers and Engineers
From Everand
T-SQL Techniques and Best Practices: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Amazon EMR Solutions in Cloud Computing: Definitive Reference for Developers and Engineers
From Everand
Amazon EMR Solutions in Cloud Computing: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Comprehensive Guide to SAS Programming: Definitive Reference for Developers and Engineers
From Everand
Comprehensive Guide to SAS Programming: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Dgraph Essentials: The Complete Guide for Developers and Engineers
From Everand
Dgraph Essentials: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Azure Data Demystified: From SQL to Synapse
From Everand
Azure Data Demystified: From SQL to Synapse
Kameron Hussain
No ratings yet
Advanced Apache Tez Techniques: Definitive Reference for Developers and Engineers
From Everand
Advanced Apache Tez Techniques: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Pandas Essentials for Data Analysis: Definitive Reference for Developers and Engineers
From Everand
Pandas Essentials for Data Analysis: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Teradata Architecture and SQL Essentials: Definitive Reference for Developers and Engineers
From Everand
Teradata Architecture and SQL Essentials: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Realm Database in Mobile Application Development: Definitive Reference for Developers and Engineers
From Everand
Realm Database in Mobile Application Development: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
CrateDB for IoT and Machine Data: The Complete Guide for Developers and Engineers
From Everand
CrateDB for IoT and Machine Data: The Complete Guide for Developers and Engineers
William Smith
No ratings yet
Efficient Data Preparation with AWS Glue DataBrew: Definitive Reference for Developers and Engineers
From Everand
Efficient Data Preparation with AWS Glue DataBrew: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Textract Workflows and Applications: Definitive Reference for Developers and Engineers
From Everand
Textract Workflows and Applications: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Mastering Apache Arrow: Accelerating Data Processing and In-Memory Analytics
From Everand
Mastering Apache Arrow: Accelerating Data Processing and In-Memory Analytics
Robert Johnson
No ratings yet
Dataproc Administration and Engineering Solutions: Definitive Reference for Developers and Engineers
From Everand
Dataproc Administration and Engineering Solutions: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Automatic Image Annotation: Enhancing Visual Understanding through Automated Tagging
From Everand
Automatic Image Annotation: Enhancing Visual Understanding through Automated Tagging
Fouad Sabry
No ratings yet
Semantic Translation: Fundamentals and Applications
From Everand
Semantic Translation: Fundamentals and Applications
Fouad Sabry
No ratings yet
Airflow for Data Workflow Automation
From Everand
Airflow for Data Workflow Automation
Richard Johnson
No ratings yet
Automatic Image Annotation: Fundamentals and Applications
From Everand
Automatic Image Annotation: Fundamentals and Applications
Fouad Sabry
No ratings yet
Java Data Structures Explained: A Practical Guide with Example
From Everand
Java Data Structures Explained: A Practical Guide with Example
William E. Clark
No ratings yet
JavaScript Data Structures Explained: A Practical Guide with Examples
From Everand
JavaScript Data Structures Explained: A Practical Guide with Examples
William E. Clark
No ratings yet
Data Structures Explained: A Practical Guide with Examples
From Everand
Data Structures Explained: A Practical Guide with Examples
William E. Clark
No ratings yet
C++ Data Structures Explained: A Practical Guide with Examples
From Everand
C++ Data Structures Explained: A Practical Guide with Examples
William E. Clark
No ratings yet
Lexicon of Computer Science Terminology: Lexicon of Tech and Business, #16
From Everand
Lexicon of Computer Science Terminology: Lexicon of Tech and Business, #16
Mustafa Al-Dori
4/5 (1)
Databricks Essentials: A Guide to Unified Data Analytics
From Everand
Databricks Essentials: A Guide to Unified Data Analytics
Robert Johnson
No ratings yet
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Learn Hadoop in 24 Hours
From Everand
Learn Hadoop in 24 Hours
Alex Nordeen
No ratings yet
Study Guide Cisco 300-735 SAUTO Automating and Programming Cisco Security Solutions Exam
From Everand
Study Guide Cisco 300-735 SAUTO Automating and Programming Cisco Security Solutions Exam
Anand Vemula
No ratings yet
Java / J2EE Interview Questions You'll Most Likely Be Asked
From Everand
Java / J2EE Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Ultimate Azure Data Engineering: Build Robust Data Engineering Systems on Azure with SQL, ETL, Data Modeling, and Power BI for Business Insights and Crack Azure Certifications (English Edition)
From Everand
Ultimate Azure Data Engineering: Build Robust Data Engineering Systems on Azure with SQL, ETL, Data Modeling, and Power BI for Business Insights and Crack Azure Certifications (English Edition)
Ashish Agarwal
No ratings yet
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
From Everand
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
Eric Tome
No ratings yet
Mastering Data Structures and Algorithms in Python & Java
From Everand
Mastering Data Structures and Algorithms in Python & Java
Sachin Naha
No ratings yet
DATABASE From the conceptual model to the final application in Access, Visual Basic, Pascal, Html and Php: Inside, examples of applications created with Access, Visual Studio, Lazarus and Wamp
From Everand
DATABASE From the conceptual model to the final application in Access, Visual Basic, Pascal, Html and Php: Inside, examples of applications created with Access, Visual Studio, Lazarus and Wamp
Olga Maria Stefania Cucaro
No ratings yet
C# Data Structures and Algorithms: Harness the power of C# to build a diverse range of efficient applications
From Everand
C# Data Structures and Algorithms: Harness the power of C# to build a diverse range of efficient applications
Marcin Jamro
No ratings yet
AWS Certified Solutions Architect - Professional
From Everand
AWS Certified Solutions Architect - Professional
VB Dev
No ratings yet
Artificial Intelligence 2024 Book 2 of 2: AI, #2
From Everand
Artificial Intelligence 2024 Book 2 of 2: AI, #2
Yang Yen Thaw
No ratings yet
The Complete Guide to Technology & Programming
From Everand
The Complete Guide to Technology & Programming
MATHY WISDOM
No ratings yet
Mastering C: Advanced Techniques and Tricks
From Everand
Mastering C: Advanced Techniques and Tricks
Ted Norice
No ratings yet
CODING FOR ABSOLUTE BEGINNERS: How to Keep Your Data Safe from Hackers by Mastering the Basic Functions of Python, Java, and C++ (2022 Guide for Newbies)
From Everand
CODING FOR ABSOLUTE BEGINNERS: How to Keep Your Data Safe from Hackers by Mastering the Basic Functions of Python, Java, and C++ (2022 Guide for Newbies)
Eric Vargas
No ratings yet
Algorithms and Data Structures: An Easy Guide to Programming Skills
From Everand
Algorithms and Data Structures: An Easy Guide to Programming Skills
Rigdon Jonathan
No ratings yet
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
SQL Programming & Database Management For Noobee
From Everand
SQL Programming & Database Management For Noobee
Kishor Sarkar X
No ratings yet
Introduction to Oracle Database Administration
From Everand
Introduction to Oracle Database Administration
Ying Wang
5/5 (1)
THE SQL LANGUAGE: Master Database Management and Unlock the Power of Data (2024 Beginner's Guide)
From Everand
THE SQL LANGUAGE: Master Database Management and Unlock the Power of Data (2024 Beginner's Guide)
JAMIE POWERS
No ratings yet
XML Programming: The Ultimate Guide to Fast, Easy, and Efficient Learning of XML Programming
From Everand
XML Programming: The Ultimate Guide to Fast, Easy, and Efficient Learning of XML Programming
Christopher Right
2.5/5 (2)
Data Structures I Essentials
From Everand
Data Structures I Essentials
Dennis Smolarski
No ratings yet
Concise Oracle Database For People Who Has No Time
From Everand
Concise Oracle Database For People Who Has No Time
Billy Aung Myint
No ratings yet
IBM WebSphere eXtreme Scale 6
From Everand
IBM WebSphere eXtreme Scale 6
Anthony Chaves
No ratings yet
SQLite Database Programming for Xamarin: Cross-platform C# database development for iOS and Android using SQLite.XM
From Everand
SQLite Database Programming for Xamarin: Cross-platform C# database development for iOS and Android using SQLite.XM
Anthony Serpico
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

10.1007@978 981 13 6577 535

Uploaded by

10.1007@978 981 13 6577 535

Uploaded by

Automation of Business Cards

Shreya Srivastava, Suryanshu Sahay, Deepti Mehrotra and Vikas Deep

Keywords Business card · OCR · Tesseract · Logo detection

S. Srivastava (B) · S. Sahay · D. Mehrotra · V. Deep

© Springer Nature Singapore Pte Ltd. 2019 371

Fig. 1 Data flow diagram

Fig. 2 Home page of the application

Fig. 3 Manually editing the fields

Fig. 4 a Implementation of SURF. b Implementation of SURF

Fig. 5 Harris corner detection

Fig. 6 Implementation of SURF

Fig. 7 Implementation of Gabor Filter

3 Results and Discussions

Fig. 8 The OCR performed text in the database

Fig. 9 Business card list

Fig. 10 Logo extraction using Gabor Filter

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.