ChatGPT - MyLearning On Dataset Aksara Jawa

Uploaded by

tbudiono

Dataset Aksara Jawa by ChatGPT

Introduction

The purpose of this ChatGPT thread is to learn about Aksara Jawa (Javanese script) datasets.

Method of Learning
By asking questions.

Q1: Give examples of a simple dataset of Aksara Jawa that can
be used later to train a machine learning algorithm for
Javanese character recognition.

Here’s a simple dataset of Aksara Jawa (Javanese script) to help you get
started with training a machine learning algorithm for Javanese Character
Recognition. This dataset includes a mix of basic characters, diacritics, and
numbers.

Simple Dataset Format

Structure:
• Images of individual characters saved in directories corresponding to their labels.
• Labels represented by Unicode characters or human-readable names.

Dataset Directory Example:
dataset/
    training/
        ꦲ/              # Ha
            img1.png
            img2.png
        ꦏ/              # Ka
            img1.png
            img2.png
    validation/
        ꦲ/              # Ha
            img1.png
            img2.png
        ꦏ/              # Ka
            img1.png
            img2.png
    labels.csv          # Optional

Example CSV File for Labels:

image_path,label
training/ꦲ/img1.png,ꦲ
training/ꦏ/img1.png,ꦏ
validation/ꦲ/img1.png,ꦲ
validation/ꦏ/img1.png,ꦏ
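
The optional labels.csv can be derived from the directory layout instead of being written by hand. The following is a minimal sketch, assuming the layout shown above (split folder, then label folder, then PNG files); the function name write_labels_csv is hypothetical and not part of the original answer.

```python
import csv
from pathlib import Path


def write_labels_csv(dataset_root, csv_name="labels.csv"):
    """Write one (image_path, label) row per PNG found under <split>/<label>/."""
    root = Path(dataset_root)
    rows = []
    for split in ("training", "validation"):
        split_dir = root / split
        if not split_dir.is_dir():
            continue  # tolerate a missing split folder
        for image_path in sorted(split_dir.glob("*/*.png")):
            label = image_path.parent.name  # the folder name is the label
            rows.append((image_path.relative_to(root).as_posix(), label))
    # utf-8 keeps Javanese folder names intact in the CSV
    with open(root / csv_name, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["image_path", "label"])
        writer.writerows(rows)
    return rows
```

Calling write_labels_csv("dataset") would produce rows in the same image_path,label shape as the example file, with paths relative to the dataset folder.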

Example Characters
Basic Javanese Characters

Unicode    Character    Name
U+A9B2     ꦲ            Ha
U+A98F     ꦏ            Ka
U+A992     ꦒ            Ga
U+A9A4     ꦤ            Na
U+A9A5     ꦥ            Pa

Diacritics (Sandhangan)

Unicode    Character    Name
U+A9B4     ꦴ            Tarung
U+A9B5     ꦵ            Tolong
U+A9B6     ꦶ            Wulu
U+A9B7     ꦷ            Wulu Melik

Numbers

Unicode    Character    Name
U+A9D0     ꧐            0
U+A9D1     ꧑            1
U+A9D2     ꧒            2
U+A9D3     ꧓            3
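
Code points and names for Javanese characters can be cross-checked against Python's built-in Unicode database before they are used as labels. This is a small sketch using the standard unicodedata module; the helper name describe and the sample list are illustrative assumptions.

```python
import unicodedata

# Sample characters taken from the tables in this answer.
samples = ["ꦲ", "ꦏ", "ꦴ", "꧐"]


def describe(char):
    """Return the code point (as U+XXXX) and the official Unicode name."""
    return f"U+{ord(char):04X}", unicodedata.name(char, "UNKNOWN")


for char in samples:
    code_point, name = describe(char)
    print(char, code_point, name)
```

Running a check like this on every label catches mismatches between a character and the code point recorded for it.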

Example Training Dataset
Here’s a small sample with synthetic images created using Python’s PIL (Pillow)
library and the Noto Sans Javanese font.
Dataset Example with Labels:

Image              Label (Unicode)    Label (Name)
(image omitted)    ꦲ                  Ha
(image omitted)    ꦏ                  Ka
(image omitted)    ꦒ                  Ga
(image omitted)    ꦤ                  Na
(image omitted)    ꧑                  1

Python Code to Generate Dataset

from PIL import Image, ImageDraw, ImageFont
import os

# Characters and labels
javanese_characters = [
    {"char": "ꦲ", "label": "Ha"},
    {"char": "ꦏ", "label": "Ka"},
    {"char": "ꦒ", "label": "Ga"},
    {"char": "ꦤ", "label": "Na"},
    {"char": "ꦥ", "label": "Pa"},
]
output_dir = "dataset/training"
font_path = "NotoSansJavanese-Regular.ttf"  # the font file must exist at this path

# Load the font once rather than once per image
font = ImageFont.truetype(font_path, 28)

# Create one directory per label
for char_info in javanese_characters:
    char_dir = os.path.join(output_dir, char_info["label"])
    os.makedirs(char_dir, exist_ok=True)

# Generate images
for char_info in javanese_characters:
    char_dir = os.path.join(output_dir, char_info["label"])
    for i in range(10):  # 10 images per character (identical until augmented)
        img = Image.new("L", (32, 32), color=255)  # white grayscale canvas
        draw = ImageDraw.Draw(img)
        draw.text((4, 0), char_info["char"], font=font, fill=0)  # draw character in black
        img.save(os.path.join(char_dir, f"{char_info['label']}_{i}.png"))

Tips for Expanding Dataset

1. Handwritten Samples:
o Collect handwritten samples using drawing tablets or scanning
handwritten text.
2. Augmentation:
o Rotate, scale, or distort images to simulate real-world conditions.

3. Diacritics Combination:
o Generate samples with base characters combined with diacritics
(e.g., ꦏꦶ).
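
The augmentation tip can be sketched with Pillow, the same library used in the generation script. This is a minimal illustration rather than a tuned pipeline: the function name augment, the rotation range of plus or minus 10 degrees, and the 0.85 to 1.0 scale range are all assumptions chosen for the example.

```python
import random

from PIL import Image


def augment(img, rng):
    """Return a randomly rotated and slightly shrunk copy on a white canvas."""
    img = img.convert("L")  # work in grayscale, as in the generated dataset
    width, height = img.size

    # Small random rotation; uncovered corners are filled with white.
    angle = rng.uniform(-10, 10)
    rotated = img.rotate(angle, resample=Image.BICUBIC, fillcolor=255)

    # Random shrink, then paste centered so the output size is unchanged.
    scale = rng.uniform(0.85, 1.0)
    new_w = max(1, round(width * scale))
    new_h = max(1, round(height * scale))
    shrunk = rotated.resize((new_w, new_h), Image.BICUBIC)
    canvas = Image.new("L", (width, height), color=255)
    canvas.paste(shrunk, ((width - new_w) // 2, (height - new_h) // 2))
    return canvas
```

Passing a seeded random.Random instance, for example augment(img, random.Random(42)), makes the generated variants reproducible.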

How to Use the Dataset

1. Preprocessing:
o Normalize images (resize, grayscale, or binary format).

2. Model Training:
o Use a Convolutional Neural Network (CNN) for character recognition.

3. Evaluation:
o Validate on unseen combinations or real-world samples.
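
The preprocessing step (resize, grayscale, binary format) might look like the sketch below. It assumes Pillow, a 32x32 target size to match the generated images, and a fixed threshold of 128; the function names preprocess and to_vector are hypothetical.

```python
from PIL import Image


def preprocess(img, size=(32, 32), threshold=128):
    """Convert to grayscale, resize, and binarize to pure black and white."""
    gray = img.convert("L")
    resized = gray.resize(size, Image.BICUBIC)
    # Pixels at or above the threshold become white (255), the rest black (0).
    return resized.point(lambda p: 255 if p >= threshold else 0)


def to_vector(img):
    """Flatten a mode "L" image into a list of floats in the range 0.0 to 1.0."""
    return [value / 255 for value in img.tobytes()]
```

A fixed threshold is the simplest choice; scanned or photographed samples with uneven lighting may need an adaptive method instead.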

This simple dataset can serve as a starting point for training models to recognize
Javanese script. Let me know if you need further assistance with dataset
generation, augmentation, or model training!

Q2: Give examples of simple datasets of Aksara Jawa in the
form of real images that I can download and use later
to train a machine learning algorithm for Javanese character
recognition.

Using existing datasets can significantly speed up the development of a machine
learning model for Javanese character recognition. Here are some
publicly available datasets containing real images of Aksara Jawa:

1. Hanacaraka Dataset
• Description: This dataset comprises digital handwritten images of the 20
basic Javanese characters, commonly known as Hanacaraka. Each
character class contains approximately 75 images, with resolutions below
500x500 pixels.
• Access: Available on GitHub.
• Usage: Suitable for training models focused on recognizing individual
handwritten Javanese characters.

2. Baksara Dataset
• Description: The Baksara dataset includes 164 images of Aksara Jawa
characters, annotated for object detection tasks. It also provides a
pre-trained model and an API for inference.
• Access: Hosted on Roboflow Universe.
• Usage: Ideal for developing object detection models that identify and
locate Javanese characters within images.

3. Aksara Jawa Object Detection Dataset
• Description: This dataset contains 85 images of Hanacaraka characters,
annotated for object detection purposes. It supports various formats
compatible with popular machine learning frameworks.
• Access: Available on Roboflow Universe.
• Usage: Beneficial for training models that detect and classify Javanese
script characters in diverse contexts.

4. Javanese Handwriting Object Detection Dataset
• Description: Comprising 1,464 images, this dataset focuses on
handwritten Javanese characters, annotated for object detection. It
includes a pre-trained model and API for immediate testing.
• Access: Accessible via Roboflow Universe.
• Usage: Suitable for projects aiming to recognize and locate handwritten
Javanese characters in images.

5. Aksara Jawa Custom Dataset
• Description: This dataset offers 2,154 training images and 480
evaluation images of handwritten Javanese characters. Each image is in
RGB format with a resolution of 224x224 pixels.
• Access: Referenced in academic publications; details can be found in
related research articles.
• Usage: Appropriate for training and evaluating models on standardized
handwritten Javanese character images.

6. Hanacaraka AI Project Dataset
• Description: Utilized in the Hanacaraka AI project, this dataset contains
images of handwritten Javanese characters. The project includes models
trained on this dataset, achieving up to 96% validation accuracy.
• Access: Details and access provided through the project's GitHub
repository.
• Usage: Useful for understanding model training processes and
performance benchmarks on Javanese character recognition tasks.

Recommendations for Use:
• Data Augmentation: Enhance model robustness by applying
transformations such as rotation, scaling, and noise addition to the
images.
• Preprocessing: Standardize image sizes and formats to align with the
input requirements of your machine learning model.
• Evaluation: Utilize the provided evaluation datasets to assess model
performance and ensure generalization to unseen data.

By leveraging these datasets, you can effectively train and evaluate machine
learning models for Javanese Character Recognition, facilitating advancements in
the digitization and preservation of Javanese script.

Q3: What are the criteria for a good dataset for Javanese
character recognition?

A good dataset for Javanese character recognition must meet several key
criteria to ensure the effective training, validation, and testing of machine
learning models. These criteria encompass aspects like diversity, quality, labeling
accuracy, and scalability. Below are the essential criteria:

1. Completeness
• All Relevant Characters:
o Include all basic characters (the consonant letters U+A98F to U+A9B2,
such as ꦏ and ꦲ), numbers (꧐ to ꧙, U+A9D0 to U+A9D9),
diacritics (sandhangan), and special symbols.
o Account for character combinations, such as base letters with
diacritics.
• Balanced Representation:
o Ensure that all classes (characters) have a comparable number of
samples to prevent model bias.
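
Class balance can be checked by counting images per label folder. The sketch below assumes one sub-folder per class containing PNG files, as in the directory example from Q1; class_counts and imbalance_ratio are hypothetical helper names.

```python
from collections import Counter
from pathlib import Path


def class_counts(split_dir):
    """Count PNG images per label folder inside one split directory."""
    counts = Counter()
    for image_path in Path(split_dir).glob("*/*.png"):
        counts[image_path.parent.name] += 1
    return counts


def imbalance_ratio(counts):
    """Largest class size divided by smallest; 1.0 means perfectly balanced."""
    if not counts:
        raise ValueError("no images found")
    return max(counts.values()) / min(counts.values())
```

A ratio well above 1.0 suggests collecting or augmenting more samples for the smaller classes. Note that a class with no images at all does not appear in the counts, so the expected label list should be checked separately.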

2. Diversity
• Styles:
o Include multiple styles:
  ▪ Printed characters (using fonts like Noto Sans Javanese).
  ▪ Handwritten characters from various individuals to account
  for variability in handwriting.
• Sources:
o Use diverse sources such as scanned documents, handwritten
notes, and digital text.
• Variability:
o Include variations in:
  ▪ Font sizes and styles.
  ▪ Lighting conditions.
  ▪ Background textures.

3. Quality
• Image Resolution:
o Ensure all images are of sufficient resolution (e.g., at least 28x28
pixels for simple models, 224x224 pixels for complex models).
• Clarity:
o Avoid blurry or distorted images.
o Use preprocessing (e.g., noise reduction, contrast adjustment) to
improve image quality.
• Uniform Dimensions:
o Standardize image dimensions and formats to simplify model
training.

4. Labeling Accuracy
• Correct Labels:
o Each image must be labeled accurately with the corresponding
Unicode character or a descriptive label (e.g., ꦲ, ꦏ, ꦩ).
• Consistent Labeling:
o Use a uniform format for labels (e.g., Unicode or text-based).
• Diacritic Annotation:
o Clearly label combinations of base characters and diacritics.
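
One way to label base-plus-diacritic combinations consistently is to derive the label from the code points themselves. The sketch below is an illustration under assumptions: it treats any character whose Unicode general category starts with "M" (a combining mark) as a diacritic, and the underscore-joined label format is a hypothetical convention, not an established standard.

```python
import unicodedata


def split_cluster(text):
    """Separate a short string into its base letters and its combining marks."""
    base = [c for c in text if not unicodedata.category(c).startswith("M")]
    marks = [c for c in text if unicodedata.category(c).startswith("M")]
    return "".join(base), marks


def cluster_label(text):
    """Build an ASCII-only label from the code points, e.g. for folder names."""
    return "_".join(f"U+{ord(c):04X}" for c in text)
```

For a base letter followed by a vowel sign, split_cluster returns the letter and a one-element list holding the sign, and cluster_label gives a stable name that does not depend on font rendering.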

5. Scalability
• Size:
o Include a sufficient number of samples per character (e.g., hundreds
or thousands) to ensure model generalization.
• Expandable:
o Design the dataset structure to allow easy addition of new samples
or classes.

6. Noise and Disturbances
• Include realistic challenges such as:
o Handwritten samples with varying levels of neatness.
o Printed text with distortions, smudges, or noise (e.g., low-quality
scans).
o Images with overlapping or connected characters.

7. Data Augmentation
• Support augmentation techniques to increase diversity without manual
data collection:
o Rotation, scaling, and skewing.
o Brightness and contrast adjustments.
o Adding noise or simulating handwritten imperfections.

8. Balanced Dataset Split
• Training, Validation, and Test Sets:
o Split the dataset appropriately (e.g., 70% training, 15% validation,
15% testing).
• Avoid Data Leakage:
o Ensure there is no overlap between training, validation, and test
sets.
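
A 70/15/15 split that keeps every class represented and avoids overlap can be sketched with the standard library alone. The function name stratified_split, the (path, label) input format, and the fixed seed are assumptions made for this example.

```python
import random
from collections import defaultdict


def stratified_split(samples, ratios=(0.70, 0.15, 0.15), seed=0):
    """Split (path, label) pairs per class into training/validation/test."""
    by_label = defaultdict(list)
    for path, label in samples:
        by_label[label].append(path)

    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    splits = {"training": [], "validation": [], "test": []}
    for label in sorted(by_label):
        paths = sorted(set(by_label[label]))  # drop repeated paths
        rng.shuffle(paths)
        n_train = round(len(paths) * ratios[0])
        n_val = round(len(paths) * ratios[1])
        # Slices are disjoint, so no path lands in two splits.
        splits["training"] += [(p, label) for p in paths[:n_train]]
        splits["validation"] += [(p, label) for p in paths[n_train:n_train + n_val]]
        splits["test"] += [(p, label) for p in paths[n_train + n_val:]]
    return splits
```

This only prevents the same file path from appearing twice. Augmented copies of one source image should also be kept in a single split, which requires grouping by source image before splitting.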

9. Open Accessibility
• Public Availability:
o If sharing the dataset, ensure it complies with copyright laws and
privacy concerns.
• Documentation:
o Provide clear documentation explaining the dataset structure,
format, and labeling system.

10. Ethical Considerations
• Representation:
o Include characters written by individuals from different
demographics to avoid biases.
• Cultural Respect:
o Properly acknowledge the cultural significance of the Javanese
script.

Checklist for a Good Dataset

Criteria                  Requirements
Completeness              All characters, numbers, diacritics, and combinations included.
Diversity                 Variations in style, source, and appearance.
Quality                   Clear, high-resolution images with uniform dimensions.
Labeling Accuracy         Accurate, consistent labels for each character.
Scalability               Large, expandable dataset with balanced classes.
Noise and Disturbance     Realistic challenges like handwriting variability or noise.
Data Augmentation         Support transformations for diversity.
Balanced Split            Proper distribution into training, validation, and test sets.
Accessibility             Publicly available with clear documentation.
Ethical Considerations    Inclusive and respectful of cultural context.

Conclusion
A good dataset for Javanese character recognition must comprehensively cover
all characters, include diverse sources and styles, and maintain high-quality
images and accurate labels. These criteria ensure that the resulting machine
learning models perform effectively in real-world applications.
Would you like assistance in creating or finding such a dataset?
