Optimizing Speech Models with Freezing
Abstract: Adapting speech models to new languages requires optimizing the trade-off between accuracy and
computational cost. In this work, we investigate the adaptation of Mozilla’s DeepSpeech model from English to German and
Swiss German through selective freezing of layers. Using a transfer learning strategy, we analyze how freezing different
numbers of network layers during fine-tuning affects performance. The experiments reveal that freezing the initial layers
yields significant improvements: training time decreases and accuracy increases. This layer-freezing technique hence offers
a scalable way to improve automatic speech recognition for under-resourced languages.
Keywords: Automatic Speech Recognition (ASR); Deep Speech; German; Layer Freezing; Low-Resource Languages; Swiss
German; Transfer Learning.
How to Cite: Revanth Reddy Pasula; (2025). Optimizing Speech Models with Freezing. International Journal of Innovative Science
and Research Technology, (RISEM–2025), 69-73. https://doi.org/10.38124/ijisrt/25jun167
This section describes the Deep Speech architecture, the training procedure and layer freezing, the hyperparameters and computing environment, and dataset preparation as well as the preprocessing pipeline.

A. Deep Speech Architecture
Mozilla’s DeepSpeech, version 0.7, was utilized as the base ASR architecture. The implementation, which deviates only minimally from the model originally proposed by Hannun et al. [1], is documented in greater detail in the official documentation². The processing pipeline starts with MFCC [8] extraction from the raw audio input, followed by six layers forming a deep recurrent neural network. The network structure is shown in Table I. Briefly, layers 1–3 are ReLU-activated fully connected layers, layer 4 is an LSTM recurrent layer [11], layer 5 is an additional fully connected, ReLU-activated layer, and layer 6 is the output layer that generates character probabilities through a softmax. The model is trained with the Connectionist Temporal Classification (CTC) loss [9] and optimized with the Adam optimizer [10].

Table I shows the DeepSpeech architecture and data flow, from input audio through feature extraction to output character probabilities (adapted from the official documentation).
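To make the layer structure concrete, the following is a minimal sketch of a DeepSpeech-style six-layer network. It is written in PyTorch purely for illustration (Mozilla’s DeepSpeech 0.7 is implemented in TensorFlow and differs in many details), and the layer widths, context window, and alphabet size are placeholder values, not the actual configuration.

```python
# Minimal sketch of a DeepSpeech-style network (illustrative only).
import torch
import torch.nn as nn

class DeepSpeechSketch(nn.Module):
    def __init__(self, n_mfcc=26, n_context=9, n_hidden=2048, n_chars=29):
        super().__init__()
        n_input = n_mfcc * (2 * n_context + 1)    # MFCC frame plus context window
        self.fc1 = nn.Linear(n_input, n_hidden)   # layers 1-3: ReLU fully connected
        self.fc2 = nn.Linear(n_hidden, n_hidden)
        self.fc3 = nn.Linear(n_hidden, n_hidden)
        self.lstm = nn.LSTM(n_hidden, n_hidden, batch_first=True)  # layer 4: LSTM
        self.fc5 = nn.Linear(n_hidden, n_hidden)  # layer 5: ReLU fully connected
        self.out = nn.Linear(n_hidden, n_chars)   # layer 6: character logits

    def forward(self, x):                         # x: (batch, time, features)
        x = torch.relu(self.fc1(x))
        x = torch.relu(self.fc2(x))
        x = torch.relu(self.fc3(x))
        x, _ = self.lstm(x)
        x = torch.relu(self.fc5(x))
        return self.out(x)                        # softmax + CTC loss applied outside

# Training uses the CTC loss and the Adam optimizer, e.g.:
# ctc = nn.CTCLoss(blank=0)
# log_probs = nn.functional.log_softmax(model(batch), dim=-1).transpose(0, 1)
# loss = ctc(log_probs, targets, input_lengths, target_lengths)
# optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
```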
B. Training Procedure and Layer Freezing
We performed a series of training experiments to measure the effect of frozen layers in transfer learning. For weight initialization, we utilized an English pre-trained DeepSpeech model provided by Mozilla. Six training setups were run for both German and Swiss German; these are compiled in Table II. In addition, we trained one model entirely from scratch with random initialization as our baseline comparison point (labeled the “Reference” condition, with no transfer learning).

During fine-tuning, the frozen layers were marked as non-trainable, while the remaining layers were trained on the target data. The output layer of all transfer learning models was re-initialized, as the character set (output labels) differs between English and the target language. This re-initialization ensured compatibility with German or Swiss German transcripts.
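A minimal sketch of this freezing and re-initialization step, continuing the illustrative PyTorch model above (the actual DeepSpeech training code realizes this through its own checkpoint-loading mechanism): the first k layers keep their pre-trained English weights and receive no gradient updates, while the output layer is replaced to match the target alphabet.

```python
# Sketch of freezing the first n_frozen layers of a pre-trained model and
# re-initializing the output layer for a new character set (illustrative).
import torch
import torch.nn as nn

def prepare_for_transfer(model, n_frozen, n_target_chars):
    # Layer order as in Table I; names refer to the sketch above.
    layers = [model.fc1, model.fc2, model.fc3, model.lstm, model.fc5]

    # Frozen layers keep their English weights and receive no gradients.
    for layer in layers[:n_frozen]:
        for param in layer.parameters():
            param.requires_grad = False

    # The output layer is always re-initialized, because the German /
    # Swiss German character set differs from the English one.
    model.out = nn.Linear(model.out.in_features, n_target_chars)
    return model

# Only the unfrozen parameters are passed to the optimizer, e.g.:
# trainable = [p for p in model.parameters() if p.requires_grad]
# optimizer = torch.optim.Adam(trainable, lr=1e-4)
```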
C. Hyperparameters and Computational Environment
The same set of hyperparameters was used in all experiments (Table III), with no further tuning beyond these preselected values. Training was carried out on a Linux server with 96 Intel Xeon Platinum 8160 CPU cores. All models (both German and Swiss German) were trained for the same number of epochs under the same conditions, to allow an unbiased comparison of the different freezing strategies.
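Conceptually, the controlled comparison amounts to reusing one shared configuration across every run and varying only the language and the number of frozen layers; the sketch below enumerates such a grid. The values shown are illustrative placeholders only, not the hyperparameters listed in Table III.

```python
# Illustrative experiment grid: one shared hyperparameter set for every run
# (placeholder values, not the actual settings from Table III).
SHARED_CONFIG = {
    "epochs": 30,            # identical for every freezing strategy
    "batch_size": 24,
    "learning_rate": 1e-4,   # Adam, as in the base model
    "dropout": 0.25,
}

# None = train from scratch ("Reference"); 0-4 = number of frozen layers.
FREEZING_STRATEGIES = [None, 0, 1, 2, 3, 4]

def experiment_grid(languages=("de", "gsw")):   # illustrative language codes
    """Enumerate the twelve configurations reported in Tables VI and VII."""
    for language in languages:
        for n_frozen in FREEZING_STRATEGIES:
            yield {"language": language, "n_frozen": n_frozen, **SHARED_CONFIG}

for run in experiment_grid():
    print(run)
```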
D. Datasets and Preprocessing
The data we used for our experiments are tabulated in Table IV. For the German models, we utilized the training data from Mozilla’s German corpus [12]. This data comprises around 315 hours of speech, provided by about 4,823 speakers, with utterances lasting around 3 to 5 seconds. For the Swiss German models, we drew upon an even smaller dataset of 70 hours of Swiss German speech derived from Bernese parliamentary debates [13]. The Swiss German dataset covers formal speech by relatively few speakers (around 191), and its size is considerably smaller than that of the German corpus.
The initial English DeepSpeech model was trained on a much larger dataset (over 6,500 hours of English speech audio) aggregated from heterogeneous sources such as LibriSpeech and the English portion of the Common Voice dataset (more information can be found in footnote 5). Before training, all datasets underwent common preprocessing steps, for example audio normalization and cleaning of the transcript text (e.g. lowercasing and punctuation removal), to make them consistent. Table V contains an itemized description of each component of the dataset, as well as the preprocessing pipeline.

Along with the acoustic data, we used an external language model at inference time to enhance the accuracy of the recognizer. For this, we trained a tri-gram language model with the KenLM toolkit [14] on a large text corpus consisting of public-domain German-language text from Wikipedia articles and Europarl parliamentary debates. This language model was incorporated into the DeepSpeech decoder for both the German and the Swiss German trials, helping the system produce more accurate transcripts by providing language context.
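As an illustration of the transcript-cleaning step, the sketch below applies lowercasing, punctuation removal, and whitespace normalization to a German sentence; the exact rules of the actual pipeline are those described in Table V and may differ. The commented lines at the end show how a trained KenLM model can be queried from Python.

```python
# Minimal sketch of transcript normalization (illustrative; the actual rules
# of the preprocessing pipeline are described in Table V).
import re
import unicodedata

def normalize_transcript(text: str) -> str:
    text = unicodedata.normalize("NFC", text)     # canonical form for umlauts etc.
    text = text.lower()                           # lowercasing
    text = re.sub(r"[^a-zäöüß' ]", " ", text)     # drop punctuation/digits (German alphabet assumed)
    return re.sub(r"\s+", " ", text).strip()      # collapse whitespace

print(normalize_transcript("Grüezi! Das ist, ehm, ein Test."))
# -> "grüezi das ist ehm ein test"

# Scoring a sentence with a KenLM tri-gram model (requires the `kenlm`
# Python bindings and a trained model file, e.g. a hypothetical "de_trigram.binary"):
# import kenlm
# lm = kenlm.Model("de_trigram.binary")
# print(lm.score("das ist ein test", bos=True, eos=True))
```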
IV. RESULTS AND DISCUSSION

We compared the performance of the six training schemes in terms of word error rate (WER) and character error rate (CER) on test sets for both German and Swiss German. Table VI shows the WER and CER obtained by each model configuration for German, while Table VII shows the WER and CER for Swiss German. “Reference” in these tables denotes the model trained from scratch without any transfer learning, and the “Improvement” column shows the percentage-point improvement in WER with respect to that baseline.

For the German ASR task, the baseline model trained without any transfer learning obtained a WER of 70.0% with a CER of 42.0%. Employing the English pre-trained model with no frozen layers (0 frozen, full fine-tuning) reduced the WER to 63.0% (CER 37.0%), a modest improvement of only 7.0 points. However, partially freezing the initial layers produced much greater improvements. Freezing just the first layer improved the WER to 48.0% (CER 26.0%), a 22-point WER improvement over the baseline. Freezing the first two layers improved the WER further to 44.0% (CER 22.0%), the best performance and an improvement of 26 points over the baseline. Notably, two and three frozen layers showed the same WER (44.0%), meaning the third layer brought no additional improvement over the first two. With four frozen layers, performance actually declined slightly, with the WER increasing to 46.0% and the CER to 25.0%, though it remained significantly better than the baseline. All of this indicates that, for German, retaining the lower-level acoustic feature layers (up to two or three layers) gives the best result, significantly outperforming both the baseline and the fully fine-tuned model.

For Swiss German, we see the same pattern at a different magnitude. The baseline Swiss German model (no transfer) achieved a WER of 74.0% (CER 52.0%). Fine-tuning all model layers on Swiss German data (0 frozen) caused the WER to worsen slightly to 76.0%, which shows that such indiscriminate fine-tuning without freezing can overfit or mis-adapt on the small Swiss German corpus. Freezing the early layers, in contrast, worked: with one layer frozen, the WER improved to 69.0% (CER 48.0%), about a 5-point improvement over the baseline, and with two layers frozen, the WER further improved to 67.0% (CER 45.0%), the best performance for Swiss German (a 7-point improvement over the baseline). Freezing three or four layers brought no additional improvement (WER ~68.0% in each case, ~6 points better than the baseline). So, for Swiss German, freezing the first two layers of the pre-trained model provided the greatest improvement, with freezing beyond two layers bringing no further advantage and retaining approximately the same performance.

In total, selective freezing of layers resulted in notably improved accuracy for both languages compared with training from scratch. The advantage was particularly dramatic for German, with its larger dataset, where the transfer learning method reduced the WER by 26 absolute points.
Swiss German, with its much smaller dataset and higher dialectal variation, also showed improved performance with transfer learning, though the relative improvement was smaller. These results show that the lowest layers of the deep model encode general acoustic representations that are relevant across many languages. By holding these layers constant, the fine-tuning procedure can concentrate on adapting the higher-level layers to the target language’s idiosyncrasies. But freezing too many layers starts to restrict the model’s flexibility: the modest decline in performance when four layers were frozen indicates that the model required some of the later layers to adapt to language-specific features.

Interestingly, we found that models with varying numbers of frozen layers showed very comparable training convergence patterns. This suggests that retaining the pre-trained low-level feature extractors did not impede training; all models trained at around the same speed and simply reached different final accuracy levels depending on the number of layers updated. This result indicates that much of the key learning of the new language happens higher up in the model, once an effective set of building-block features is established.

Table VI below presents the performance of the different training strategies for German, and Table VII presents the corresponding results for Swiss German. Table VIII presents a high-level comparison of each language’s optimal freezing configuration and the resulting error rates.
Table VI German ASR Performance with Various Layer-Freezing Strategies (WER = Word Error Rate, CER = Character Error Rate). The Improvement Column Indicates the WER Improvement in Percentage Points Compared to the Baseline (Reference) Model.

Training Strategy                        WER (%)   CER (%)   WER Improvement
Reference (No Transfer; Random Init.)     70.0      42.0      —
0 Frozen Layers (Full fine-tuning)        63.0      37.0      +7.0
1 Frozen Layer                            48.0      26.0      +22.0
2 Frozen Layers                           44.0      22.0      +26.0
3 Frozen Layers                           44.0      22.0      +26.0
4 Frozen Layers                           46.0      25.0      +24.0
Table VII ASR Performance for Swiss German under Different Layer-Freezing Strategies.

Training Strategy                        WER (%)   CER (%)   WER Improvement
Reference (No Transfer; Random Init.)     74.0      52.0      —
0 Frozen Layers (Full fine-tuning)        76.0      54.0      –2.0
1 Frozen Layer                            69.0      48.0      +5.0
2 Frozen Layers                           67.0      45.0      +7.0
3 Frozen Layers                           68.0      47.0      +6.0
4 Frozen Layers                           68.0      46.0      +6.0
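The WER and CER figures in Tables VI and VII are standard edit-distance metrics. For reference, the following is a minimal Python sketch of how such metrics can be computed; it is an illustration, not the evaluation code used in these experiments.

```python
# Edit-distance based WER/CER computation (illustrative only).
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences."""
    d = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        prev, d[0] = d[0], i
        for j, h in enumerate(hyp, start=1):
            prev, d[j] = d[j], min(d[j] + 1,         # delete
                                   d[j - 1] + 1,     # insert
                                   prev + (r != h))  # substitute
    return d[-1]

def wer(reference: str, hypothesis: str) -> float:
    ref_words = reference.split()
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)

def cer(reference: str, hypothesis: str) -> float:
    return edit_distance(list(reference), list(hypothesis)) / len(reference)

print(wer("das ist ein test", "das ist kein test"))  # 0.25 -> 25.0% WER
```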
freeze), and demonstrate the potential of such a methodology for building scalable, robust, and computationally efficient multilingual ASR systems. Further studies in this area are likely to result in ASR technology that is more accessible across languages and dialects, and thus to increase the inclusivity of ASR systems globally.

FUTURE WORK

Future work should focus on refining this layer-freezing method and extending it to other models and languages. One direction is to optimize the selective layer-freezing strategy through more adaptive or dynamic approaches. For instance, the optimal number of frozen layers could be adjusted according to the size and quality of the target dataset, since the trade-off between preserving pre-learned features and adapting to the target can differ. It also remains to be investigated whether techniques that automatically determine which layers to freeze, or that gradually unfreeze layers, could further benefit performance. Another avenue is to try other pretrained models as the base: testing the freezing strategy on other state-of-the-art ASR models would reveal whether the gains achieved are consistent across different network designs, and could exploit richer pretrained representations for improved performance. In addition, an interesting direction is to extend the proposed transfer learning approach to multiple languages or more complex datasets to examine its generality. Finally, it will be important to apply the presented method to languages outside German and Swiss German (other language families, and languages with phonetic characteristics quite different from those considered here) in order to see whether the low-level acoustic features learnt from English can generally be employed successfully, or whether language-specific fine-tuning of those layers is required. Likewise, generalizing to more challenging and more diverse datasets (such as larger speech corpora with more speakers, dialectal variability, and noisier audio) is important to evaluate the robustness of the method in real-world settings. Such experiments would help confirm the effectiveness of the approach in multilingual settings and provide practical guidance for scalable and efficient speech model adaptation in low-resource scenarios.

REFERENCES

[1]. A. Hannun, C. Case, J. Casper, B. Catanzaro, G. Diamos, E. Elsen, R. Prenger, S. Satheesh, S. Sengupta, A. Coates, and A. Y. Ng, “Deep speech: Scaling up end-to-end speech recognition,” 2014.
[2]. A. Agarwal and T. Zesch, “German end-to-end speech recognition based on DeepSpeech,” in Proc. of the 15th Conf. on Natural Language Processing (KONVENS 2019): Long Papers, Erlangen, Germany: German Society for Computational Linguistics & Language Technology, 2019, pp. 111–119.
[3]. “LTL-UDE at low-resource speech-to-text shared task: Investigating Mozilla DeepSpeech in a low-resource setting,” 2020.
[4]. J. Kunze, L. Kirsch, I. Kurenkov, A. Krug, J. Johannsmeier, and S. Stober, “Transfer learning for speech recognition on a budget,” in Proc. of the 2nd Workshop on Representation Learning for NLP (RepL4NLP@ACL 2017), Vancouver, Canada, Aug. 2017, Association for Computational Linguistics, pp. 168–177.
[5]. M. Huh, P. Agrawal, and A. A. Efros, “What makes ImageNet good for transfer learning?” 2016.
[6]. B. Li, X. Wang, and H. S. M. Beigi, “Cantonese automatic speech recognition using transfer learning from Mandarin,” CoRR, 2019.
[7]. Y. Belinkov and J. Glass, “Analyzing hidden representations in end-to-end automatic speech recognition systems,” in Advances in Neural Information Processing Systems, vol. 30, 2017, pp. 2441–2451.
[8]. S. Imai, “Cepstral analysis synthesis on the mel frequency scale,” in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP’83), vol. 8, 1983, pp. 93–96.
[9]. A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber, “Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks,” in Proc. of the 23rd International Conference on Machine Learning, 2006, pp. 369–376.
[10]. D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” 2014.
[11]. S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997.
[12]. R. Ardila, M. Branson, K. Davis, M. Kohler, J. Meyer, M. Henretty, R. Morais, L. Saunders, F. M. Tyers, and G. Weber, “Common Voice: A massively-multilingual speech corpus,” in Proc. of the 12th Language Resources and Evaluation Conference (LREC 2020), Marseille, France, May 2020, European Language Resources Association, pp. 4218–4222.
[13]. M. Plüss, L. Neukom, and M. Vogel, “GermEval 2020 Task 4: Low-resource speech-to-text,” 2020.
[14]. K. Heafield, “KenLM: Faster and smaller language model queries,” in Proc. of the 6th Workshop on Statistical Machine Translation, Association for Computational Linguistics, 2011, pp. 187–197.
[15]. M. Schröder and J. Trouvain, “The German text-to-speech synthesis system MARY: A tool for research, development and teaching,” International Journal of Speech Technology, vol. 6, no. 4, pp. 365–377, 2003.