
Project Report on Summarization and Translation

21BDS057 - Ravi Ranjan


21BDS036 - Manish Kumar
21BDS028 - Kola Kiriti Kumar
21BDS042 - Dilli Babu N

Under the guidance of


Dr. Sunil Saumya
Asst. Prof., Dept. of Data Science And Intelligent Systems

DEPARTMENT OF DATA SCIENCE AND ARTIFICIAL INTELLIGENCE


INDIAN INSTITUTE OF INFORMATION TECHNOLOGY DHARWAD

December 12, 2023

Contents

1 Introduction

2 Related Work
  2.1 Thematic Structure
  2.2 Rhetorical Roles

3 Data and Methods
  3.1 Data
  3.2 Models
  3.3 Problems
    3.3.1 Sequence Length
    3.3.2 Solutions
  3.4 Computation Limit
    3.4.1 Solutions
  3.5 Methodologies

4 Results and Discussions
  4.1 ROUGE
  4.2 Pre-trained Model Results

5 Conclusion

6 References

1 Introduction
Summarization is a well-defined problem in Natural Language Processing (NLP) wherein lengthy
documents belonging to a certain domain (e.g., health, legal cases, lecture notes) are
reduced to digestible, shorter paragraphs. Automatic summarization generally confronts issues
with understanding context, clustering related content, and reconstructing the selected statements into a summary.

Summarization tasks generally have two methodologies:


• Abstractive Summarization: In this approach, the model attempts to generate novel
statements using the knowledge it has of the domain.
• Extractive Summarization: In this approach, the model ranks and selects essential
statements from the source document to compose a summary.

This report proposes an implementation of an extractive summarization model and compares it against several abstractive models.

2 Related Work
The majority of previous work on domain adaptation for summary generation fails to identify
an optimal dataset for fine-tuning the language model. It also fails to adapt the structure of
the summary to that of the document, so the most important parts of the document are not
always captured in the summary.

2.1 Thematic Structure


Document-BERT is an attempt to materialize this approach. The study focuses on building three
versions of a domain-specific BERT model: (i) use BERT out of the box and fine-tune it on the task
data, (ii) use BERT, adapt the language model with domain corpora, and then fine-tune it further
on the task data, and (iii) pre-train the BERT model on domain corpora instead of generic
corpora.

2.2 Rhetorical Roles


Continuing the discussion of extractive summarization model types, unsupervised domain-dependent
models must consider, in addition to rhetorical structure and semantic roles, the creation and
assignment of a scoring system. In the ideal scenario, the summary should consist of roughly
10% introduction, 25% context, 60% case analysis, and 5% conclusion with ruling. Another
alternative is to use the traditional TF-IDF technique to rank the sentences; Saravanan et al.
use the K-Mixture Model as the deciding factor for including sentences in the summary.

3 Data and Methods


3.1 Data
We procured the data from the Indian government's G20 website. It was a PDF with a total of
214 pages, so performing summarization and translation on the whole document was computationally
difficult. We therefore took one article from the document and performed our experiments on it.

3.2 Models
This project involves using multiple models and testing and comparing their scores on this
dataset. For extractive summarization we use the BERT extractive summarizer; for abstractive
summarization we use GPT-2, T5, PEGASUS, and BART; and for translation we use
mBART-large-50-many-to-many-mmt.

The BERT extractive summarizer works by first embedding the sentences, then running a clustering
algorithm, and finally selecting the sentences closest to each cluster's centroid.

BERT MODEL
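As a minimal sketch of this embed-cluster-select pipeline (not necessarily the exact implementation behind the BERT extractive summarizer we used), the example below assumes the sentence-transformers and scikit-learn packages; the encoder name and sentence count are illustrative.

# Minimal sketch: extractive summarization by embedding sentences with a
# BERT-style encoder, clustering them, and keeping the sentences closest to
# each cluster centroid. Assumed deps: sentence-transformers, scikit-learn.
import numpy as np
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans

def extractive_summary(sentences, num_sentences=3):
    num_sentences = min(num_sentences, len(sentences))
    encoder = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative encoder
    embeddings = encoder.encode(sentences)

    # One cluster per summary sentence; each cluster acts as a "topic".
    kmeans = KMeans(n_clusters=num_sentences, n_init=10, random_state=0)
    kmeans.fit(embeddings)

    # Pick the sentence nearest to each centroid, then restore document order.
    chosen = set()
    for centroid in kmeans.cluster_centers_:
        chosen.add(int(np.argmin(np.linalg.norm(embeddings - centroid, axis=1))))
    return " ".join(sentences[i] for i in sorted(chosen))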

GPT-2 works in the following ways:

• Layered structure: GPT consists of multiple layers of the transformer model. Each layer
includes a combination of self-attention and feed-forward neural network sub-layers.
• Positional encoding: Since transformers do not inherently understand the order of tokens in a
sequence, positional encoding is added to provide information about the positions of tokens in
the input sequence.
• Attention mechanism: GPT uses a self-attention mechanism that allows the model to assign
different weights to different parts of the input sequence, enabling it to focus on relevant
information.
• Fine-tuning: After pre-training, GPT models can be fine-tuned on specific tasks with smaller
datasets to adapt to particular applications, such as language translation, summarization, or
question answering.
• Autoregressive generation: GPT is autoregressive, meaning it generates sequences one token at
a time. During inference, the model predicts the next token based on the preceding context.
• Parameter size: GPT models typically have a large number of parameters, contributing to their
ability to learn complex patterns and generate coherent and contextually relevant text.

GPT-2 MODEL
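As a hedged sketch (not necessarily the project's exact setup), GPT-2 can be prompted for zero-shot summarization by appending a "TL;DR:" marker; the checkpoint and generation settings below are illustrative.

# Hedged sketch: zero-shot GPT-2 summarization via a "TL;DR:" prompt.
# Assumes the Hugging Face transformers package; the "gpt2" checkpoint is illustrative.
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

def gpt2_summary(text, max_new_tokens=80):
    prompt = text + "\nTL;DR:"
    # GPT-2's context window is 1024 tokens, so truncate the prompt to leave
    # room for the generated summary.
    inputs = tokenizer(prompt, return_tensors="pt", truncation=True, max_length=900)
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        num_beams=4,
        no_repeat_ngram_size=3,
        pad_token_id=tokenizer.eos_token_id,
    )
    # Keep only the tokens generated after the prompt (the summary itself).
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()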

The BART-Large-CNN model is the BART model pre-trained on English and fine-tuned on the
CNN/Daily Mail dataset. It was introduced in the paper BART: Denoising Sequence-to-Sequence
Pre-training for Natural Language Generation, Translation, and Comprehension and first released
in the accompanying repository.

BART, short for Bidirectional and Auto-Regressive Transformers, is an encoder-decoder model,
which is synonymous with a sequence-to-sequence model. It employs a bidirectional approach in
the encoder, similar to how BERT works, in which the input is read from both directions. This
helps the model build context from both ends of the input text at the same time. The encoder is
bidirectional, while the decoder is auto-regressive, meaning the model predicts future tokens
based on past tokens (similar to GPT). Pre-training for this model involves corrupting the input
text with a random noising function and then learning to reconstruct the original text. The BART
model is notably effective when fine-tuned for summarization. BART-Large-CNN is a fine-tuned
version of BART used for text generation tasks such as translation and summarization.

BART MODEL
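A minimal, hedged sketch of using this checkpoint through the Hugging Face pipeline API is shown below; the length limits are illustrative and not necessarily the settings used in the project.

# Hedged sketch: abstractive summarization with the pre-trained
# facebook/bart-large-cnn checkpoint via the transformers pipeline API.
from transformers import pipeline

summarizer = pipeline("summarization", model="facebook/bart-large-cnn")

article = "..."  # placeholder: one article taken from the G20 document
result = summarizer(article, max_length=130, min_length=30, do_sample=False)
print(result[0]["summary_text"])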

PEGASUS, which stands for Pre-training with Extracted Gap-sentences for Abstractive
Summarization, is a transformer encoder-decoder model designed to improve fine-tuning
performance on abstractive summarization. During pre-training, several whole sentences are
removed from the documents and the model is tasked with recovering them. The authors of the
paper found that choosing the 'most important' sentences of the input document to mask produces
outputs closest to a summary. Based on their training results, the paper concluded that large
supervised training sets are no longer necessary, which opens up many low-cost use cases.

PEGASUS MODEL
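A hedged sketch of generating a summary with a pre-trained PEGASUS checkpoint follows; the checkpoint name (google/pegasus-cnn_dailymail) and beam settings are illustrative.

# Hedged sketch: abstractive summarization with a pre-trained PEGASUS checkpoint.
from transformers import PegasusForConditionalGeneration, PegasusTokenizer

model_name = "google/pegasus-cnn_dailymail"  # illustrative checkpoint
tokenizer = PegasusTokenizer.from_pretrained(model_name)
model = PegasusForConditionalGeneration.from_pretrained(model_name)

text = "..."  # placeholder: article text
batch = tokenizer(text, truncation=True, padding="longest", return_tensors="pt")
summary_ids = model.generate(**batch, num_beams=4, max_length=128)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))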

T5, the Text-to-Text Transfer Transformer, is a state-of-the-art pre-trained language model
based on the transformer architecture. It adopts a unified text-to-text framework that can
handle any natural language processing (NLP) task by converting both the input and the output
into natural language text.

T5 Architecture
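A hedged sketch of T5's text-to-text framing for summarization follows, using the "summarize:" task prefix; the t5-small checkpoint and generation settings are illustrative.

# Hedged sketch: summarization with T5's unified text-to-text interface,
# where the task is selected by the "summarize:" prefix.
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-small")  # illustrative checkpoint
model = T5ForConditionalGeneration.from_pretrained("t5-small")

text = "..."  # placeholder: article text
inputs = tokenizer("summarize: " + text, return_tensors="pt",
                   truncation=True, max_length=512)
summary_ids = model.generate(**inputs, num_beams=4, max_length=128, early_stopping=True)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))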

MBART MODEL

mBART is a sequence-to-sequence denoising auto-encoder pre-trained on large-scale monolingual
corpora in many languages using the BART objective. mBART is one of the first methods for
pre-training a complete sequence-to-sequence model by denoising full texts in multiple
languages, whereas previous approaches had focused only on the encoder, the decoder, or
reconstructing parts of the text.
mBART is a multilingual encoder-decoder (sequence-to-sequence) model primarily intended for
translation tasks. Because the model is multilingual, it expects sequences in a different
format: a special language-id token is added to both the source and target text.
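A hedged sketch of translating an English summary with this checkpoint is shown below; the target language (Hindi, hi_IN) is an illustrative choice and any language code supported by the model can be substituted.

# Hedged sketch: English-to-Hindi translation with mbart-large-50-many-to-many-mmt.
from transformers import MBartForConditionalGeneration, MBart50TokenizerFast

model_name = "facebook/mbart-large-50-many-to-many-mmt"
tokenizer = MBart50TokenizerFast.from_pretrained(model_name, src_lang="en_XX")
model = MBartForConditionalGeneration.from_pretrained(model_name)

summary_en = "..."  # placeholder: summary produced by the best summarization model
inputs = tokenizer(summary_en, return_tensors="pt", truncation=True)
translated_ids = model.generate(
    **inputs,
    forced_bos_token_id=tokenizer.lang_code_to_id["hi_IN"],  # target language id token
    max_length=256,
)
print(tokenizer.decode(translated_ids[0], skip_special_tokens=True))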

3.3 Problems
3.3.1 Sequence Length
In our dataset, as previously mentioned in the Data and Methods section, the documents are
extremely long, ranging from 1,500 to 2,000 words. Most transformer-based models have input
sequence limits of 512 or 1,024 tokens. This proves to be a significant problem since our
document is several times longer than the token limit.

3.3.2 Solutions
1. Split the document into multiple chunks, generate summaries separately, and then combine
them into one summary. The problem with this method is that transformers use global context
for the summarization task, so splitting into chunks contradicts the purpose of using
transformer-based models, and context between the chunks is lost. Since our summarizer is
abstractive, it also tends to generate similar summaries for consecutive chunks. We can avoid
the latter problem by using cosine similarity between the selected sentences and dropping
sentences with extremely high similarity to others, as in the sketch below.
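The following is a minimal sketch of this chunk-then-deduplicate idea, not the project's exact code; the summarize_fn helper, the embedding model name, and the 0.85 similarity threshold are assumptions.

# Hedged sketch: summarize each chunk, then drop near-duplicate sentences using
# cosine similarity between sentence embeddings. Assumed dep: sentence-transformers.
import numpy as np
from sentence_transformers import SentenceTransformer

def dedupe_sentences(sentences, threshold=0.85):
    encoder = SentenceTransformer("all-MiniLM-L6-v2")  # illustrative encoder
    emb = encoder.encode(sentences)
    emb = emb / np.linalg.norm(emb, axis=1, keepdims=True)  # dot product = cosine
    kept, kept_emb = [], []
    for sent, vec in zip(sentences, emb):
        if all(float(np.dot(vec, k)) < threshold for k in kept_emb):
            kept.append(sent)
            kept_emb.append(vec)
    return kept

def summarize_long_document(chunks, summarize_fn):
    # summarize_fn is any chunk-level summarizer (e.g., one of the models above).
    chunk_summaries = [summarize_fn(chunk) for chunk in chunks]
    sentences = [s.strip() for cs in chunk_summaries for s in cs.split(".") if s.strip()]
    return ". ".join(dedupe_sentences(sentences)) + "."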

3.4 Computation Limit


Our implementation involves fine-tuning a pre-trained model to summarize the long document
text. Fine-tuning a model for large sequence sizes is computationally expensive even for small
training and test set sizes. Even though we use a small dataset of 250 document-summary pairs,
we exceed the computation limits of online resources.

3.4.1 Solutions
1. Train on a sample for fewer epochs and decrease the size of the output summary, which allows
some online resources to train the model. This proves to be extremely difficult, and the
summaries generated by the fine-tuned models turn out to be illegible and grammatically
incorrect.
2. Use other services to train and evaluate the models, e.g., AutoTrain (Hugging Face), an
open-source online platform for training models on your own dataset.

3.5 Methodologies

Methods for extractive summarization and translation of the G20 text

We propose several methods for generating extractive and abstractive summaries of the text and
then translating them:

1. Directly use a pre-trained extractive model such as the BERT extractive summarizer.

2. Fine-tune an extractive summarizer on the Indian G20 document.

3. Fine-tune abstractive summarizers such as GPT-2, T5, PEGASUS, and BART on the dataset and
compare their similarity to a human reference summary; in our case, we use the extractive
summary as the reference.

4 Results and Discussions


4.1 ROUGE
In order to measure the degree of similarity between any two texts, we use the ROUGE score.
ROUGE, which stands for Recall-Oriented Understudy for Gisting Evaluation, is widely considered
the standard evaluation metric for summarization tasks. Below are the types of ROUGE used:

1. ROUGE-N: Counts matching n-grams between the generated summary and the gold-standard
summary. ROUGE-1 measures unigrams, i.e., single words that match between the generated
summary and the gold standard; similarly, ROUGE-2 looks at matching bigrams.

2. ROUGE-L: Counts the longest common sub-sequence shared between the generated summary and
the gold standard. The core idea is that a higher ROUGE-L score means a longer sub-sequence is
common, and hence the summary is closer to the gold standard.

ROUGE FORMULAE

ROUGE allows us to calculate three parameters, which are Recall, Precision, and F-1 score.
Recall is equal to the number of matching n-grams divided by the total number of n-grams in
the reference text. Precision is equal to the number of matching n-grams divided by the total
number of n-grams in our generated summary. The F-1 score is the simple harmonic mean of
recall and precision.
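As a small illustration (assuming the rouge-score package; the reference and candidate strings are placeholders), these three parameters can be computed as follows:

# Hedged sketch: computing ROUGE-1, ROUGE-2, and ROUGE-L precision/recall/F1
# with the rouge-score package.
from rouge_score import rouge_scorer

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)
reference = "..."  # placeholder: extractive summary used as the reference
candidate = "..."  # placeholder: summary generated by one of the models
scores = scorer.score(reference, candidate)
for name, s in scores.items():
    print(f"{name}: precision={s.precision:.3f} recall={s.recall:.3f} f1={s.fmeasure:.3f}")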

4.2 Pre-trained Model Results


ROUGE scores of individual models

ROUGE scores of the combined model

Bar chart of ROUGE scores

Looking at the chart, we can see that GPT-2 is the best-performing summarization model on our
data. We therefore take that model's summary and translate it into the desired language using
the mBART model.

5 Conclusion
In the current scenario, the Indian government system is overloaded with documents, and people
do not have time to read every document. To ease this, our pipeline provides summarization and
translation, which can affect the lives of people at a large scale.

Looking at the future scope of the project, we can see that by training the model on just a
small part of the dataset we already produced decent scores, which can be improved by
increasing the available computational power.

6 References
Code: https://www.kaggle.com/code/raviranjan7284/summarization-and-translation

Citations

1. Narayan, S., Cohen, S.B. and Lapata, M., 2018. Don't give me the details, just the summary!
Topic-aware convolutional neural networks for extreme summarization. arXiv preprint
arXiv:1808.08745.

2. Erkan, G. and Radev, D.R., 2004. LexRank: Graph-based lexical centrality as salience in
text summarization. Journal of Artificial Intelligence Research, 22, pp.457-479.

3. Narayan, S., Cohen, S.B. and Lapata, M., 2018. Ranking sentences for extractive
summarization with reinforcement learning. arXiv preprint arXiv:1802.08636.

4. Nallapati, R., Zhai, F. and Zhou, B., 2017. SummaRuNNer: A recurrent neural network based
sequence model for extractive summarization of documents. In Proceedings of the AAAI
Conference on Artificial Intelligence (Vol. 31, No. 1).

Acknowledgement
We extend our heartfelt appreciation to all individuals who contributed to the successful
culmination of this project.
Our deepest gratitude is extended to Dr. Sunil Saumya Sir, our dedicated supervisor, whose
unwavering guidance, support, and invaluable insights were instrumental throughout the project.
His expertise and encouragement significantly influenced the trajectory of our undertaking.
We would also like to express our sincere thanks to our fellow teammates who, at various
stages of the project, provided invaluable support, shared insightful ideas, and offered assistance.
Their collaboration enriched the project experience and contributed to its overall success.
This project would not have been possible without the collective effort and encouragement
received from our academic community and beyond.
Thank you to everyone who played a role, directly or indirectly, in the realization of this
project.

Declaration
We declare that this written submission represents our ideas in our own words and that, where
others' ideas or words have been included, we have adequately cited and referenced the original sources.
We also declare that we have adhered to all principles of academic honesty and integrity and have
not misrepresented or fabricated or falsified any idea/data/fact/source in our submission. We
understand that any violation of the above will be cause for disciplinary action by the Institute
and can also evoke penal action from the sources which have thus not been properly cited or
from whom proper permission has not been taken when needed.

(Signature with date)


Ravi Ranjan
Roll No: 21BDS057

(Signature with date)


Manish Kumar
Roll No: 21BDS036

(Signature with date)


Kiriti Kumar Kola
Roll No: 21BDS028

(Signature with date)


Dilli Babu N
Roll No: 21BDS042

Approval Sheet
This project report entitled (Title) by (Author Name 1), (Author Name 2), and (Author
Name 3) is approved for the degree of Bachelor of Technology in Computer Science and Engi-
neering.

Supervisors

Head of Department

(Head of Department)

Examiners

Date:
Place:

