
Aditya Shah

+1 (540) 824 9021 | aditya.shahh3@gmail.com | LinkedIn: aditya-shahh | Google Scholar


Summary
• I work closely with the leadership and research teams to build large-scale enterprise GenAI solutions.
• My core responsibilities include:
◦ Implementing Large Language Model architectures (Llama2, Mixtral, etc.) and performing domain-adaptive pre-training on
multi-GPU systems using DeepSpeed / Megatron.
◦ Synthesizing domain-specific enterprise data into the required prompt-instruction pairs and performing instruction
fine-tuning using DPO / RLHF.
◦ Developing a solid understanding of novel architectural innovations (KV cache, flash attention, MQA, etc.) and
optimization techniques (quantization, FSDP, etc.), and implementing newer research methods.
• Key Expertise:
◦ Tools/Tech: Python, PyTorch, DeepSpeed, Megatron-LM, Accelerate, GPU, CUDA, Docker, NumPy, Pandas, SQL.
◦ Deep Learning: Pretraining & Fine-tuning Large Language Models (LLMs), RLHF, DPO, FSDP, RAG, Vector DB.

Work Experience
Capital One McLean, USA

Machine Learning Scientist - AI Foundations Jun 2023 - Present
◦ Built in-house models with Llama2 and Mixtral, loaded checkpoint weights, and conducted further pre-training on
enterprise data (causal language modeling) using Megatron and FSDP on multiple GPUs.
◦ Developed prompt-instruction dataset from enterprise data and fine-tuned these base models using DPO/RLHF to align
them for chat-based use cases.
◦ Implemented various optimization techniques such as KV cache, reduced precision, Multi-Query Attention, and Rotary
Position Embeddings to optimize fine-tuning and inference pipelines.
◦ Building domain-specific LLM agents using RAG and a vector DB to provide AI-based virtual assistance on financial
legalities and to mitigate risk.
◦ Delivered keynotes and training sessions on Generative AI and NLP, demonstrating expertise and leadership in
upskilling teams.

Google Seattle, USA

Research Scientist Intern - LLMs Sep 2022 - Dec 2022
◦ Worked with DeepMind to integrate soft prompt parameters and adapters in a Multimodal Large Language Model
(MLLM) for Document Extraction.
◦ Developed an efficient optimization pipeline and performed parameter-efficient prefix fine-tuning on TPUs to
extract data from invoice documents.
◦ Enhanced the model’s adaptability and robustness under sequential uptraining, reducing catastrophic forgetting by 14%.

Capital One McLean, USA

Data Science Intern - NLP Jun 2022 - Aug 2022
◦ Fine-tuned transformer-based language models (RoBERTa, XLNet, T5) on enterprise-wide call transcript data to extract
relevant knowledge, identify entities, and summarize transcripts.
◦ Improved customer request fulfillment and agent performance through coreference resolution, eliminating 70% of
false positives at 94% accuracy.

Indian Institute of Technology (IIT) Indore, India

Research Scientist - Machine Learning Sep 2020 - Aug 2021
◦ Developed a novel multimodal neural network architecture for sarcasm detection that outperformed existing benchmarks
by 6.14% F1. Research paper accepted at ICONIP 2021.
◦ Proposed an efficient self-attention based model to capture incongruity for code-mixed sarcasm detection. Achieved an
F1 score competitive with multilingual models while training 10x faster with a lower memory footprint.
Research paper accepted at ICON (ACL 2021).

Saarthi.ai Bangalore, India

Machine Learning Engineer Jul 2020 - Oct 2020
◦ Conducted applied research on ASR and developed a deep learning model based on BiLSTM and 1-D CNN for gender
identification from audio data.
◦ Achieved a test accuracy of 96%, a 15% improvement over the previous approach. Also worked on age identification
and specific keyword detection from real-time audio input.

Selected Publications
• A. Shah, A. Jain, S. Thapa, and L. Huang, “ADEPT: Adapter-based Efficient Prompt Tuning Approach for Language
Models”, The 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023 Paper

• B. Yao*, A. Shah*, L. Sun, and L. Huang, “End-to-End Multimodal Fact-Checking and Explanation Generation: A
Challenging Dataset and Models”, International ACM SIGIR Conference on Research and Development in
Information Retrieval (SIGIR), 2023 Paper (Best Paper Honorable Mention)

• S. Thapa*, A. Shah*, F. Jafri, U. Naseem, and I. Razzak, “A Multi-Modal Dataset for Hate Speech Detection on Social
Media: Case-study of Russia-Ukraine Conflict”, Conference on Empirical Methods in Natural Language Processing
(EMNLP), 2022. Paper

• A. Shah and C. Maurya, “How effective is incongruity? Implications for code-mixed sarcasm detection”, Proceedings of
the 18th International Conference on Natural Language Processing (ICON, ACL), 2021. Code Paper

• S. Gupta, A. Shah, M. Shah, L. Syiemlieh, and C. Maurya, “FiLMing Multimodal Sarcasm Detection with Attention”,
Proceedings of the 28th International Conference on Neural Information Processing (ICONIP), 2021. Code Paper

• L. Kurup, M. Narvekar, R. Sarvaiya, and A. Shah, “Evolution of Neural Text Generation: A Comparative Analysis”,
Advances in Intelligent Systems and Computing, Springer (IC4S), 2020. Paper

Skills Summary
• Libraries and Technologies: PyTorch, DeepSpeed, Megatron-LM, Accelerate, GPU, CUDA, NumPy, Pandas, SpaCy
• Languages & Frameworks: Python, C++, SQL, MongoDB, Flask, Docker, Kubernetes, Spark

Education
Virginia Tech Blacksburg, USA

Master of Science in Computer Science - Research 2021 - 2023
Thesis: NLP-based Episodic Future Thinking (EFT), funded by the NIH.

Dwarkadas J. Sanghvi College of Engineering Mumbai, India

Bachelor of Science in Computer Science 2016 - 2020

Academic Projects
• Code Interpretability on transformer models using SHAP: Conducted a research study analyzing code
interpretability using SHAP values and logit manipulation for CodeBERT and GraphCodeBERT models. Code

• Adaptive pooling based ELECTRA model for Multi-Label Relation Classification: Proposed an adaptive-pooling
method on top of the ELECTRA model (AdaElectra) for multi-label relation classification, achieving an F1 score of 0.88 on the
NYT29 dataset. Code

• Weighted Contextual N-gram method for evaluation of Text Summarization: Fine-tuned a T5 model on the Extreme
Summarization (XSum) dataset and proposed the Weighted Contextual N-gram (WCN) method, an alternative metric for
evaluating text generation. Code

• Supervised Text Generation using GPT-2, BiLSTM, and GloVe Embeddings: Fine-tuned a GPT-2 model on
wikisent data to generate context-dependent text samples. Developed a BiLSTM with GloVe embeddings and an N-gram model to
generate text with 90% test accuracy. Code

• Food-101 Challenge by ETH Zurich: Designed a neural network model on top of the Xception network and fine-tuned it to
achieve a state-of-the-art result on the challenging Food-101 dataset with a test accuracy of 87%. Code

Honors and Awards

• Received the “Best Paper Honorable Mention” for the work on Multimodal Fact-Checking at SIGIR 2023.
• Served as a Reviewer for: NAACL SRW 2023, ICON 2023, EMNLP 2022, COLING 2022, ICON - ACL 2021.
• Selected for AI fellowship program, Fellowship.ai, May 2020.
• Awarded “Best Research Project” at HaXplore, IIT BHU Machine Learning hackathon, 2019.
• Received “Innovative Research Project” award in ‘CodeShastra Intercollege Hackathon’, 2019.
• Served as “Co-Technical Head” for ACM, 2017-18. Mentored a team of 10 students in Software Development and ML.
• Awarded the “Google India Scholarship” in Android Application Development, 2017.
