This repository contains materials for the Advanced Natural Language Processing course taught at Barcelona School of Economics.
This course navigates the evolution of Natural Language Processing (NLP) from foundational techniques to advanced concepts like Large Language Models and ChatGPT. It begins with core principles such as TF-IDF and word embeddings, advancing through deep learning innovations like LSTM and BERT.
The course is structured into three main parts:
- Good old fashioned NLP (Sessions 1-4)
- Almost part of good old fashioned NLP (Sessions 5-8)
- LLMs, Agents & Others (Sessions 9 & 10)
- docs/: Course documentation and lecture notes
- pdfs/: PDF versions of lecture slides
- chapter*/: Content for each session
- notebooks/: Jupyter notebooks for hands-on exercises
- Python 3.11 or higher
- Poetry (for dependency management)
- Clone the repository
git clone https://github.com/agombert/AdvancedNLPClasses.git
cd AdvancedNLPClasses
- Install dependencies with Poetry
# Install Poetry if you don't have it
curl -sSL https://install.python-poetry.org | python3 -
# Install project dependencies
poetry install
- Set up notebook environment
# Download necessary models and datasets
poetry run setup-notebooks
- Start the documentation server
poetry run mkdocs serve
- Launch Jupyter for notebooks
poetry run jupyter notebook
- Documentation: Visit the course website for comprehensive materials
- Notebooks: Explore the notebooks directory for hands-on exercises
- Slides: Download lecture slides from the pdfs directory
If you find errors or have suggestions for improving the course materials, please open an issue or submit a pull request.