Skip to content
@docling-project

Docling Project

Welcome to the Docling Project

This is the GitHub organization Docling open-source project. We like to get continuous feedback from the community: take the poll!

Docling

Docling is our main open-source package. It is a powerful library which simplifies document processing, parsing diverse formats — including advanced PDF understanding — and providing seamless integrations with the gen AI ecosystem.

We support an amazing community which helps us driving forward the adoption of Docling. Give it a try and join the community!



The key repositories of Docling are:

  • docling - The home of the main docling package.
  • docling-core - The definition of types, transforms, serializers, etc. If it has to do with the DoclingDocument you will find it here.
  • docling-parse - The backend PDF parser used by Docling.
  • docling-serve - The FastAPI wrappers for running Docling as REST API and distribute large jobs.
  • docling-ibm-models - The AI models powering Docling.
  • docling-sdg - Synthetic data generation (SDG) on documents for dataset generation for RAG, finetuning, etc.
  • docling-mcp - The definition of tools with the Model Context Protocol for document conversion, manipulation and generation agents.

LF AI & Data

Docling is hosted as a project in the LF AI & Data Foundation.

IBM ❤️ Open Source AI

The project was started by the AI for knowledge team at IBM Research Zurich.

Pinned Loading

  1. docling docling Public

    Get your documents ready for gen AI

    Python 34.4k 2.3k

  2. docling-serve docling-serve Public

    Running Docling as an API service

    Python 545 125

  3. docling-core docling-core Public

    A python library to define and validate data types in Docling.

    Python 155 70

  4. community community Public

    4

Repositories

Showing 10 of 19 repositories
  • docling Public

    Get your documents ready for gen AI

    docling-project/docling’s past year of commit activity
    Python 34,365 MIT 2,307 426 (8 issues need help) 25 Updated Jul 16, 2025
  • docling-project/docling-ibm-models’s past year of commit activity
    Python 131 MIT 38 22 10 Updated Jul 16, 2025
  • docling-core Public

    A python library to define and validate data types in Docling.

    docling-project/docling-core’s past year of commit activity
    Python 155 MIT 70 28 6 Updated Jul 16, 2025
  • docling-mcp Public

    Making docling agentic through MCP

    docling-project/docling-mcp’s past year of commit activity
    Python 126 MIT 31 14 2 Updated Jul 16, 2025
  • docling-workshops Public

    Docling workshops

    docling-project/docling-workshops’s past year of commit activity
    Jupyter Notebook 12 CC0-1.0 2 0 0 Updated Jul 16, 2025
  • docling-project/docling-jobkit’s past year of commit activity
    Python 6 MIT 2 6 2 Updated Jul 15, 2025
  • docling-serve Public

    Running Docling as an API service

    docling-project/docling-serve’s past year of commit activity
    Python 545 MIT 125 53 5 Updated Jul 15, 2025
  • docling-eval Public

    Evaluation framework for document processing models and services.

    docling-project/docling-eval’s past year of commit activity
    Python 24 MIT 7 7 7 Updated Jul 15, 2025
  • docling-project/docling-operator’s past year of commit activity
    Go 10 Apache-2.0 9 3 2 Updated Jul 14, 2025
  • community Public
    docling-project/community’s past year of commit activity
    4 MIT 0 0 0 Updated Jul 14, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy