Skip to content
GitLab
Projects Groups Topics Snippets
  • /
  • Help
    • Help
    • Support
    • Community forum
    • Submit feedback
  • Register
  • Sign in
  • P parse-pagexml
  • Project information
    • Project information
    • Activity
    • Labels
    • Members
  • Repository
    • Repository
    • Files
    • Commits
    • Branches
    • Tags
    • Contributor statistics
    • Graph
    • Compare revisions
  • Issues 1
    • Issues 1
    • List
    • Boards
    • Service Desk
    • Milestones
  • Merge requests 0
    • Merge requests 0
  • CI/CD
    • CI/CD
    • Pipelines
    • Jobs
    • Schedules
  • Deployments
    • Deployments
    • Environments
    • Releases
  • Packages and registries
    • Packages and registries
    • Package Registry
    • Container Registry
    • Terraform modules
  • Monitor
    • Monitor
    • Incidents
  • Analytics
    • Analytics
    • Value stream
    • CI/CD
    • Repository
  • Wiki
    • Wiki
  • Snippets
    • Snippets
  • Activity
  • Graph
  • Create a new issue
  • Jobs
  • Commits
  • Issue Boards
Collapse sidebar
  • cssh
  • releases
  • parse-pagexml
  • Merge requests
  • !2
You need to sign in or sign up before continuing.

fix: Enhance robustness, fix metadata handling, and update YOLO documentation

  • Review changes

  • Download
  • Patches
  • Plain diff
Merged LAURA SERVAT BACH requested to merge fix/dataset-compatibility into main Dec 30, 2025
  • Overview 1
  • Commits 4
  • Pipelines 0
  • Changes 5

This MR addresses several robustness issues in the parse-pagexml library, specifically regarding lines without coordinates and metadata processing. It also ensures data splits are automatically generated if missing and provides comprehensive documentation for the YOLO dataset generation feature.

Assignee
Assign to
Reviewers
Request review from
Time tracking
Source branch: fix/dataset-compatibility