WAT/WET generator performance improvements #15

@sebastian-nagel

Try to improve the performance of the WAT/WET generator. The results of a profiler run (on 3abab54, using async-profiler) show that most time is spent on

  • HTML parsing (by htmlparser.org), including reading, verifying, and decompressing the WARC data
  • JSON serialization
  • regex matching

[Image: weat_prof (profiler output; attachment: interactive SVG.zip)]
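
Of the three hot spots listed above, regex matching is the easiest to illustrate in isolation. The following is only a minimal sketch, assuming that some patterns are recompiled on each call; the profile in this issue does not confirm that this is the actual cause. The class name, the pattern, and the `charsetOf` helper are hypothetical and not taken from the generator's code.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class PrecompiledRegexSketch {
    // Hypothetical pattern, compiled once and reused for every record.
    // Calling String.matches() or Pattern.compile() inside a per-record
    // loop would recompile the expression each time.
    private static final Pattern CHARSET =
            Pattern.compile("charset\\s*=\\s*\"?([\\w-]+)", Pattern.CASE_INSENSITIVE);

    /** Returns the charset named in a Content-Type value, or null if none is present. */
    static String charsetOf(String contentType) {
        Matcher m = CHARSET.matcher(contentType);
        return m.find() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        System.out.println(charsetOf("text/html; charset=UTF-8"));
    }
}
```

Whether this helps here depends on where the regex time in the profile actually goes; if the patterns are already precompiled, the cost lies in the matching itself and would need a different approach.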
