Skip to content

parallel_coordinates rendering problem with large data #2124

Closed
@Rabeez

Description

@Rabeez

I have a reasonably large data frame with 165,693 rows. I made a parallel_coordinates plot with 4 variables and the output looks strange as if it's trying to plot all the lines but couldn't finish. The lines between the second and third variable don't go all the way.
Screenshot (5)

When I try to reorder by moving the second axis, the plot 'refreshes' and removes the halfway drawn lines:
Screenshot (6)

To explain a bit further. The first 3 variables are actually categorical and I've encoded them to integers to plot using parallel coordinates which is why the lines are clustered around certain points on the first 3 axes.

Also, I've had to hide the tick and axis labels because of reasons 🙄 , hopefully this can be debugged even with my horrible paint skills

EDIT: Tried with first 25,000 rows of the df only, got similar result.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions