Skip to content

Conversation

@romainx
Copy link
Collaborator

@romainx romainx commented Dec 13, 2020

Hello,

Fixes #1168.
It consists in applying the fix described in the Spark downloading instructions:

For Java 11, -Dio.netty.tryReflectionSetAccessible=true is required additionally for Apache Arrow library. This prevents java.lang.UnsupportedOperationException: sun.misc.Unsafe or java.nio.DirectByteBuffer.(long, int) not available when Apache Arrow uses Netty internally.

The only problem of this solution is how it fits with other versions of Spark / Java that can be changed at build time through ARGS. However I have tested with Java 8 and tests are passing, Spark 2.x does not work with Python 3.8, see apache/spark#26194, so I'm not sure it's worth going further on this compatibility point.

I've added a specific test to ensure that the issue is fixed.
Best

@romainx romainx added the type:Bug A problem with the definition of one of the docker images maintained here label Dec 14, 2020
@romainx romainx requested a review from parente December 19, 2020 04:11
@romainx
Copy link
Collaborator Author

romainx commented Dec 26, 2020

Hello,

I'm merging it since it's a bug fix. We will rollback in case of problem.

Best.

@romainx romainx merged commit d113a60 into jupyter:master Dec 26, 2020
@romainx romainx deleted the fix_spark_java11 branch March 21, 2021 19:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

type:Bug A problem with the definition of one of the docker images maintained here

Projects

None yet

Development

Successfully merging this pull request may close these issues.

java.lang.UnsupportedOperationException using pandas in Spark

1 participant