Skip to content

Conversation

@kayousterhout
Copy link
Contributor

This data structure was misused and, as a result, later renamed to an incorrect name.

This data structure seems to have gotten into this tangled state as a result of @henrydavidge using the stageID instead of the job Id to index into it and later @andrewor14 renaming the data structure to reflect this misunderstanding.

This patch renames it and removes an incorrect indexing into it. The incorrect indexing into it meant that the code added by @henrydavidge to warn when a task size is too large (added here 5757993) was not always executed; this commit fixes that.

This data structure was misnamed and, as a result, misused.
@AmplabJenkins
Copy link

Merged build triggered.

@AmplabJenkins
Copy link

Merged build started.

@rxin
Copy link
Contributor

rxin commented Apr 2, 2014

lgtm

1 similar comment
@mridulm
Copy link
Contributor

mridulm commented Apr 2, 2014

lgtm

@AmplabJenkins
Copy link

Merged build finished. All automated tests passed.

@AmplabJenkins
Copy link

All automated tests passed.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13675/

@andrewor14
Copy link
Contributor

good catch

@pwendell
Copy link
Contributor

pwendell commented Apr 2, 2014

Thanks, merged.

@markhamstra
Copy link
Contributor

Yes, good catch. This isn't the first confusion/conflation of jobId and stageId within the DAGScheduler and TaskManager, so it is something to keep a sharp eye on. This renaming should make future such mistakes less likely, so it LGTMT.

@asfgit asfgit closed this in 11973a7 Apr 2, 2014
pdeyhim pushed a commit to pdeyhim/spark-1 that referenced this pull request Jun 25, 2014
This data structure was misused and, as a result, later renamed to an incorrect name.

This data structure seems to have gotten into this tangled state as a result of @henrydavidge using the stageID instead of the job Id to index into it and later @andrewor14 renaming the data structure to reflect this misunderstanding.

This patch renames it and removes an incorrect indexing into it.  The incorrect indexing into it meant that the code added by @henrydavidge to warn when a task size is too large (added here apache@5757993) was not always executed; this commit fixes that.

Author: Kay Ousterhout <[email protected]>

Closes apache#301 from kayousterhout/fixCancellation and squashes the following commits:

bd3d3a4 [Kay Ousterhout] Renamed stageIdToActiveJob to jobIdToActiveJob.
lins05 pushed a commit to lins05/spark that referenced this pull request May 30, 2017
* Add proxy configuration to retrofit clients.

* Add logging
erikerlandson pushed a commit to erikerlandson/spark that referenced this pull request Jul 28, 2017
* Add proxy configuration to retrofit clients.

* Add logging
gatesn pushed a commit to gatesn/spark that referenced this pull request Mar 14, 2018
holdenk pushed a commit to holdenk/spark that referenced this pull request Sep 12, 2019
arjunshroff pushed a commit to arjunshroff/spark that referenced this pull request Nov 24, 2020
turboFei pushed a commit to turboFei/spark that referenced this pull request Nov 6, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants