Skip to content

Conversation

@noslowerdna
Copy link
Contributor

@noslowerdna noslowerdna commented May 6, 2019

HADOOP-16294

Adding a protected-scope getter for the DistCpOptions, so that a subclass does not need to save its own copy of the inputOptions supplied to its constructor, if it wishes to override the createInputFileListing method with logic similar to the original implementation, i.e. calling CopyListing#buildListing with a path and input options.

@hadoop-yetus
Copy link

💔 -1 overall

Vote Subsystem Runtime Comment
0 reexec 58 Docker mode activated.
_ Prechecks _
+1 dupname 0 No case conflicting files found.
+1 @author 0 The patch does not contain any @author tags.
-1 test4tests 0 The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
_ trunk Compile Tests _
+1 mvninstall 1302 trunk passed
+1 compile 28 trunk passed
+1 checkstyle 21 trunk passed
+1 mvnsite 31 trunk passed
+1 shadedclient 768 branch has no errors when building and testing our client artifacts.
+1 javadoc 22 trunk passed
0 spotbugs 44 Used deprecated FindBugs config; considering switching to SpotBugs.
+1 findbugs 42 trunk passed
_ Patch Compile Tests _
-1 mvninstall 19 hadoop-distcp in the patch failed.
-1 compile 19 hadoop-distcp in the patch failed.
-1 javac 19 hadoop-distcp in the patch failed.
-0 checkstyle 14 hadoop-tools/hadoop-distcp: The patch generated 1 new + 16 unchanged - 0 fixed = 17 total (was 16)
-1 mvnsite 19 hadoop-distcp in the patch failed.
+1 whitespace 0 The patch has no whitespace issues.
+1 shadedclient 821 patch has no errors when building and testing our client artifacts.
+1 javadoc 18 the patch passed
-1 findbugs 21 hadoop-distcp in the patch failed.
_ Other Tests _
-1 unit 21 hadoop-distcp in the patch failed.
+1 asflicense 28 The patch does not generate ASF License warnings.
3348
Subsystem Report/Notes
Docker Client=17.05.0-ce Server=17.05.0-ce base: https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/artifact/out/Dockerfile
GITHUB PR #796
JIRA Issue HADOOP-16294
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle
uname Linux c058a8d2f85c 4.4.0-144-generic #170~14.04.1-Ubuntu SMP Mon Mar 18 15:02:05 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality personality/hadoop.sh
git revision trunk / 12b7059
Default Java 1.8.0_191
mvninstall https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/artifact/out/patch-mvninstall-hadoop-tools_hadoop-distcp.txt
compile https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/artifact/out/patch-compile-hadoop-tools_hadoop-distcp.txt
javac https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/artifact/out/patch-compile-hadoop-tools_hadoop-distcp.txt
checkstyle https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/artifact/out/diff-checkstyle-hadoop-tools_hadoop-distcp.txt
mvnsite https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/artifact/out/patch-mvnsite-hadoop-tools_hadoop-distcp.txt
findbugs https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/artifact/out/patch-findbugs-hadoop-tools_hadoop-distcp.txt
unit https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/artifact/out/patch-unit-hadoop-tools_hadoop-distcp.txt
Test Results https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/testReport/
Max. process+thread count 304 (vs. ulimit of 5500)
modules C: hadoop-tools/hadoop-distcp U: hadoop-tools/hadoop-distcp
Console output https://builds.apache.org/job/hadoop-multibranch/job/PR-796/1/console
versions git=2.7.4 maven=3.3.9 findbugs=3.1.0-RC1
Powered by Apache Yetus 0.10.0 http://yetus.apache.org

This message was automatically generated.

}

/**
* Returns the input options
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

needs a "." at the end or javadoc is unhappy

@steveloughran
Copy link
Contributor

-1, compile failing

[WARNING]   symbol:   variable inputOptions
  location: class DistCp

@hadoop-yetus
Copy link

💔 -1 overall

Vote Subsystem Runtime Comment
0 reexec 48 Docker mode activated.
_ Prechecks _
+1 dupname 0 No case conflicting files found.
+1 @author 0 The patch does not contain any @author tags.
-1 test4tests 0 The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
_ trunk Compile Tests _
+1 mvninstall 1172 trunk passed
+1 compile 28 trunk passed
+1 checkstyle 19 trunk passed
+1 mvnsite 31 trunk passed
+1 shadedclient 761 branch has no errors when building and testing our client artifacts.
+1 javadoc 20 trunk passed
0 spotbugs 47 Used deprecated FindBugs config; considering switching to SpotBugs.
+1 findbugs 45 trunk passed
_ Patch Compile Tests _
-1 mvninstall 21 hadoop-distcp in the patch failed.
-1 compile 19 hadoop-distcp in the patch failed.
-1 javac 19 hadoop-distcp in the patch failed.
+1 checkstyle 15 the patch passed
-1 mvnsite 21 hadoop-distcp in the patch failed.
+1 whitespace 0 The patch has no whitespace issues.
+1 shadedclient 804 patch has no errors when building and testing our client artifacts.
+1 javadoc 18 the patch passed
-1 findbugs 21 hadoop-distcp in the patch failed.
_ Other Tests _
-1 unit 21 hadoop-distcp in the patch failed.
+1 asflicense 27 The patch does not generate ASF License warnings.
3172
Subsystem Report/Notes
Docker Client=17.05.0-ce Server=17.05.0-ce base: https://builds.apache.org/job/hadoop-multibranch/job/PR-796/2/artifact/out/Dockerfile
GITHUB PR #796
JIRA Issue HADOOP-16294
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle
uname Linux a580ae1efe96 4.4.0-144-generic #170~14.04.1-Ubuntu SMP Mon Mar 18 15:02:05 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality personality/hadoop.sh
git revision trunk / 49e1292
Default Java 1.8.0_191
mvninstall https://builds.apache.org/job/hadoop-multibranch/job/PR-796/2/artifact/out/patch-mvninstall-hadoop-tools_hadoop-distcp.txt
compile https://builds.apache.org/job/hadoop-multibranch/job/PR-796/2/artifact/out/patch-compile-hadoop-tools_hadoop-distcp.txt
javac https://builds.apache.org/job/hadoop-multibranch/job/PR-796/2/artifact/out/patch-compile-hadoop-tools_hadoop-distcp.txt
mvnsite https://builds.apache.org/job/hadoop-multibranch/job/PR-796/2/artifact/out/patch-mvnsite-hadoop-tools_hadoop-distcp.txt
findbugs https://builds.apache.org/job/hadoop-multibranch/job/PR-796/2/artifact/out/patch-findbugs-hadoop-tools_hadoop-distcp.txt
unit https://builds.apache.org/job/hadoop-multibranch/job/PR-796/2/artifact/out/patch-unit-hadoop-tools_hadoop-distcp.txt
Test Results https://builds.apache.org/job/hadoop-multibranch/job/PR-796/2/testReport/
Max. process+thread count 339 (vs. ulimit of 5500)
modules C: hadoop-tools/hadoop-distcp U: hadoop-tools/hadoop-distcp
Console output https://builds.apache.org/job/hadoop-multibranch/job/PR-796/2/console
versions git=2.7.4 maven=3.3.9 findbugs=3.1.0-RC1
Powered by Apache Yetus 0.10.0 http://yetus.apache.org

This message was automatically generated.

@noslowerdna
Copy link
Contributor Author

noslowerdna commented May 7, 2019

compile failing

Fixed ( noslowerdna@0301c83 ), I didn't realize that the trunk code had changed from the version I'd been working with. We would want subclasses to have access to the DistCpContext instead since that's what CopyListing#buildListing uses now.

@hadoop-yetus
Copy link

💔 -1 overall

Vote Subsystem Runtime Comment
0 reexec 45 Docker mode activated.
_ Prechecks _
+1 dupname 0 No case conflicting files found.
+1 @author 0 The patch does not contain any @author tags.
-1 test4tests 0 The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
_ trunk Compile Tests _
+1 mvninstall 1184 trunk passed
+1 compile 36 trunk passed
+1 checkstyle 26 trunk passed
+1 mvnsite 39 trunk passed
+1 shadedclient 884 branch has no errors when building and testing our client artifacts.
+1 javadoc 26 trunk passed
0 spotbugs 61 Used deprecated FindBugs config; considering switching to SpotBugs.
+1 findbugs 57 trunk passed
_ Patch Compile Tests _
+1 mvninstall 30 the patch passed
+1 compile 27 the patch passed
+1 javac 27 the patch passed
+1 checkstyle 18 the patch passed
+1 mvnsite 31 the patch passed
+1 whitespace 0 The patch has no whitespace issues.
+1 shadedclient 937 patch has no errors when building and testing our client artifacts.
+1 javadoc 21 the patch passed
+1 findbugs 55 the patch passed
_ Other Tests _
+1 unit 898 hadoop-distcp in the patch passed.
+1 asflicense 31 The patch does not generate ASF License warnings.
4436
Subsystem Report/Notes
Docker Client=17.05.0-ce Server=17.05.0-ce base: https://builds.apache.org/job/hadoop-multibranch/job/PR-796/3/artifact/out/Dockerfile
GITHUB PR #796
JIRA Issue HADOOP-16294
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle
uname Linux 78ad66f77a26 4.4.0-144-generic #170~14.04.1-Ubuntu SMP Mon Mar 18 15:02:05 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality personality/hadoop.sh
git revision trunk / 49e1292
Default Java 1.8.0_191
Test Results https://builds.apache.org/job/hadoop-multibranch/job/PR-796/3/testReport/
Max. process+thread count 312 (vs. ulimit of 5500)
modules C: hadoop-tools/hadoop-distcp U: hadoop-tools/hadoop-distcp
Console output https://builds.apache.org/job/hadoop-multibranch/job/PR-796/3/console
versions git=2.7.4 maven=3.3.9 findbugs=3.1.0-RC1
Powered by Apache Yetus 0.10.0 http://yetus.apache.org

This message was automatically generated.

@steveloughran
Copy link
Contributor

yetus isn't reviewing this again, is it?

@steveloughran
Copy link
Contributor

...even if yetus is silent, patch LGTM. @noslowerdna once you are happy with these changes are working for what you are doing with distcp, I'm happy to merge it in

@noslowerdna
Copy link
Contributor Author

Thank you @steveloughran - I'm happy with this patch.

@steveloughran
Copy link
Contributor

+1

committed to branch-3.2+, can go earlier if you want

shanthoosh pushed a commit to shanthoosh/hadoop that referenced this pull request Oct 15, 2019
This PR makes the following changes

* Enriches StreamPartitionCountMonitor to periodically monitor input-regexes to match to actual inputs and stop the job when a new input stream is discovered.

* Add a new API to SysAdmin to allow listing of all streams, e.g., Kafka-topics. KafkaSysAdmin implementation of this uses KafkaConsumer's listTopics API. (Even if listTopics had 1 million topics with 100 bytes per topic total, temporary memory overhead will be 100 MB).

* Added config job.coordinator.monitor-input-regex.frequency.ms for the monitoring frequency, and job.coordinator.monitor-input-regex.%s for each input system. Users can then choose desired regex for each input system, e.g., job.coordinator.monitor-input-regex.kafka=test-.*.

* We can later enrich RegexTopicGen rewriter to add a monitor-input-regex config to allow periodic jonitoring

* Tested: Unit test for SPCM and tested with test jobs on local grid.

Author: Ray Matharu <[email protected]>

Reviewers: Jagadish<[email protected]>

Closes apache#796 from rmatharu/newtopic-test
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants