[SPARK-28390][SQL][PYTHON][TESTS] [FOLLOW-UP] Update the TODO with actual blocking JIRA IDs #25415

shivusondur · 2019-08-12T06:12:13Z

What changes were proposed in this pull request?

only todo message updated. Need to add udf() for GroupBy Tests, after resolving following jira
[SPARK-28386] and [SPARK-26741]

How was this patch tested?

NA, only TODO message updated.

shivusondur · 2019-08-12T06:14:18Z

@HyukjinKwon
plz check
or we can wait for corresponding jira([SPARK-28386] and [SPARK-26741]) to handle?

dongjoon-hyun · 2019-08-12T06:17:52Z

sql/core/src/test/resources/sql-tests/inputs/udf/pgSQL/udf-select_having.sql

 --
 -- This test file was converted from inputs/pgSQL/select_having.sql
-- TODO: We should add UDFs in GROUP BY clause when [SPARK-28445] is resolved.
+-- TODO: We should add UDFs in GROUP BY clause when [SPARK-28386] and [SPARK-26741] is resolved.


SPARK-28445 was wrong from the beginning, @shivusondur ?

@dongjoon-hyun
After resolving the SPARK-28445 also, test were failing and found [SPARK-28386] and [SPARK-26741] are blocking it.

for furher details follow #25215 (comment)

HyukjinKwon · 2019-08-12T08:20:07Z

sql/core/src/test/resources/sql-tests/inputs/udf/pgSQL/udf-select_having.sql

 --
 -- This test file was converted from inputs/pgSQL/select_having.sql
-- TODO: We should add UDFs in GROUP BY clause when [SPARK-28445] is resolved.
+-- TODO: We should add UDFs in GROUP BY clause when [SPARK-28386] and [SPARK-26741] is resolved.


I am a bit lost about this or I forget something.
Can't we add UDF in group-by clause (resolved in SPARK-28445)?

@HyukjinKwon
From this
#25215 (comment)
I thought I need to update todo with blocking jira numbers

Ah, right. I forgot. Can we enable all other tests with UDF in group-by and comment out the test?

-- !query 11 SELECT udf(b), udf(c) FROM test_having GROUP BY udf(b), udf(c) HAVING udf(count(*)) = 1 ORDER BY udf(b), udf(c) -- !query 11 schema struct<> -- !query 11 output org.apache.spark.sql.AnalysisException cannot resolve 'b' given input columns: [CAST(udf(cast(b as string)) AS INT), CAST(udf(cast(c as string)) AS STRING)]; line 2 pos 63

I guess we can still add some more tests?

dongjoon-hyun · 2019-08-12T09:26:57Z

BTW, @shivusondur . The follow-up PR had better have its own PR title. The current one seems to be copied from the original PR.

dongjoon-hyun · 2019-08-14T16:22:07Z

Usually, it's not worth to change it immediately because it will be resolved in 3.0.0 timeframe. But, I'm okay to keep it up-to-date, too.
I'll leave this PR to @HyukjinKwon .

HyukjinKwon · 2019-08-14T21:32:18Z

Yea that's fine to update comments. But @shivusondur can you confirm if you are unable to fix any test or some tests to have use in GROUP BY clause due to both JIRAs? If you can, let's add some tests and only comment out the other tests not working by both JIRAs.

SparkQA · 2019-08-15T14:25:07Z

Test build #4829 has finished for PR 25415 at commit dd41b26.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

shivusondur · 2019-08-18T16:19:45Z

Yea that's fine to update comments. But @shivusondur can you confirm if you are unable to fix any test or some tests to have used in GROUP BY clause due to both JIRAs? If you can, let's add some tests and only comment out the other tests not working by both JIRAs.

@HyukjinKwon

There are 3 instances of groupby test in udf-select_having.sql, all 3 are not working due to the same reason.
Originally we have copied this "udf-select_having.sql" from "select_having.sql", so we are maintaining the same tests as original, (#25161 (comment))
if the extra tests we can add in new file.

HyukjinKwon · 2019-08-19T00:17:21Z

retest this please

SparkQA · 2019-08-19T03:56:31Z

Test build #109304 has finished for PR 25415 at commit dd41b26.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

HyukjinKwon · 2019-08-19T04:01:06Z

Merged to master.

Only TODO message updated

dd41b26

dongjoon-hyun reviewed Aug 12, 2019

View reviewed changes

dongjoon-hyun added PYSPARK SQL TESTS labels Aug 12, 2019

HyukjinKwon reviewed Aug 12, 2019

View reviewed changes

shivusondur changed the title ~~[SPARK-28390][SQL][PYTHON][TESTS] [FOLLOW-UP]Convert and port 'pgSQL/select_having.sql' into UDF test base~~ [SPARK-28390][SQL][PYTHON][TESTS] [FOLLOW-UP] Update the TODO with actual blocking JIRA IDs Aug 13, 2019

HyukjinKwon approved these changes Aug 19, 2019

View reviewed changes

HyukjinKwon closed this in c96b615 Aug 19, 2019

[SPARK-28390][SQL][PYTHON][TESTS] [FOLLOW-UP] Update the TODO with actual blocking JIRA IDs #25415

[SPARK-28390][SQL][PYTHON][TESTS] [FOLLOW-UP] Update the TODO with actual blocking JIRA IDs #25415

Uh oh!

Conversation

shivusondur commented Aug 12, 2019

What changes were proposed in this pull request?

How was this patch tested?

Uh oh!

shivusondur commented Aug 12, 2019

Uh oh!

dongjoon-hyun Aug 12, 2019

Choose a reason for hiding this comment

Uh oh!

shivusondur Aug 12, 2019

Choose a reason for hiding this comment

Uh oh!

HyukjinKwon Aug 12, 2019

Choose a reason for hiding this comment

Uh oh!

shivusondur Aug 12, 2019

Choose a reason for hiding this comment

Uh oh!

HyukjinKwon Aug 13, 2019

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun commented Aug 12, 2019

Uh oh!

dongjoon-hyun commented Aug 14, 2019

Uh oh!

HyukjinKwon commented Aug 14, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SparkQA commented Aug 15, 2019

Uh oh!

shivusondur commented Aug 18, 2019

Uh oh!

HyukjinKwon commented Aug 19, 2019

Uh oh!

SparkQA commented Aug 19, 2019

Uh oh!

HyukjinKwon commented Aug 19, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

HyukjinKwon commented Aug 14, 2019 •

edited

Loading