Invalid
Details
Assignee Russell Spitzer
Reporter Artem Soloviov
Created December 13, 2016 at 2:59 PM
Updated February 19, 2017 at 6:51 PM
Resolved December 13, 2016 at 10:29 PM
In general:
Using a two-column table with a LIMIT clause leads to an NPE in DataFrame.distinct() if the limit is less than the table size.
No such behaviour is observed with the Elasticsearch Spark connector on the same data and code (select -> distinct).
Steps to reproduce: use the code published in the GitHub repo