DVT output is not proper and giving ambiguous output with --filters, --use-random-row and --random-row-batch-size options #552

kanhaPrayas · 2022-08-05T09:10:51Z

We are running DVT with filters, --random-row-batch-size and --use-random-row(for sampling) options. The source and target tables both are Teradata. The DVT execution is going thru properly. However the output generated is not correct and is ambiguous.
We are trying to run the DVT with 10,000 batch size, but the output being generated is only for very handful rows(3/5/7).

nehanene15 · 2022-08-08T19:17:40Z

Can you provide an example command and an example of the output generated? Does this occur for both text and BQ result handler? Is the query generated correct?

nehanene15 · 2022-09-07T16:31:02Z

The issue here is that DVT currently treat filters and random row mutually exclusively, when in fact we need to apply the filter inside the random row. I.e. when getting the X IDs randomly, we need to apply the filter 'timestamp > Y'.

nehanene15 added the priority: p0 Highest priority. Critical issue. Will be fixed prior to next release. label Sep 1, 2022

nehanene15 self-assigned this Sep 1, 2022

nehanene15 added the type: bug Error or flaw in code with unintended results or allowing sub-optimal usage patterns. label Sep 1, 2022

nehanene15 assigned kanhaPrayas and unassigned nehanene15 Sep 7, 2022

nehanene15 mentioned this issue Sep 8, 2022

fix: random rows with filter option #582

Merged

kanhaPrayas closed this as completed in #582 Sep 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DVT output is not proper and giving ambiguous output with --filters, --use-random-row and --random-row-batch-size options #552

DVT output is not proper and giving ambiguous output with --filters, --use-random-row and --random-row-batch-size options #552

kanhaPrayas commented Aug 5, 2022

nehanene15 commented Aug 8, 2022

nehanene15 commented Sep 7, 2022

DVT output is not proper and giving ambiguous output with --filters, --use-random-row and --random-row-batch-size options #552

DVT output is not proper and giving ambiguous output with --filters, --use-random-row and --random-row-batch-size options #552

Comments

kanhaPrayas commented Aug 5, 2022

nehanene15 commented Aug 8, 2022

nehanene15 commented Sep 7, 2022