vendettaa (Posted February 23, 2018): I have two tables in Hive, each with billions of records and 200 columns. I want to compare each column against the corresponding column in the other table, matched on the primary key, and trigger an email containing the mismatched records. Spark code is also fine.
alooparata (Posted February 23, 2018): come in @vendetta id
chary69 (Posted February 23, 2018), replying to vendettaa: Code not the fine?
chary69 (Posted February 23, 2018), replying to alooparata: Why to come to other thread post rica why?
alooparata (Posted February 23, 2018), replying to chary69: because everything causes the cause inbuilt around the inner senses of the spoon
chary69 (Posted February 23, 2018), replying to alooparata: Why to get the spoon between us?
alooparata (Posted February 23, 2018), replying to chary69: its bhagamathis but bahubali stolen from lokayya
chary69 (Posted February 23, 2018), replying to alooparata: Whose to lokayya for Minky?
alooparata (Posted February 23, 2018), replying to chary69: minkys donkey was stolen by pinkys ponky
vendettaa (Author, Posted February 23, 2018), replying to chary69: Either one works, man. Once I figure out this piece, I have to create YAMLs and automate it. Spark or Hive is fine.
NPReddy (Posted February 24, 2018), replying to vendettaa: I'm not an expert, but I think you can use the DataFrame except API for this. Load table 1 into DataFrame 1 and table 2 into DataFrame 2, then call dataframe1.select(keyColumn).except(dataframe2.select(keyColumn)). This returns the rows of DataFrame 1 that are not present in DataFrame 2. It may not be a perfect answer, but you can adapt it to your use case.
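
The except idea in the post above can be sketched in PySpark roughly as follows. Note that except is the Scala and SQL name; in PySpark the corresponding DataFrame method is subtract. The function names, the table names db.table1 and db.table2, and the key column id are hypothetical, and spark is assumed to be an existing SparkSession with Hive support, as in the pyspark shell. Selecting only the key finds keys missing from the other table; selecting the key together with the compared columns returns rows that differ in at least one value.

```python
def keys_missing_from_right(left_df, right_df, key_columns):
    """Keys present in left_df that have no matching key in right_df."""
    return left_df.select(*key_columns).subtract(right_df.select(*key_columns))


def rows_without_exact_match(left_df, right_df, columns):
    """Rows of left_df (restricted to `columns`) with no identical row in right_df.

    Include the key columns in `columns` so each result row can be traced back.
    """
    return left_df.select(*columns).subtract(right_df.select(*columns))


# Hypothetical usage in the pyspark shell, where `spark` is predefined:
#   t1 = spark.table("db.table1")
#   t2 = spark.table("db.table2")
#   missing = keys_missing_from_right(t1, t2, ["id"])
#   changed = rows_without_exact_match(t1, t2, t1.columns)
```

subtract has EXCEPT DISTINCT semantics, so duplicate rows collapse in the result. It reports that a row differs but not which column differs.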
vendettaa (Author, Posted February 24, 2018), replying to NPReddy: OK, but how do I do this in Hive? I'm not sure whether we have commands to invoke Spark from YAML, but thanks.
NPReddy (Posted February 24, 2018), replying to vendettaa: This is not done in Hive itself. Read and process the Hive table data using Spark: write a Spark application that performs the comparison, or do it directly in the Spark shell. I'm not sure about the email part.
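
The thread leaves the email step open. One possible approach, sketched here with only the Python standard library, is to attach a capped sample of the mismatched rows as a CSV file. The function names, the addresses, the max_rows cap, and the SMTP host and port are all assumptions for illustration; the rows are assumed to have already been collected into a list of tuples by whatever job did the comparison.

```python
import csv
import io
import smtplib
from email.message import EmailMessage


def build_mismatch_email(sender, recipient, header, rows, max_rows=1000):
    """Build an email with up to max_rows mismatched rows attached as CSV."""
    attached = rows[:max_rows]  # cap the attachment; billions of rows will not fit in a mail

    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = f"Table comparison: {len(rows)} mismatched records"
    msg.set_content(
        f"{len(rows)} mismatched records found; {len(attached)} attached as CSV."
    )

    # Write header and rows into an in-memory CSV document.
    buf = io.StringIO(newline="")
    writer = csv.writer(buf)
    writer.writerow(header)
    writer.writerows(attached)
    msg.add_attachment(buf.getvalue(), subtype="csv", filename="mismatches.csv")
    return msg


def send_email(msg, host="localhost", port=25):
    """Send the message through an SMTP relay (host and port are assumptions)."""
    with smtplib.SMTP(host, port) as smtp:
        smtp.send_message(msg)
```

For large result sets it is probably more practical to write the full mismatch output to a table or file and mail only a summary with its location.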
vendettaa (Author, Posted February 24, 2018), replying to NPReddy: OK, I'm asking how to do it without Spark.
NPReddy (Posted February 24, 2018), replying to vendettaa: Not sure, buddy. Since you asked for Spark, I just tried to help. How do you write Hive queries? In a shell? I think you could create some functions to perform this, but since you are dealing with billions of records, I don't think that is recommended. Wait for the experts.
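
For the Hive-only route asked about above, one option is a plain join on the primary key with a null-safe comparison (the <=> operator) on every column. Writing 200 comparisons by hand is tedious, so the sketch below generates the query text from a column list. The function name, table names, and column names are hypothetical, and the column list is assumed to be supplied by the caller (for example, copied from DESCRIBE output).

```python
def build_mismatch_query(left_table, right_table, key_columns, compare_columns):
    """Return SQL that lists keys present in both tables where any compared
    column differs, with one 0/1 flag per compared column."""
    join_on = " AND ".join(f"a.{k} = b.{k}" for k in key_columns)
    keys = ", ".join(f"a.{k}" for k in key_columns)
    # <=> is null-safe equality: NULL <=> NULL is true, NULL <=> 1 is false.
    flags = ",\n  ".join(
        f"CASE WHEN a.{c} <=> b.{c} THEN 0 ELSE 1 END AS {c}_diff"
        for c in compare_columns
    )
    any_diff = "\n   OR ".join(f"NOT (a.{c} <=> b.{c})" for c in compare_columns)
    return (
        f"SELECT {keys},\n  {flags}\n"
        f"FROM {left_table} a\n"
        f"JOIN {right_table} b ON {join_on}\n"
        f"WHERE {any_diff}"
    )


if __name__ == "__main__":
    # Hypothetical tables and columns, for illustration only.
    print(build_mismatch_query("db.table1", "db.table2", ["id"], ["name", "amount"]))
```

The printed text can be saved to a file and run with hive -f, or passed to spark.sql(...) if Spark is available. Because this is an inner join, keys that exist in only one table are not reported and need a separate check.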