Jump to content

Big Data Experts raavali please


kranthi111983

Recommended Posts

Please Big Data experts vachi koncham help cheyyandi

I'm a J2EE developer with 5 years original experience. Thinking of jumping into Big Data. But Big data lo ekkada start avvali, emi nerchukovali emi artham kaavatledhu. I have been listening to lot of Big Data technologies/tools but ekkada edhi fit avutadho teliyatledhu. Spring, Hibernate, Struts lo work chesina. I know where they all fit in big scope of applications but ee Big Data artham kaavatledhu. Please vachin koncham aa different technologies/terms gurinchi brief ga cheppandi.

Naa goal frst programming loki poyyi eventually Data Scientist kaavali ani. I think I'm good at analyzing data(based on my experience and my interest & strengths). So daaniki nenemi nerchukovali ? Asalu ee Hadoop, Hive, Pig, Spark, Map Reduce, Cassandra, Tableau, Splunk etc ante endhi ? Ivvi Big Data framework lo ekkada fit avutaayi, emi chestayi ?

Link to comment
Share on other sites

Hadoop is a framework to process big data

Hive - similar to sql, you can query data 

pig - another way of querying data

map reduce - Java lo code rasthavu for Maper and reducer. 

Spark - pinavi anni every operation need disk read whereas spark framework reads from disk and does computation in memory. So basically faster. You use Scala, Python , Java etc for coding

Casandra , Impala - third party databases. Hbase is Hadoop DB.

tableau - process chesina data no visualize cheyadaniki you use reporting tools like tableau, qlikview ( for end users)

splunk - no idea 

 

 

 

 

  • Upvote 1
Link to comment
Share on other sites

33 minutes ago, President said:

Hadoop is a framework to process big data

Hive - similar to sql, you can query data 

pig - another way of querying data

map reduce - Java lo code rasthavu for Maper and reducer. 

Spark - pinavi anni every operation need disk read whereas spark framework reads from disk and does computation in memory. So basically faster. You use Scala, Python , Java etc for coding

Casandra , Impala - third party databases. Hbase is Hadoop DB.

tableau - process chesina data no visualize cheyadaniki you use reporting tools like tableau, qlikview ( for end users)

splunk - no idea 

 

 

 

 

Thanks Bhayya

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...