Jump to content

data engineers o paali itu randi


mettastar

Recommended Posts

meru vaade query engine (db) enti to serve users ?

high concurrency and low latency (sub second) unna db ethukuthunna..

any suggestions from your experience ?

 

TPS - 400 (400 requests per second support chesavi kaavali or close to that)

Latency - sub seconds to less than 5 secs okay

should be able to handle TBs of data 

SQL support unte manchidhi

Link to comment
Share on other sites

kaka......asalu ni req enti, nuvu nosql db kosam looking aa???

do you know CAP theorem ???? if not check it out......

 

basic architecture iythea 

raw data ---> Spark/Hadoop(etl)--->nosql db ---> UX for users to generate reports  

 

Link to comment
Share on other sites

1 minute ago, kasi said:

kaka......asalu ni req enti, nuvu nosql db kosam looking aa???

do you know CAP theorem ???? if not check it out......

 

basic architecture iythea 

raw data ---> Spark/Hadoop(etl)--->nosql db ---> UX for users to generate reports  

 

Elasticsearch vaduthunnam kaani bulk loading chesthunte sasthundi adi. As it does indexing. 

Kylin ani okati undi explore chesa kaani adi olap cube based .. but ma data emo continues updates untayi historical data kuda .. rebuilding cubes is not easy adi kuda ruled out for now.

 

Any other nosql db in specific ? NoSqls tho problem endhante non key columns meda sorting or join btw other tables not easy run time lo

Link to comment
Share on other sites

22 minutes ago, mettastar said:

Elasticsearch vaduthunnam kaani bulk loading chesthunte sasthundi adi. As it does indexing. 

Kylin ani okati undi explore chesa kaani adi olap cube based .. but ma data emo continues updates untayi historical data kuda .. rebuilding cubes is not easy adi kuda ruled out for now.

 

Any other nosql db in specific ? NoSqls tho problem endhante non key columns meda sorting or join btw other tables not easy run time lo

did you try increasing the resources.....scaling the server??? 

ES is distributed, so i think you can try increasing the sever resources 

Link to comment
Share on other sites

27 minutes ago, mettastar said:

Elasticsearch vaduthunnam kaani bulk loading chesthunte sasthundi adi. As it does indexing. 

Kylin ani okati undi explore chesa kaani adi olap cube based .. but ma data emo continues updates untayi historical data kuda .. rebuilding cubes is not easy adi kuda ruled out for now.

 

Any other nosql db in specific ? NoSqls tho problem endhante non key columns meda sorting or join btw other tables not easy run time lo

did u explore DocumentDB...

Link to comment
Share on other sites

11 minutes ago, Spartan said:

did u explore DocumentDB...

ninnane chadiva but use cases lo analytical ki vadachu ani mention cheyaledu .. bulk load ela untado chudali

Link to comment
Share on other sites

5 minutes ago, mettastar said:

ninnane chadiva but use cases lo analytical ki vadachu ani mention cheyaledu .. bulk load ela untado chudali

elastic search enduku vadutunnaru....reports kosama kibana through generate cheyadam kosam

Link to comment
Share on other sites

57 minutes ago, kasi said:

kaka......asalu ni req enti, nuvu nosql db kosam looking aa???

do you know CAP theorem ???? if not check it out......

 

basic architecture iythea 

raw data ---> Spark/Hadoop(etl)--->nosql db ---> UX for users to generate reports  

 

typical architecture.

Link to comment
Share on other sites

20 minutes ago, mettastar said:

ninnane chadiva but use cases lo analytical ki vadachu ani mention cheyaledu .. bulk load ela untado chudali

https://segment.com/blog/choosing-a-database-for-analytics/

Link to comment
Share on other sites

Apache Cassandra. A row partitioned distributed NO SQL data store.   We use it for storing inventory from thousands of locations and serve inventory lookup queries

What kind of data are you storing?

Link to comment
Share on other sites

31 minutes ago, dewarist said:

Apache Cassandra. A row partitioned distributed NO SQL data store.   We use it for storing inventory from thousands of locations and serve inventory lookup queries

What kind of data are you storing?

Sales invent forecast click stream customer behavior

Link to comment
Share on other sites

33 minutes ago, dewarist said:

Apache Cassandra. A row partitioned distributed NO SQL data store.   We use it for storing inventory from thousands of locations and serve inventory lookup queries

What kind of data are you storing?

Volume ey size lo untadi bro? And size of cluster? Query latency? Does it support ansi sql? 

Link to comment
Share on other sites

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.

Guest
Reply to this topic...

×   Pasted as rich text.   Paste as plain text instead

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.

×
×
  • Create New...