If you belive yourself, anything is possible.
--Your friends at LectureNotes

Previous Year Exam Questions for BIG DATA - BD of 2018 - CEC by Bput Toppers

  • 2018
  • PYQ
  • Biju Patnaik University of Technology BPUT - BPUT
  • Information Technology Engineering
  • B.Tech
  • 10 Offline Downloads
  • Uploaded 1 year ago
Bput Toppers
Bput Toppers
0 User(s)
Download PDFOrder Printed Copy

Share it with your friends

Leave your Comments

Text from page-1

Registration No : Total Number of Pages : 02 B.Tech. PIT6J006 6th Semester Regular Examination 2017-18 BIG DATA ANALYTICS BRANCH : IT Time : 3 Hours Max Marks : 100 Q.CODE : C372 Answer Part-A which is compulsory and any four from Part-B. The figures in the right hand margin indicate marks. Q1. a) b) c) d) e) f) g) h) i) j) Q2. a) b) c) d) e) f) g) h) i) j) Part – A (Answer all the questions) Answer the following questions : multiple type or dash fill up type : ------------ phase of the data analytics lifecycle usually takes the longest? ------------- algorithms is most likely to use the Within Sum of Squares metric? ----------- algorithms is an unsupervised method, is descriptive (not predictive), is used to find hidden relationships in data, and is represented as rules? In the Apriori algorithm, ---------- metrics measures how much more often the left-hand-side variable and the right-hand-side variable occur together than expected if statistically independent? __________ can best be described as a programming model used to develop Hadoop-based applications that can process massive amounts of data. According to analysts, traditional IT systems can provide a foundation when they’re integrated with big data technologies like Hadoop for ------------? ------------ phase in the systems life cycle involves designing alternative systems, selecting the best system, and writing a systems design report? The type of conversion that is done by abandoning the old and starting up the new is -------------? Most NoSQL databases support automatic-------------, meaning that you get high availability and disaster recovery. ----------- stores are used to store information about networks, such as social connections. Answer the following questions : Short answer type : What are the three important goals of GINA? What is the difference between structured and unstructured data? Mention the role of data analytics in health care system. What do you mean by clustering streams? What is the difference between K-means and hierarchical clustering? Define Apriori algorithm. Mention the limitations of Apriori algorithm. Mention different ways of data discovery in data analytics. Mention 4 application areas of association rules. What is HIVE? Why we need big data in modern world? (2 x 10) (2 x 10)

Text from page-2

Q3. a) b) Q4. a) Part – B (Answer any four questions) What is Hadoop? Describe the role of hadoop in big data analysis? Also Explain core components of Hadoop? Mention the limitations of Hadoop. What is map reduce? Describe the working principle of map reduce with suitable example. (10) (5) (10) b) What do you mean by NOSQL? Describe various business drivers, data architectural patterns of NOSQL? Briefly explain How NOSQL can be used to manage BIG DATA. Explain Hadoop Ecosystem briefly. Q5. a) b) What is Big data? Explain characteristics of big data? What are the challenges of big data ecosystem? (10) (5) Q6 a) b) Define Data Analytics. Describe life cycle of data analytics briefly. Write short note on GINA? (10) (5) Q7. a) What is clustering? What are the different types of clustering? Describe different types of clustering algorithm briefly? Mention various applications of clustering. Clustering can improve supervised learning algorithms. True or False? Justify your answer with one example. (10) (10) b) What do you mean by classification? How can decision tree be used for classification? Describe decision tree algorithm and evaluate it with the help of suitable example? Define Naïve byes theorem? Explain Naïve byes classifier briefly. a) b) c) d) e) Write short notes on any THREE : Association Rules Mango DB Partitioning methods Data preparation phase of life cycle of data analytics Advantages of big data approach over traditional data approach b) Q8. a) Q9. (5) (5) (5) (5 x 3)

Lecture Notes