Scalable Big Data Architecture: A Practitioner's Guide to Choosing Relevant Big Data Architecture

ISBN-13: 9781484213278
Publisher: Springer-Verlag New York Inc
Author: Bahaaldine Azarmi
Publication date: 2016/01/27
Binding / Pages: Paperback / 141 pages
Dimensions: 25.4 cm × 17.8 cm × 1.3 cm (H × W × D)

Dewey classification: Data processing, Computer science

List price: NT$ 2899


Product description


Most people think that Big Data projects start directly with the deployment of large distributed clusters running heavy MapReduce jobs, whereas in reality there is no single, perfect solution for problems that involve large volumes of data.

By knowing the different Big Data integration patterns, you will understand why, most of the time, you will have to deploy a heterogeneous architecture that fulfills different needs, and where the limits of each pattern lie, so that you can choose effective alternatives.

We will go through real, concrete industry use cases that leverage these patterns, such as a REST API that requests large amounts of data stored in NoSQL databases like Couchbase and Elasticsearch. We will see how massive data processing can be done in such NoSQL databases without the need to dive deep into Big Data.

But when the volume is too high and the data structures get too complex, the pattern being employed reaches its limits, and that is when we can start thinking of delegating complex data processing jobs to, for example, a Hadoop-based Big Data architecture.

The difficulty is then to choose a relevant combination of the Big Data technologies available within the Hadoop ecosystem. We will focus on processing long jobs, architecture, stream data patterns, log analysis, and real-time analytics. Every pattern will be illustrated with practical examples that use different Apache projects such as Avro, Spark, Kafka, and so on.

Traditional Big Data infrastructures are built for digesting large amounts of data and rendering syntheses and analytics from them. This book will also help you understand why you should consider using machine learning algorithms early in the project, before being overwhelmed by the constraints that come with the high throughput of Big Data.

About the author

Bahaaldine Azarmi is the co-founder and CTO of reach five, a Social Data Marketing Platform. Bahaaldine has a strong background and expertise in REST API and Big Data architecture. Prior to founding reach five, Bahaaldine worked as a technical architect and evangelist for large software vendors such as Oracle and Talend.

He has a master’s degree in computer science from Polytech’Paris engineering school, Paris.
