無料のKindleアプリをダウンロードして、スマートフォン、タブレット、またはコンピューターで今すぐKindle本を読むことができます。Kindleデバイスは必要ありません。
ウェブ版Kindleなら、お使いのブラウザですぐにお読みいただけます。
携帯電話のカメラを使用する - 以下のコードをスキャンし、Kindleアプリをダウンロードしてください。
Agile Data Science: Building Data Analytics Applications with Hadoop ペーパーバック – 2013/11/5
Mining big data requires a deep investment in people and time. How can you be sure you’re building the right models? With this hands-on book, you’ll learn a flexible toolset and methodology for building effective analytics applications with Hadoop.
Using lightweight tools such as Python, Apache Pig, and the D3.js library, your team will create an agile environment for exploring data, starting with an example application to mine your own email inboxes. You’ll learn an iterative approach that enables you to quickly change the kind of analysis you’re doing, depending on what the data is telling you. All example code in this book is available as working Heroku apps.
- Create analytics applications by using the agile big data development methodology
- Build value from your data in a series of agile sprints, using the data-value stack
- Gain insight by using several data structures to extract multiple features from a single dataset
- Visualize data with charts, and expose different aspects through interactive reports
- Use historical data to predict the future, and translate predictions into action
- Get feedback from users after each sprint to keep your project on track
- 本の長さ178ページ
- 言語英語
- 出版社O'Reilly Media
- 発売日2013/11/5
- 寸法17.78 x 0.97 x 23.34 cm
- ISBN-101449326269
- ISBN-13978-1449326265
商品の説明
著者について
Russell Jurney cut his data teeth in casino gaming, building web apps to analyze the performance of slot machines in the US and Mexico. After dabbling in entrepreneurship, interactive media and journalism, he moved to silicon valley to build analytics applications at scale at Ning and LinkedIn. He lives on the ocean in Pacifica, California with his wife Kate and two fuzzy dogs.
登録情報
- 出版社 : O'Reilly Media; 第1版 (2013/11/5)
- 発売日 : 2013/11/5
- 言語 : 英語
- ペーパーバック : 178ページ
- ISBN-10 : 1449326269
- ISBN-13 : 978-1449326265
- 寸法 : 17.78 x 0.97 x 23.34 cm
- Amazon 売れ筋ランキング: - 864,137位洋書 (洋書の売れ筋ランキングを見る)
- - 1,253位Data Mining
- - 1,494位Computer Modeling & Simulation
- - 1,497位Information Theory
- カスタマーレビュー:
著者について
著者の本をもっと発見したり、よく似た著者を見つけたり、著者のブログを読んだりしましょう
他の国からのトップレビュー
My reason for the poor review is not reflective of the content, just the disappointment in that it's only really a booklet and that you're paying 50p per page.
The book does a great job of summarizing the agility needed and the tools used, but the code to implement these tools is lacking. I expected the code in the book to be outdated even though the book was only published a year ago. There is a github repository for the code but it is incomplete.
One benefit of the code not working or the instructions being vague is having to debug it yourself or search for solutions. This is a great learning tool. I don't think this is a benefit the author would like and should put some more time in the second iteration of making the instructions clearer. Since the book has only 164 pages there is significant room for growth.
Jurney nails it! He offers tools and methodologies adapted to common data science workflows and their associated pitfalls wherein we spend 85% of our time plumbing and 15% of our time integrating some off-the-shelf algorithm to find deep insight.
So, for new data scientists or 3rd-4th year grad students who have balanced their Twitter API hack with NSF grant deadlines, this is ABSOLUTELY REQUIRED READING.
I"m half way through the book, have been practicing Agile development techniques for several years, and I am not quite sure what in particular makes this book about Data Science 'Agile' based.
One thing that he does nicely is explain the Pig code he uses, but I can't use those programs because the Python programs that gather the data that feed Pig will not compile, even after I de-bugged his code for several hours. (Example: the author made reference to an RFC inline in the Python code that would have NEVER compiled. NEVER. Line 11 gmail.py from call to email utilitiies)