Error loading page.
Try refreshing the page. If that doesn't work, there may be a network issue, and you can use our self test page to see what's preventing the page from loading.
Learn more about possible network issues or contact support for more help.

Bentley University

Search

Search

Browse menu

×

More titles and copies may be available to you. Sign in to see the full collection.

Title details for Programming Hive by Edward Capriolo - Available

Programming Hive

by Edward Capriolo
Dean Wampler

ebook

Read a sample Read a sample

Description
Creators
Details

Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop's data warehouse infrastructure. You'll quickly learn how to use Hive's SQL dialect—HiveQL—to summarize, query, and analyze large datasets stored in Hadoop's distributed filesystem.

This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You'll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data.

Use Hive to create, alter, and drop databases, tables, views, functions, and indexes

Customize data formats and storage options, from files to external databases

Load and extract data from tables—and use queries, grouping, filtering, joining, and other conventional query methods

Gain best practices for creating user defined functions (UDFs)

Learn Hive patterns you should use and anti-patterns you should avoid

Integrate Hive with other data processing programs

Use storage handlers for NoSQL databases and other datastores

Learn the pros and cons of running Hive on Amazon's Elastic MapReduce

Expand title description text

Edward Capriolo - Author
Dean Wampler - Author
Jason Rutherglen - Author

Publisher: O'Reilly Media

Kindle Book

Release date: September 18, 2012

OverDrive Read

ISBN: 9781449326975
File size: 2823 KB
Release date: September 18, 2012

EPUB ebook

ISBN: 9781449326975
File size: 2821 KB
Release date: September 18, 2012

Formats

Kindle Book
OverDrive Read
EPUB ebook

subjects

Computer Technology Nonfiction

Languages

English

Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop's data warehouse infrastructure. You'll quickly learn how to use Hive's SQL dialect—HiveQL—to summarize, query, and analyze large datasets stored in Hadoop's distributed filesystem.

This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You'll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data.

Use Hive to create, alter, and drop databases, tables, views, functions, and indexes

Customize data formats and storage options, from files to external databases

Load and extract data from tables—and use queries, grouping, filtering, joining, and other conventional query methods

Gain best practices for creating user defined functions (UDFs)

Learn Hive patterns you should use and anti-patterns you should avoid

Integrate Hive with other data processing programs

Use storage handlers for NoSQL databases and other datastores

Learn the pros and cons of running Hive on Amazon's Elastic MapReduce

Expand title description text

Computer Technology Nonfiction

Details

Publisher:
O'Reilly Media

Kindle Book
Release date: September 18, 2012

OverDrive Read
ISBN: 9781449326975
File size: 2823 KB
Release date: September 18, 2012

EPUB ebook
ISBN: 9781449326975
File size: 2821 KB
Release date: September 18, 2012
Creators
- Edward Capriolo - Author
- Dean Wampler - Author
- Jason Rutherglen - Author
Formats

Kindle Book
OverDrive Read
EPUB ebook
Languages

English