Amr Awadallah

DJ Patil

Hadoop Bootcamp – End-to-end Development & Deployment

A full-day Hadoop workshop that focuses on creating Hadoop power users, with hands-on labs that showcase real-life Hadoop usage patterns common in industry.

This Workshop showcases:

  • Real life usage of Hadoop, HBase and Hive
  • Various patterns of data consumption (real-time and batch)
  • Various patterns of ETL into Hadoop, HBase and Hive
  • Various patterns of data consumption from Hadoop into datamarts and BI tools like Pentaho BI or just plain web services.

Hadoop Bootcamp

This workshop consists of two labs:

1. Click Stream Analysis

2. Order Processing in a Hadoop Data Warehouse

These are hands-on labs. Students will work on real Hadoop/HBase clusters. They will learn the concepts and also do a full implementation using the Java programming language and the Hadoop/HBase/MapReduce APIs.

1. Click Stream Analysis

In this part of the lab, we will run a ‘Click Stream Analysis’ workload. We will simulate an online ad-serving agency tracking the performance of an ad campaign: how many views it received versus how many clicks.

This lab involves:

  • Ingesting clickstream data into Hadoop HDFS
  • Analyzing the logs using MapReduce
  • Using HBase to store campaign performance data
  • Displaying a ‘Dashboard’ to clients on how well their ad campaigns are doing
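
The log-analysis step of this lab can be pictured with a small, self-contained sketch. It is plain Python rather than the Java and Hadoop APIs used in the workshop, and the log format (`timestamp,campaign_id,event`), the sample records, and every function name are assumptions made only for illustration. A map step emits one count per view or click, and a reduce step sums the counts per campaign, which is roughly the data a dashboard would show.

```python
from collections import defaultdict

# Hypothetical log format (an assumption): "<timestamp>,<campaign_id>,<event>",
# where <event> is either "view" or "click".
SAMPLE_LOG = [
    "2011-11-29T10:00:01,campaign_a,view",
    "2011-11-29T10:00:02,campaign_a,view",
    "2011-11-29T10:00:03,campaign_b,view",
    "2011-11-29T10:00:04,campaign_a,click",
    "2011-11-29T10:00:05,campaign_a,view",
    "2011-11-29T10:00:06,campaign_b,view",
    "2011-11-29T10:00:07,campaign_b,click",
    "2011-11-29T10:00:08,campaign_a,view",
    "garbage line that should be skipped",
]


def map_event(line):
    """Map step: emit ((campaign, event), 1) for each well-formed record."""
    parts = line.strip().split(",")
    if len(parts) != 3 or parts[2] not in ("view", "click"):
        return []  # skip malformed records instead of failing the job
    _, campaign, event = parts
    return [((campaign, event), 1)]


def reduce_counts(pairs):
    """Reduce step: sum the emitted values for each key."""
    totals = defaultdict(int)
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def campaign_report(lines):
    """Combine map and reduce, then derive views, clicks and click-through rate."""
    pairs = [pair for line in lines for pair in map_event(line)]
    totals = reduce_counts(pairs)
    report = {}
    for campaign in sorted({name for name, _ in totals}):
        views = totals.get((campaign, "view"), 0)
        clicks = totals.get((campaign, "click"), 0)
        ctr = clicks / views if views else 0.0
        report[campaign] = {"views": views, "clicks": clicks, "ctr": ctr}
    return report


report = campaign_report(SAMPLE_LOG)
for name, stats in report.items():
    print(name, stats)
```

In the lab itself, the per-campaign totals would be written to HBase rather than kept in a dictionary.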

2. Order Processing in a Hadoop Data Warehouse

In this lab, we will simulate a scenario where order files are processed in a Hadoop data warehouse using techniques such as the HBase bulk loader, and then migrated to Hive for HQL analysis.

This lab involves:

  • Loading order data into Hadoop
  • Bulk-loading data into HBase
  • Creating external tables in Hive from HBase tables
  • Running analytics on Hive order tables
  • Running MapReduce jobs to filter order data and extract subsets such as "Delivered" orders only
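
The last step, filtering order data down to delivered orders, amounts to a map-only pass over the records. The sketch below shows that idea in plain Python, not in the Hadoop Java API used in the lab. The column layout (`order_id,customer,status,amount`), the sample rows, and the function name `filter_orders` are hypothetical and chosen only for illustration.

```python
import csv
import io

# Hypothetical order file layout (an assumption): order_id,customer,status,amount
SAMPLE_ORDERS = """order_id,customer,status,amount
1001,alice,Delivered,25.00
1002,bob,Shipped,14.50
1003,carol,Delivered,99.99
1004,dave,Cancelled,5.00
"""


def filter_orders(text, wanted_status="Delivered"):
    """Keep only the rows whose status matches; no reduce step is needed."""
    reader = csv.DictReader(io.StringIO(text))
    return [row for row in reader if row["status"] == wanted_status]


delivered = filter_orders(SAMPLE_ORDERS)
delivered_total = sum(float(row["amount"]) for row in delivered)

for row in delivered:
    print(row["order_id"], row["customer"], row["amount"])
```

Because each record is examined independently, a filter like this splits naturally across many mappers with no shuffle between them.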

Prerequisites:

A basic understanding of databases and ACID transactions.

Audience:

Developers, Database administrators, Data Analytics professionals, Data architects, Managers.

Tickets:

For further information, please contact:

Third Eye's Training Services training@thirdeyecss.com (408) 290-9949 Ext 3

Big Data Analytics for Financial Services

The financial services industry is moving toward cloud computing and big data processing with analytics. Hadoop-based applications in the cloud are becoming increasingly popular for very large, real-time data sets. Harnessing the power of big data can enhance organisational performance. However, this is not a technological question. It is a strategic one: how does an organisation derive genuine insight from its data and change the way it interacts with customers, competitors and the market through fact-driven decision-making? Doing so will set the trend in customer service, improve profitability and allow a more rapid response to the evolving regulatory demands of the industry. This class will discuss new integrated methodologies for Six Sigma, business process optimization across the financial industry, and operational business intelligence. It will also focus on complex risk management and regulatory compliance issues. Big data analytics with Hadoop, HBase, Hive and BI suites will be discussed.

Instructor Profile


Dr. Shyam Sundar Sarkar has been an entrepreneur and software researcher for over 25 years. He has worked for several companies, including Unisys, Informix, Oracle and Reconnex, as an architect, researcher and senior executive. He is a serial entrepreneur and has co-founded multiple startup companies. He holds several patents and has authored several publications. He blogs on social networking sites and has made suggestions to influence healthcare law, green technology and financial regulations. He is currently CEO of a startup company working on Dodd-Frank financial regulations and Big Data Analytics. He can be reached at ssarkar@ayushnet.com.

Lab Work


We will go through deploying a cluster in the lab with example applications.

Prerequisites


Exposure to financial services business processes, concepts in various trading processes, and the latest Dodd-Frank regulations for the financial industry.

Audience


C-level executives, executives, analysts, architects, business developers, code developers, IT administrators

Recommended Readings


- Dodd Frank Cheat Sheet
- Dodd–Frank Wall Street Reform and Consumer Protection Act
- How the financial services sector uses big data analytics to predict client behaviour

Class Timings


November 18th 2011, from 10:00 am to 3:30 pm (lunch included)

Class Location


“The Network Meeting Center” at TechMart 5201 Great America Parkway, Santa Clara, CA 95054

Class Date: November 29th 2011


For further information, please contact


Dj Das djdas@thirdeyecloud.com 408-431-1487

Hadoop – Basics & Administration

Overview of the key ideas behind Map-Reduce that make it an almost universal template for parallel programming, why Hadoop is important for cloud computing, the technology forces that fuel Map-Reduce, and what kind of problems are suitable and not suitable for this programming paradigm. The essential server components of Hadoop are reviewed and related to the theory of Map-Reduce computation, along with the most significant tuning parameters. Hadoop programs are demonstrated on a Hadoop cluster and compared to serial execution.  
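
As a rough illustration of the comparison between serial and Map-Reduce execution, the sketch below counts words two ways in plain Python: once with a single serial loop, and once split into map, shuffle and reduce phases. It does not use Hadoop at all; the function names and the sample documents are assumptions made for this example. The point is only that each `map_phase` call is independent of the others, which is what lets a framework spread the work across machines.

```python
from collections import Counter, defaultdict
from itertools import chain


def serial_word_count(docs):
    """Baseline: one loop over every word in every document."""
    return dict(Counter(word for doc in docs for word in doc.lower().split()))


def map_phase(doc):
    """Map: emit (word, 1) for each word in one document."""
    return [(word, 1) for word in doc.lower().split()]


def shuffle_phase(mapped):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in chain.from_iterable(mapped):
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: sum the values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


def mapreduce_word_count(docs):
    # Each map_phase call depends only on its own document, so on a real
    # cluster these calls could run on different nodes in parallel.
    mapped = [map_phase(doc) for doc in docs]
    return reduce_phase(shuffle_phase(mapped))


DOCS = ["the quick brown fox", "the lazy dog", "The fox"]
counts = mapreduce_word_count(DOCS)
print(counts == serial_word_count(DOCS))
```

Both functions produce the same counts; the Map-Reduce version simply exposes the independent pieces of work.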

Instructor Profile


Sujee Maniyam has been consulting as a BigData architect. His clients include early stage startups in online advertising and enterprise companies like Hitachi Data Systems. Sujee has been developing software for the past 12 years in a variety of technologies (enterprise, web and mobile). His current interests are in Hadoop, BigData and NOSQL. His articles about Hadoop and Amazon EMR and his open source work can be found at this site.  

Lab Work


We will do a demo on a Hadoop cluster running on Amazon EC2.  

Prerequisites


  • Basic Linux command line skills.
  • Laptop with WiFi for connecting to lab systems.

Audience


Developers with Linux command line experience.  

Recommended Readings


 

Class Dates


Nov 29th 2011  

Class Timings


10:00 am to 3:00 pm  

Class Location


“The Network Meeting Center” at TechMart 5201 Great America Parkway, Santa Clara, CA 95054  

Class Price


 

For further information, please contact


Dj Das djdas@thirdeyecloud.com 408-431-1487

Platform MapReduce: Training for IT Managers, Developers, System and Storage Administrators

Course Description


Platform MapReduce: Training for IT Managers, Developers, System and Storage Administrators offers IT Managers an insight into how Platform MapReduce addresses big data challenges in an enterprise-class production environment. The course provides a deep dive into the unique advantages and benefits that Platform MapReduce delivers, as well as requirements for installing, deploying and managing the MapReduce application. This is an instructor-led course with live demonstration of the product.

Who Should Attend?


IT Managers, Data Scientists, Hadoop Application Developers, System and Storage Administrators

Prerequisites


None

Course Materials


Participants will have access to all training material.

Requirements


Participants are required to bring their own laptops.

Class Timings


10:00 am to 3:00 pm

Class Location


“The Network Meeting Center” at TechMart 5201 Great America Parkway, Santa Clara, CA 95054

Class Date: November 29th 2011


Registration


Register for Platform MapReduce training including the event (new account setup may be required)  

For further information, please contact


Platform Computing Training Services

HBase – Development & Administration

Learn the basics of HBase, a NoSQL database built on top of Hadoop:

  • How NoSQL databases address the scalability problem
  • HBase architecture
  • Designing schemas for HBase
  • Loading data into HBase
  • Querying HBase
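
One schema-design idea that fits the topics above is the composite row key. HBase keeps rows sorted by row key, so putting the most frequently queried field first lets a query read one contiguous range of rows. The sketch below imitates that behaviour with a toy in-memory table in plain Python. It is not the HBase client API: the class `SortedTable`, the helper `make_row_key`, the `#` separator, and the `info:status` column name are all hypothetical, and real HBase row keys are byte arrays rather than strings.

```python
import bisect


class SortedTable:
    """Toy stand-in for an HBase table: rows are kept sorted by row key."""

    def __init__(self):
        self._keys = []   # row keys in sorted order
        self._rows = {}   # row key -> {column: value}

    def put(self, row_key, columns):
        if row_key not in self._rows:
            bisect.insort(self._keys, row_key)
            self._rows[row_key] = {}
        self._rows[row_key].update(columns)

    def get(self, row_key):
        return self._rows.get(row_key)

    def scan_prefix(self, prefix):
        """Return all rows whose key starts with prefix, in key order."""
        start = bisect.bisect_left(self._keys, prefix)
        result = []
        for key in self._keys[start:]:
            if not key.startswith(prefix):
                break  # keys are sorted, so nothing later can match
            result.append((key, self._rows[key]))
        return result


def make_row_key(customer_id, order_date, order_id):
    """Composite key: customer first, then date, so one customer's orders
    sit next to each other and come back in date order."""
    return f"{customer_id}#{order_date}#{order_id}"


table = SortedTable()
table.put(make_row_key("cust42", "20111129", "1003"), {"info:status": "Delivered"})
table.put(make_row_key("cust07", "20111120", "1002"), {"info:status": "Shipped"})
table.put(make_row_key("cust42", "20111118", "1001"), {"info:status": "Delivered"})

cust42_keys = [key for key, _ in table.scan_prefix("cust42#")]
print(cust42_keys)
```

Ending the scan prefix with the separator (`cust42#`) keeps the scan from also matching a different customer such as `cust420`.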

Instructor Profile


Sujee Maniyam has been consulting as a BigData architect. His clients include early stage startups in online advertising and enterprise companies like Hitachi Data Systems. Sujee has been developing software for the past 12 years in a variety of technologies (enterprise, web and mobile). His current interests are in Hadoop, BigData and NOSQL. His articles about Hadoop and Amazon EMR and his open source work can be found at this site.  

Lab Work


We will work with an HBase cluster, look at its configuration, and load and query data.

Prerequisites


Developers with Java knowledge.  

Audience


Developers, Data Analytics professionals, Business Analysts, Managers  

Recommended Readings


- HBase Architecture

Class Timings


10:00 am to 3:00 pm  

Class Location


“The Network Meeting Center” at TechMart 5201 Great America Parkway, Santa Clara, CA 95054   

Class Date: November 29th 2011


For further information, please contact


Dj Das djdas@thirdeyecloud.com 408-431-1487
© Copyright 2013 Big Data Cloud Inc.
A not-for-profit Organization for Evangelizing & Training around Big Data.


Founded & Operated by:
Third Eye Consulting Services & Solutions LLC.