
Amr Awadallah
Amr Awadallah
BigDataCloud Today! Event Pictures
DJ Patil
DJ Patil
BigDataCloud Today! Event PicturesHadoop Bootcamp – End-to-end Development & Deployment
A whole day Hadoop Workshop which focuses on creating Hadoop power users with hands-on labs that showcases real life Hadoop usage patterns that are commonly used in industries.
This Workshop showcases:
-
Real life usage of Hadoop, HBase and Hive
-
Various patterns of data consumption (real-time and batch)
-
Various patterns of ETL into Hadoop, HBase and Hive
-
Various patterns of data consumption from Hadoop into datamarts and BI tools like Pentaho BI or just plain web services.

This workshop consists of two labs:
1.Click Stream Analysis
2.Order Processing in a Hadoop Data Warehouse
These are HANDS-ON labs.Students will be doing the lab in real Hadoop/HBase clusters.They will learn the concepts and also do full implementation using Java programming language and Hadoop/HBase/MapREduce APIs.
1.Click Stream Analysis
In this part of the lab, we will do a ‘Click Stream Analysis’ workload. We will simulate an online ad serving agency by tracking an ad campaign performance -- how many views did we get vs for how many clicks.
This lab involves:
-
Ingesting clickstream data into Hadoop HDFS
-
Analyzing the logs using MapReduce
-
Using HBase to store campaign performance data
-
Displaying a ‘Dashboard’ to clients on how well their ad campaigns are doing
2.Order Processing in a Hadoop Data warehouse
In this lab, we will simulate a scenario where order files are processed in a Hadoop Data warehouse using techniques like HBase bulkloader and migrated to Hive for HQL analysis.
This lab involves:
-
Loading order data into Hadoop
-
Bulkload data into HBase
-
Create external tables in Hive out of HBase tables
-
Run analytics on Hive order tables
-
Run map/reduce to filter order data and extract things like "Delivered orders" only.
Prerequisites :
Developers with basic understanding of database and ACID transactions.
Audience:
Developers,Database administrators, Data Analytics professionals, Data architects, Managers.
Tickets :
For further information, please contact:
Third Eye's Training Services training@thirdeyecss.com (408) 290-9949 Ext 3Hadoop Bootcamp – End-to-end Development & Deployment
June 1 2013
Big Data Analytics for Financial Services
Big Data Analytics for Financial Services
Financial Services industry is moving towards cloud computing and big data processing with analytics. Hadoop based applications over the cloud are becoming more and more popular with extra large real-time data. Harnessing the power of big data can enhance organisational performance. However, it is not a technological question. It is a strategic one about how an organisation derives genuine insight from their data and changes the way they interact with customers, competitors and the market through fact-driven decision-making. This will set the trend in customer service, improve profitability and respond more rapidly to the evolving regulatory demands of the industry. This class will discuss about new integrated methodologies for six sigma, business process optimization across financial industry and operational business intelligence. This class will also focus on complex risk management and regulatory compliance issues. Big data analytics with Hadoop, Hbase, Hive and BI suites will be discussed.Instructor Profile
Dr. Shyam Sundar Sarkar is an entrepreneur and software researcher for over 25 years. He worked for several companies like Unisys, Informix, Oracle, Reconnex etc. as architect, researcher and senior executive. He was a serial entrepreneur and co-founder of multiple startup companies in the past. He has several patents and publications. He writes blogs in social networking sites and made valuable suggestions to influence healthcare law, green technology and financial regulations. He is currently CEO of a startup company working on Dodd-Frank financial regulations and Big Data Analytics. He can be reached at ssarkar@ayushnet.com.
Lab Work
We will go through deploying a cluster in the lab with example applications.
Prerequisites
Exposure to Financial services business processes, concepts in various trading processes and latest Dodd-Frank regulations for financial industry
Audience
C-level executives, executives, analysts, architects, business developers, code developers, IT Administrators;
Recommended Readings
- Dodd Frank Cheat Sheet - Dodd–Frank Wall Street Reform and Consumer Protection Act - How the financial services sector uses big data analytics to predict client behaviour
Class Timings
November 18th 2011 on 10:00 pm to 3:30 pm (lunch included)
Class Location
“The Network Meeting Center” at TechMart 5201 Great America Parkway, Santa Clara, CA 95054
Class Date: November 29th 2011
For further information, please contact
Dj Das djdas@thirdeyecloud.com 408-431-1487
Big Data Analytics for Financial Services
HadoopHadoop – Basics & Administration
Overview of the key ideas behind Map-Reduce that make it an almost universal template for parallel programming, why Hadoop is important for cloud computing, the technology forces that fuel Map-Reduce, and what kind of problems are suitable and not suitable for this programming paradigm. The essential server components of Hadoop are reviewed and related to the theory of Map-Reduce computation, along with the most significant tuning parameters. Hadoop programs are demonstrated on a Hadoop cluster and compared to serial execution.Instructor Profile
Sujee Maniyam has been consulting as a BigData architect. His clients include early stage startups in online advertising and enterprise companies like Hitachi Data Systems. Sujee has been developing software for the past 12 years in a variety of technologies (enterprise, web and mobile). His current interests are in Hadoop, BigData and NOSQL. His articles about Hadoop and Amazon EMR and his open source work can be found at this site.
Lab Work
We will do a demo on a Hadoop cluster running on Amazon EC2.
Prerequisites
- Basic Linux command line skills.
- Laptop with WiFi for connecting to lab systems.
Audience
Developers with Linux command line experience.
Recommended Readings
Class Dates
Nov 29th 2011
Class Timings
10:00 am to 3:00 pm
Class Location
“The Network Meeting Center” at TechMart 5201 Great America Parkway, Santa Clara, CA 95054
Class Price
For further information, please contact
Dj Das djdas@thirdeyecloud.com 408-431-1487

Hadoop – Basics & Administration
Hadoop
Platform MapReduce: Training for IT Managers, Developers, System and Storage Administrators
Course Description
Platform MapReduce: Training for IT Managers, Developers, System and Storage Administrators offers IT Managers an insight into how Platform MapReduce addresses big data challenges in an enterprise–class production environment. The course provides a deep dive into the unique advantages and benefits that Platform MapReduce delivers, as well as requirements for installing, deploying and managing the MapReduce application. This is an instructor-led course with live demonstration of the product.
Who Should Attend?
IT Managers, Data Scientists, Hadoop Application Developers, System and Storage Administrators
Prerequisites
None
Course Materials
Participants will have access to all training material.
Requirements
Participants are required to bring their own laptops.
Class Timings
10:00 am to 3:00 pm
Class Location
“The Network Meeting Center” at TechMart 5201 Great America Parkway, Santa Clara, CA 95054
Class Date: November 29th 2011
Registration
Register for Platform MapReduce training including the event (new account setup may be required)
For further information, please contact
Platform Computing Training Services
Platform MapReduce: Training for IT Managers, Developers, System and Storage Administrators
Hadoop
HBase – Development & Administration
Learn the basics of HBase which is a NOSQL database built on top of Hadoop.- how nosql database is solving the scalability issue
- HBase architecture
- designing schemas for HBase
- loading data into HBase
- quering HBase.
Instructor Profile
Sujee Maniyam has been consulting as a BigData architect. His clients include early stage startups in online advertising and enterprise companies like Hitachi Data Systems. Sujee has been developing software for the past 12 years in a variety of technologies (enterprise, web and mobile). His current interests are in Hadoop, BigData and NOSQL. His articles about Hadoop and Amazon EMR and his open source work can be found at this site.
Lab Work
We will work with a HBase cluster, look at configurations & load & query data.
Prerequisites
Developers with Java knowledge.
Audience
Developers, Data Analytics professionals, Business Analysts, Managers
Recommended Readings
- Hbase Architecture
Class Timings
10:00 am to 3:00 pm
Class Location
“The Network Meeting Center” at TechMart 5201 Great America Parkway, Santa Clara, CA 95054
Class Date: November 29th 2011
For further information, please contact
Dj Das djdas@thirdeyecloud.com 408-431-1487