Edu-Videos | Learn All About Apache Spark (100x Faster than Hadoop MapReduce)

Apache Spark is an open-source data analytics cluster computing framework originally developed in the AMPLab at UC Berkeley. Spark fits into the Hadoop open-source community, building on top of the Hadoop Distributed File System (HDFS). However, Spark promises performance up to 100 times faster than Hadoop MapReduce for certain applications…and that’s why you should care!

Spark’s in-memory cluster computing is very well suited to machine learning algorithms. These Videos will give you a nice introduction to Spark, how it’s being used in business and why you should care…Watch Videos…

Read Article →

On-Demand Webinar | Top CTOs Discuss How to Overcome the Big Data Skills Gap

Watch On-Demand 60 minute Webinar with the CTOs of Cloudera and Splunk about how their companies are working together to make it easier for every knowledge worker to rapidly explore, analyze and visualize raw unstructured data in a Hadoop enterprise data hub without specialized training. Read More

Read Article →

Webinars | Success with “R” & Moving from SAS to “R”

Adoption of the R language has grown rapidly in the last few years, and is ranked as the number-one data science language in several surveys.

Here are 2 Webinars from R. (1) On July 29th at 1pm EST will examine successful applications of R in business, gov’t and public sectors. (2) On August 7th at 1pm EST will review the key differences between SAS and R from the user’s perspective, and provide you with the tools to move forward. Read More

Read Article →

Early Release Book | Hadoop Application Architectures “Designing Real World Big Data Applications”

EARLY RELEASE Available Now via O’Reilly books. Print Copy not out until next Spring.

Get expert guidance on architecting end-to-end data management solutions with Apache Hadoop. While many sources explain how to use various components in the Hadoop ecosystem, this practical book takes you through architectural considerations necessary to tie those components together into a complete tailored application, based on your particular use case. Read More

Read Article →

Webinar | Visual Data Discovery & Streaming Data: New Technologies for Real-Time Analytics

WATCH ON-DEMAND| WEBINAR of Dan Potter, VP of Product Marketing at Datawatch on real-time analytics.

ABOUT: New technologies and approaches are now necessary to succeed with real-time analytics in the era of big data. You will learn about the key technologies, how they work, and how they are enabling businesses in the real world to act on data in ways not possible ever before. Read More

Read Article →

DATA CHANGE | SEPT 18th Great NYC Meetup “Data Skeptics” with Capgemini’s Chief Data Scientist

DATA CHANGE! Thursday Sept 18th at 6:30p at Pivotal’s Offices, Dr. Jerry A. Smith, the Chief Data Scientist for Capgeminiā€™s Advance Digital Intelligence (ADI) group and Data Science & Analytics (DSA) group will be the guest speaker.

In this discussion, we will explore how data science is impacted by one of the most complex data sets of all times, the deep web. In doing so we use an open source intelligence driven data science framework to explore dark kinetic world of anarchy (the deep dark web) Read More

Read Article →