Webinar:Learn about Apache Spark and Big Data on AWS

January 23, 2017
stackArmor

Webinar on Apache Spark and Big Data on AWS

stackArmor is an Advanced AWS Partner that enables security focused customers in Financial Services, Healthcare, Telecom, Non-profits and Public sector markets rapidly setup and deploy analytics environments.

Webinar on learning more about Apache Spark on Amazon Web Services. Some of the topics we covered are described below.

Topics:
– Apache Spark and Big Data Ecosystem Overview
– Role of Spark with respect to Hadoop, AWS, EMR, and popular big data technologies
– Analytics and ETL with SparkSQL and DataFrame/Dataset APIs
– Basics of Spark Execution and Memory
– Visualizing Data with Zeppelin (and possibly Tableau, time permitting)
– Intro to Machine Learning with SparkML
– Intro to Spark Streaming
– Spark on YARN: Clustering and Operations within EMR
– Business Cases and Architecture Patterns with Spark

Technologies:
Some of the technologies we will talk about and demonstrate include:
– Amazon EMR clusters supporting Apache Spark 2.0, HDFS and/or EMRFS, Apache Zeppelin with support for at least Scala (Spark), PySpark, (Spark)SQL, sh, hdfs interpreters

Presenter(s):

Gaurav “GP” Pal is the Founder of stackArmor and a well known expert in big data architectures on cloud based platforms such as AWS with many years of implementation experience on large data centric platforms such as USAspending.gov and Recovery.gov.

Adam Breindel is a stackArmor Big Data Consultant focused on consulting and teaching Apache Spark. Adam’s experience includes work with banks on neural-net fraud detection, streaming analytics, cluster management code, and web apps, as well as development at a variety of startup and established companies in the travel, productivity, and entertainment industries. He is excited by the way that Spark and other modern big-data tech remove so many old obstacles to system design and make it possible to explore new categories of interesting, fun, hard problems.

Learn more about stackArmor and our Analytics offerings on our website https://stackarmor.com/solutions-2/cdo-managed-data-platforms-for-chief-data-officers/

MOST RECENT

Continuous ATO: Going from Authority to Operate (ATO) to Ability to Respond

This white paper explores best practices designed to help reduce the time and cost of ATOs while improving access to risk data using process automation.

Is it time to enforce an Authority-to-Operate (ATO) for Healthcare Organizations?

The Change Healthcare security breach has impacted over 94% of hospitals as reported by the American Health Association (AHA). A cascading set of events was

GSA Small Business Office and FedRAMP PMO looking for Small Business Cloud Solutions

General Services Administration (GSA), Office of Small and Disadvantaged Business Utilization (OSDBU) and The FedRAMP PMO are hosting a webinar on March 21, 2024 to

stackArmor provides FedRAMP, FISMA/RMF, and CMMC/DFARS compliance acceleration services on Amazon Web Services (AWS). stackArmor’s ThreatAlert® Security Platform reduces the time and cost of an ATO by 40%. We serve enterprise customers in Defense, Aerospace, Space, Government, and Healthcare markets as well as ISV’s looking to offer cloud solutions for Government.

Blog

Meeting M-24-10 Deadlines: Can We Adapt FIPS199 to Classify AI Risk?

Security Analyst

Continuous ATO: Going from Authority to Operate (ATO) to Ability to Respond

Washington DC Office:

8300 Greensboro Drive, Suite 990, McLean VA 22102

Form

Webinar:Learn about Apache Spark and Big Data on AWS

Webinar on Apache Spark and Big Data on AWS

Webinar on learning more about Apache Spark on Amazon Web Services. Some of the topics we covered are described below.

Technologies: Some of the technologies we will talk about and demonstrate include: – Amazon EMR clusters supporting Apache Spark 2.0, HDFS and/or EMRFS, Apache Zeppelin with support for at least Scala (Spark), PySpark, (Spark)SQL, sh, hdfs interpreters

Continuous ATO: Going from Authority to Operate (ATO) to Ability to Respond

Is it time to enforce an Authority-to-Operate (ATO) for Healthcare Organizations?

GSA Small Business Office and FedRAMP PMO looking for Small Business Cloud Solutions

Technologies:
Some of the technologies we will talk about and demonstrate include:
– Amazon EMR clusters supporting Apache Spark 2.0, HDFS and/or EMRFS, Apache Zeppelin with support for at least Scala (Spark), PySpark, (Spark)SQL, sh, hdfs interpreters