1. 程式人生 > >Predictive Data Science with Amazon SageMaker and a Data Lake on AWS

Predictive Data Science with Amazon SageMaker and a Data Lake on AWS

This Quick Start builds a data lake environment for building, training, and deploying machine learning (ML) models with Amazon SageMaker on the Amazon Web Services (AWS) Cloud. The deployment, which takes about 10-15 minutes, uses AWS services such as Amazon Simple Storage Service (Amazon S3), Amazon API Gateway, AWS Lambda, Amazon Kinesis Data Streams, and Amazon Kinesis Data Firehose.

Amazon SageMaker is a managed platform that enables developers and data scientists to build, train, and deploy ML models quickly and easily.

This Quick Start is for users who want to unleash the power of their data to make predictive and prescriptive models for business value, without needing to configure complex ML hardware clusters. It enables end-to-end data science, starting with raw data and ending with a prediction REST API in a production system.

The Quick Start also provides a demo scenario developed by Pariveda Solutions. The demo shows how to store raw data in Amazon S3, transform the data for consumption in Amazon SageMaker, use Amazon SageMaker to build an ML model, and host the model in a prediction API for Amazon Elastic Compute Cloud (Amazon EC2) Spot pricing.

相關推薦

Predictive Data Science with Amazon SageMaker and a Data Lake on AWS

This Quick Start builds a data lake environment for building, training, and deploying machine learning (ML) models with Amazon SageMaker on the Am

Segmenting brain tissue using Apache MXNet with Amazon SageMaker and AWS Greengrass ML Inference

In Part 1 of this blog post, we demonstrated how to train and deploy neural networks to automatically segment brain tissue from an MRI scan in a s

Machine Learning with Amazon SageMaker and Cloudwick

Cloudwick’s Machine Learning with Amazon SageMaker Platform on Amazon Web Services (AWS) helps developers and business users of all skillsets leve

Building a Big Data Pipeline With Airflow, Spark and Zeppelin

Building a Big Data Pipeline With Airflow, Spark and Zeppelin“black tunnel interior with white lights” by Jared Arango on UnsplashIn this data-driven era,

How SimilarWeb analyze hundreds of terabytes of data every month with Amazon Athena and Upsolver

This is a guest post by Yossi Wasserman, a data collection & innovation team leader at Similar Web. SimilarWeb, in their own words: Si

PyTorch 1.0 preview now available in Amazon SageMaker and the AWS Deep Learning AMIs

Amazon SageMaker and the AWS Deep Learning AMIs (DLAMI) now provide an easy way to evaluate the PyTorch 1.0 preview release. PyTorch 1.0 adds seam

The Huge Role of Data Science in Artificial Intelligence and Machine Learning

Data science and big data analytics are gradually making waves with advanced technologies like artificial intelligence (AI), machine learning (ML), and dee

Continuous Delivery with Amazon EKS and Jenkins X

Amazon Elastic Container Service for Kubernetes (Amazon EKS) provides a container orchestration platform for building and deploying modern cloud a

Modernize Your Data Warehouse with Amazon Redshift

Data in every organization is growing in volume and complexity faster than ever. Yet, only a small fraction of this invaluable asset is available

Build data driven apps with real time and offline capabilities based on GraphQL

AWS AppSync is a serverless back-end for mobile, web, and enterprise applications. AWS AppSync makes it easy to build data driven mobile a

Unified Service Discovery with Amazon ECS and Kubernetes

Starting today, you can leverage unified service discovery for services managed by Amazon Elastic Container Service (Amazon ECS)

How to Build an AWS DeepLens Project with Amazon SageMaker

Amazon Web Services is Hiring. Amazon Web Services (AWS) is a dynamic, growing business unit within Amazon.com. We are currently hiring So

Discovering Data Science with Romeo Kienzler

Read Romeo’s tutorial series on deep learning Romeo presents at Jazoon Tech Days about using deep learning on IoT data in Apache Spark. In this video: Rome

Building with Watson: Streaming data enhanced with PubNub BLOCKS and Conversation

Join Josh Marinacci, Head of Developer Relations at PubNub, and his geology-themed chatbot, Mr. Rockbot, as he demonstrates how easy it is both to manage

R語言讀取資料(Practical Data Science with R 第二章)

1、用R語言讀取檔案中的資料 1.1、用R語言讀取結構化資料 以University of California Irvine Machine Learning Repository (http://archive.ics.uci.edu/ml/)的car資料為例: u

Data Lake on AWS with Talend

An out-of-the-box open data lake solution with AWS and Talend allows you to build, manage, and govern your cloud data lake in the AWS Cloud so tha

Hybrid Data Lake on AWS

This Quick Start deploys a hybrid cloud environment that integrates on-premises Hadoop clusters with a data lake on the Amazon Web Services (AWS)

Identity Federation and SSO for SaaS on AWS

Editor’s note: For the latest information, visit the . By Matt Yanchyshyn, Senior Manager of Partner Solutions Architecture at AWS

Informatica Data Lake on AWS

Informatica’s intelligent data lake management solution significantly reduces the complexity of deploying and deriving value from a data lake on A