Amazon Web Services’ Unique Value Proposition in Scaling Autonomous Vehicle (AV) Development and Deployment

Connected and automated vehicles (CAVs) signal the start of a transformative era in the automotive industry. These technologies are expected to support safer, more accessible, cost-effective, and immersive mobility services while disrupting traditional business models. At a macro level, the industry is working to address legal and legislative challenges and encourage social acceptance. At the same time, it is trying to resolve core technological issues both on board and off board the vehicle. On board, today’s focus is on improving the software stack’s detection, classification, path planning, and motion control modules while also securing the cyber vulnerabilities that may arise from the increased electronic content. The off-board and on-board computing aspects of autonomous development present new challenges for OEMs and suppliers. Autonomous developers need the large storage, high-performance computing, and deep learning capacity that the cloud provides. When the cloud is integrated with on-board edge computing, the combination can provide real-time compute and machine learning (ML) inference as well as data reduction to decrease bandwidth loads.

Even as the industry adjusts to the changes caused by COVID-19, it needs to address the following multidimensional challenges related to autonomous development:

Hyperscaling: Compute and data management issues with respect to the significant amounts of data ingestion, storage, and processing required
Agility and Speed: Ability to reduce software development and validation costs to enable faster time to market
Cost and Safety: Lack of AV expertise, the cost of infrastructure (on-premises compute, large-scale computing, data storage, etc.), and human capital investments, along with the functional safety and fail-safe methodologies needed as decision-making is transferred from the driver to the vehicle
Ecosystem and Software-defined: Addition of new software from various third-party vendors and partners, leading to workload interoperability issues which, in turn, cause integration and testing concerns
Compliance: The need for global compliance in security and data privacy

Over the last five years, OEMs and investment firms have spent several millions of dollars developing the building blocks of autonomous driving. However, an important element in establishing these services is the integration of cloud infrastructure and software deployment. Trials of specific use cases and pilots notwithstanding, progress towards building systems with the vision of scaling them towards full-fledged service solutions remains limited.

AWS’s Two-pronged Approach towards AV Deployment Strategies

The ability to continually build, train, simulate, and test is essential to improving the accuracy of perception and path planning models. AWS addresses these challenges with a suite of solutions and tools, classified as “Infinite Loop” and “Big Loop,” across the entire AV development cycle.

The “Infinite Loop” Approach: This integrated, holistic approach centers on the infinity workflow, i.e., a process that never ends but continues to evolve and improve. This loop consists of five significant steps where AWS and its partner network provide crucial services that become the essential building blocks in customers’ successful AV deployment journeys.

Exhibit 1: The Five Steps of the AWS “Infinity” Workflow

Data Management, Processing, and Analysis

A typical test vehicle generates anywhere from 10 TB to 120 TB of data during a 6-8 hour drive. These massive data sets are transferred online to AWS data centers using Direct Connect, S3 Transfer Acceleration, Kinesis Data Firehose, or DataSync. When data volumes are too large to move over the network in a reasonable time, they can be transferred physically using AWS’s Snowball family of devices, which offer physical storage with computing capabilities. Storing the data is only part of the problem; cost also depends on how frequently the data needs to be accessed. Tiered storage classes, such as S3 Glacier Deep Archive for infrequently accessed data, help customers optimize cost. In addition, the recently created AV data lake v2 has deep constructs that target specific customer needs, making it a central part of an organization’s data strategy. V2 is an MDF4/Rosbag-based data ingestion and processing pipeline that allows data scientists and developers to access data with their choice of analytic tools and frameworks and draw insights from ML-based models.
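To make the choice between online and physical transfer concrete, the following minimal sketch works through the arithmetic for the data volumes quoted above. The 10 Gbit/s link speed, the 80% usable-bandwidth factor, and the use of decimal terabytes are illustrative assumptions, not figures from AWS.

```python
def sustained_rate_gbps(terabytes: float, hours: float) -> float:
    """Average data rate in Gbit/s when a volume is produced over a drive."""
    bits = terabytes * 1e12 * 8  # decimal terabytes converted to bits
    return bits / (hours * 3600) / 1e9


def upload_hours(terabytes: float, link_gbps: float, efficiency: float = 0.8) -> float:
    """Hours needed to move a volume over a link.

    link_gbps and efficiency are assumptions chosen for illustration.
    """
    bits = terabytes * 1e12 * 8
    return bits / (link_gbps * 1e9 * efficiency) / 3600


if __name__ == "__main__":
    # The two ends of the range quoted in the text: 10 TB in 8 h, 120 TB in 6 h.
    for tb, hrs in [(10, 8), (120, 6)]:
        print(
            f"{tb} TB over {hrs} h: {sustained_rate_gbps(tb, hrs):.2f} Gbit/s generated, "
            f"{upload_hours(tb, 10):.1f} h to upload at an assumed 10 Gbit/s"
        )
```

Under these assumptions, the 120 TB case takes about 33 hours to upload, several times longer than the drive that produced it, which is the kind of situation where shipping a physical device becomes attractive.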

Labeling

Labeling is a tedious, costly process that is massive in scale and requires very high accuracy. Not all labeling can happen automatically, so human labelers are still needed. SageMaker Ground Truth provides tools to support human labelers across 3D point cloud, video, and image data sets received from customers. Customers can choose their own workforce to label or leverage AWS’s partner network on a “pay-as-you-go” basis. For 2D models, the company also offers auto-labeling services in which an active learning model is trained on human-labeled data sets.
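The general idea behind combining auto-labeling with human review can be sketched as a confidence-based routing step. This is a generic illustration of the active learning pattern, not the algorithm Ground Truth itself uses; the `Prediction` type, the `route_predictions` helper, and the 0.9 threshold are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Prediction:
    item_id: str
    label: str
    confidence: float  # model score in [0, 1]


def route_predictions(predictions, threshold=0.9):
    """Split predictions into auto-accepted labels and items for human review.

    The threshold is an assumed value; in practice it would be tuned
    against a human-labeled validation set.
    """
    auto_labeled, needs_human = [], []
    for p in predictions:
        if p.confidence >= threshold:
            auto_labeled.append(p)
        else:
            needs_human.append(p)
    return auto_labeled, needs_human


if __name__ == "__main__":
    batch = [
        Prediction("img-001", "pedestrian", 0.97),
        Prediction("img-002", "cyclist", 0.62),
        Prediction("img-003", "vehicle", 0.91),
    ]
    auto, human = route_predictions(batch)
    print("auto-labeled:", [p.item_id for p in auto])
    print("sent to human labelers:", [p.item_id for p in human])
```

Items that go to human labelers come back as new training data, so the model can be retrained and the share of automatically labeled items can grow over time.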

Model and Algorithm Development

Complex simultaneous simulations require large compute (CPU and GPU) capacity, which is expensive to build and lacks the flexibility to meet development timeline pressures. It also requires internal expertise, which is lacking across several automakers and suppliers. AWS provides this capacity as a service, with the option to create an AV ML stack through SageMaker. In addition, AWS provides various types of Amazon EC2 instances with different configurations of CPU, memory, storage, and networking resources to suit user needs. An EC2 instance is a virtual server, and customers can launch a practically unlimited number of them. Each instance type is available in different sizes to address specific workload requirements. For example, P3 and P4 instances use NVIDIA GPUs and are designed for HPC and large distributed ML training jobs. Graviton2 is AWS’s Arm-based CPU, offered in several instance families. It offers a 30% performance improvement as well as significant cost optimization, and it is the foundation for environmental parity.

Toyota Research Institute (TRI-AD) is one of many examples where Amazon EC2 P3 instances were used to reduce the time taken to train ML models, in this case by 75%. This has allowed TRI-AD to incorporate new data sets to retrain models and introduce new features.

Simulation and Verification & Validation (V&V)

As the auto industry is highly regulated, the ISO 26262 V-Model prescribes methodologies to mitigate safety risks across automotive applications. This pattern is reflected in most AV development projects, where hardware-in-the-loop (HiL) simulation is required for much of the system-level testing and validation on the right side of the V-Model. AWS provides several services to support this process via open-loop and closed-loop workflows. A typical perception module uses an open-loop workflow, whereas the planning/control module uses a closed-loop workflow.
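The difference between the two workflows can be sketched in a few lines. In the open-loop case, recorded data is replayed and the module’s output never affects the next input; in the closed-loop case, each command changes the simulated state that the module sees next. The functions below are a toy illustration under those definitions, with a hypothetical one-dimensional car-following scenario standing in for a real simulator.

```python
def open_loop_eval(recorded_frames, perceive):
    """Replay logged (frame, ground_truth) pairs and return accuracy.

    The perception output is only scored; it never alters the next input.
    """
    hits = sum(1 for frame, truth in recorded_frames if perceive(frame) == truth)
    return hits / len(recorded_frames)


def closed_loop_sim(plan, gap_m, lead_speed_mps, ego_speed_mps, steps, dt=0.1):
    """Step a toy car-following simulation; return True if no collision occurs.

    The planner's acceleration command feeds back into the state it sees next.
    """
    gap, speed = gap_m, ego_speed_mps
    for _ in range(steps):
        accel = plan(gap, speed)               # planner observes the current state
        speed = max(0.0, speed + accel * dt)   # command changes the ego speed
        gap += (lead_speed_mps - speed) * dt   # which changes the next observation
        if gap <= 0:
            return False
    return True


if __name__ == "__main__":
    # Open loop: a stand-in "detector" scored against recorded labels.
    frames = [(1, True), (-1, False), (2, False)]
    print("open-loop accuracy:", open_loop_eval(frames, lambda f: f > 0))

    # Closed loop: a planner that brakes when the gap falls below 20 m.
    brake_when_close = lambda gap, speed: -5.0 if gap < 20 else 0.0
    print("closed-loop safe:", closed_loop_sim(brake_when_close, 30, 10, 15, 200))
```

Open-loop jobs are independent of one another and easy to run in parallel over recorded drives, while closed-loop runs need a simulator in the loop for every step.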

Mobileye is one of several companies that use EC2 Spot Instances and AWS Batch to run large-scale simulations, allowing it to scale AV workloads up and down flexibly.

AV Development Workspace

Last but not least is the need to give architects who develop workflows a holistic view of all running pipelines and jobs, while overcoming challenges such as disparate data sources for specific use cases, data cleansing, and preparation of data for downstream consumption. An Amazon Managed Workflows for Apache Airflow (MWAA) environment enables end-to-end data pipelines to be set up and operated in the cloud at scale. Continental Automotive Edge (CAEdge) is an example of a virtual workbench on AWS that offers toolchains to develop, supply, and maintain software-intensive system functions. These toolchains also provide access to AWS Partner Network (APN) tools, which are clearly laid out in the workspace to help data scientists and developers do their jobs efficiently.

The “Big Loop” Approach

The “Big Loop” approach showcases how software assets prepared through the “Infinite Loop” approach can be deployed in shadow mode or through an edge intelligent agent.

Exhibit 2: AWS “Big Loop” Workflow

In strategic partnership with AWS, BlackBerry has co-developed the BlackBerry IVY platform to offer wide-ranging expertise in the automotive cloud. AWS has successfully taken its massive portfolio down to the edge and expanded its portfolio of cloud features. Software is deployed through over-the-air (OTA) agents to the autonomous ECU, where a test version runs in parallel with the production software; this is called shadow mode. The edge intelligent agent compares the performance of the production software with that of the software under test, highlights where the two versions agree or disagree, and sends the results back to the cloud via the Infinite Loop. The increase in vehicle edge intelligence will increase the ability to capture more pertinent information, which, in turn, will improve the overall competency of the system.
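A minimal sketch of the shadow-mode comparison described above might look like the following. It is an illustration only: the `Disagreement` record, the `compare_shadow` function, the scalar outputs, and the tolerance value are hypothetical and do not represent BlackBerry IVY or any AWS API.

```python
from dataclasses import dataclass


@dataclass
class Disagreement:
    frame_id: int
    production: float
    shadow: float


def compare_shadow(frames, production_fn, shadow_fn, tolerance=0.5):
    """Run production and test software on the same inputs and log disagreements.

    Only the production output would drive the vehicle; the shadow output
    is computed for comparison and never reaches the actuators.
    """
    events = []
    for frame_id, sensor_input in frames:
        prod_out = production_fn(sensor_input)
        shadow_out = shadow_fn(sensor_input)
        if abs(prod_out - shadow_out) > tolerance:
            events.append(Disagreement(frame_id, prod_out, shadow_out))
    return events


if __name__ == "__main__":
    frames = [(0, 1.0), (1, 10.0), (2, 3.0)]
    production = lambda x: x          # stand-in for the released model
    candidate = lambda x: x * 1.1     # stand-in for the version under test
    for event in compare_shadow(frames, production, candidate):
        print("disagreement on frame", event.frame_id)
```

Because only the disagreements are recorded, the vehicle sends back a small set of informative events rather than the full sensor stream.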

Through this process, AWS brings insights closer to the edge, enabling seamless deployments through OTA updates and a continuous reinvention of the customer experience. Instead of sending all data to the cloud, the required data can be processed close to the edge in an organized way, saving expensive data transmission costs, reducing incompatible niche point solutions, and supporting better scaling of services. AWS brings all this together through its IoT Greengrass service, which extends cloud functionality (i.e., cloud management, analytics, and storage) to the edge. At AWS re:Invent 2021, the company announced IoT FleetWise, with which automakers can collect, organize, and standardize data in any format present in their vehicles for easy analysis in the cloud. This service supports automakers with intelligent filtering capabilities that allow developers to reduce network traffic.

In addition, this “Big Loop” does not move data but moves software, and this software requires parity from a software-defined vehicle (SDV) perspective. While several automotive ecosystem participants tend to develop software services in the cloud and then deploy them on the edge, the two environments often differ in processor architecture, for example, an Intel processor in the cloud and an Arm processor in the vehicle, or vice versa. This difference creates additional steps such as cross-compilation, which are highly error-prone. In partnership with Arm, AWS will now be able to achieve environmental parity between software execution in the cloud and on the embedded edge, which is one of its important strategies for AV deployment. This environmental parity will help achieve two vital objectives:

The ability to deploy bit-identical binaries to the cloud and the vehicle edge, leveraging instruction set parity, because both run on the Arm64 architecture
The ability to use the cloud to host most real automotive applications. For example, running automotive-grade software natively in the cloud means V&V activities can be performed at scale with native properties, a development that has profound implications.
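As a rough illustration of what checking for this parity could involve, the sketch below tests whether a machine reports a 64-bit Arm architecture and whether two build artifacts are byte-for-byte identical by comparing SHA-256 digests. The helper names are hypothetical, and this is not a description of AWS or SOAFEE tooling.

```python
import hashlib
import platform

# Names commonly reported for 64-bit Arm by platform.machine().
ARM64_NAMES = {"aarch64", "arm64"}


def is_arm64(machine=None):
    """Return True if the given (or current) machine string names 64-bit Arm."""
    return (machine or platform.machine()).lower() in ARM64_NAMES


def same_binary(cloud_artifact: bytes, vehicle_artifact: bytes) -> bool:
    """Return True if both artifacts have the same SHA-256 digest."""
    cloud_digest = hashlib.sha256(cloud_artifact).digest()
    vehicle_digest = hashlib.sha256(vehicle_artifact).digest()
    return cloud_digest == vehicle_digest


if __name__ == "__main__":
    print("running on Arm64:", is_arm64())
    # Placeholder byte strings stand in for real build outputs.
    print("artifacts identical:", same_binary(b"\x7fELF-demo", b"\x7fELF-demo"))
```

When both checks pass, the binary validated in the cloud is the same one that runs in the vehicle, so no cross-compilation step sits between testing and deployment.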

AWS is one of the co-founding members of SOAFEE, an industry initiative to extend cloud-native software experiences to automotive workloads. It includes an open-source reference implementation to enable commercial and non-commercial offerings. AWS presented this idea at the Arm DevSummit workshop, showcasing real-world examples. This has led to the company creating a parity-enabled SDV ecosystem.

Conclusion: What This Means for the Auto Industry

While autonomous driving technology is still at a nascent stage of maturity, key challenges centered on complex driving scenarios are being addressed by improving the fidelity of sensors, enhancing perception through ML, and developing overall infrastructure. Frost & Sullivan believes that such accelerated improvements in capabilities have resulted in the need for new technology support systems that draw heavily on cloud and embedded edge-based solutions.

As AV development progresses from the pilot phase to commercial deployment, developers will need to look for cost-effective yet rapidly scalable solutions that meet data ingestion and compute demands. Along the software development life cycle of AV applications, cloud and infrastructure support partners like AWS and the AWS Partner Network (APN) play a key role in addressing various challenges related to data handling. Over the next decade, AV developers will face significant pressure to reduce costs and achieve profitability. The ability to scale efficiently and the need to invest financial resources wisely will be vital to sustaining autonomous business models. From S3 and EC2 P3 instances to its parity-enabled SDV solution, AWS provides a complete data handling platform for AV developers, allowing them to realize scale cost-effectively. This underlines AWS’s role as an integral partner in AV deployment.
