
What is Amazon Nova Act? A Powerful New AI Agent SDK for Web Automation

Amazon has unveiled Amazon Nova Act, a new artificial intelligence (AI) model trained specifically to perform actions within a web browser. Think of it as a digital assistant that can carry out website tasks on its own, such as filling out forms, navigating complex interfaces, and dismissing pop-ups.


This move shows Amazon pushing AI beyond conversational chatbots and information retrieval and into task automation. Alongside the model, Amazon has released the Nova Act SDK (Software Development Kit), which lets developers start experimenting with agents for simple browser-based tasks.


Why AI Agents Like Nova Act Matter for Business

Today, most AI tools are good at conversation (like chatbots) or at finding information, but their ability to take action within digital environments is often limited. Many current AI agents rely heavily on predefined connections (APIs) to interact with software. The problem? Most websites and many real-world business processes do not expose comprehensive APIs, which creates a bottleneck for automation.


Amazon envisions a future where AI agents like Nova Act handle complex, multi-step tasks directly for users, such as automating event planning logistics or streamlining IT support processes, without a human intervening at every step. Agentic AI is still developing and often requires human oversight, but the stated goal is truly autonomous workflow automation.


What Can Amazon Nova Act Do?

Amazon Nova Act is trained to understand and interact with the elements of a web page. Given simple commands, it can perform actions that a human would normally carry out by hand. The technology is currently available as a research preview through the SDK, which developers can use to build AI agents that handle tasks such as:

  • Scheduling appointments

  • Managing emails

  • Filling out online forms

  • Navigating websites

  • Processing online orders


Performance and Benchmarks

Amazon reports that Nova Act performs well on internal benchmarks that measure how accurately it controls web elements, and that it shows strong results compared with other approaches on specific interaction tests. Because the model is still in research preview, broader comparisons across standard AI agent evaluations are not yet available.


Key Features and Capabilities:

  • Web-Native Interaction: Nova Act is designed from the ground up to work with buttons, forms, dropdowns, date pickers, and other common web elements.

  • Developer Focused (Initially): The Nova Act SDK provides tools for developers to build and test prototype agents, breaking down complex tasks into manageable steps.

  • Reliable Building Blocks: The SDK uses foundational commands (like 'search,' 'click,' 'fill field,' and 'checkout') designed for high accuracy, even on tricky web interfaces. Amazon is targeting over 90% success rates in internal tests for these core actions.

  • Customizable Instructions: Developers can provide detailed guidance, such as instructing an agent to decline optional upsells during checkout or to extract specific information from a page.

  • Integration Flexibility: Agents can call external APIs or run snippets of Python code for custom logic, validation, or checks.

  • Works Behind the Scenes: Once configured, Nova Act agents can operate without direct human observation ("headlessly") or run on a schedule.

  • Potential Versatility: Early tests suggest Nova Act's understanding of interfaces might extend to other web-based environments, potentially even web games.
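The "building blocks" idea above can be sketched in a few lines of Python. This is only an illustration of the pattern (small, explicit steps with optional Python checks between them), not the Nova Act SDK itself: StubBrowserAgent, Step, and run_workflow are hypothetical names invented for this example, and the stub records prompts instead of driving a real browser.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class StubBrowserAgent:
    """Hypothetical stand-in for a browser agent; it only records prompts."""
    headless: bool = True  # mirrors the idea of running without a visible browser
    log: List[str] = field(default_factory=list)

    def act(self, prompt: str) -> str:
        # A real agent would drive the browser here; the stub just logs the prompt.
        self.log.append(prompt)
        return f"done: {prompt}"


@dataclass
class Step:
    prompt: str                                    # one small, focused instruction
    check: Optional[Callable[[str], bool]] = None  # optional custom Python validation


def run_workflow(agent: StubBrowserAgent, steps: List[Step]) -> int:
    """Run steps in order, stop at the first failed check, return steps completed."""
    completed = 0
    for step in steps:
        result = agent.act(step.prompt)
        if step.check is not None and not step.check(result):
            break
        completed += 1
    return completed


# A complex task broken into manageable steps, including a detailed instruction
# (declining upsells) and a Python check between steps.
checkout_steps = [
    Step("search for a coffee maker"),
    Step("click the first result"),
    Step("add the item to the cart", check=lambda r: r.startswith("done")),
    Step("check out, declining any optional upsells"),
]

agent = StubBrowserAgent(headless=True)
completed = run_workflow(agent, checkout_steps)
print(f"{completed} of {len(checkout_steps)} steps completed")
```

Keeping each prompt narrow makes a failure easy to localize: the workflow stops at the step whose check fails, rather than somewhere inside one long instruction.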


The Challenge: Achieving Consistent Automation

The biggest hurdle for all autonomous AI agents is reliability and consistency. Early systems can be slow and error-prone when a website changes unexpectedly, and they struggle with nuances that humans handle easily.


Amazon's strategy with Nova Act is to focus on highly reliable foundational actions, hoping this provides a more stable base for building dependable agents. The real test will be how effectively developers can use the SDK to create powerful solutions for real-world business challenges.
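A little arithmetic shows why per-action reliability matters so much. The sketch below assumes, purely for illustration, that every step in a workflow succeeds independently with the same probability; real web tasks will not be this tidy, and the function names are made up for this example. Under that assumption, a 10-step workflow built from 90%-reliable steps finishes cleanly only about 35% of the time.

```python
def workflow_success(step_success: float, steps: int) -> float:
    """Chance that every step succeeds, assuming independent, equally reliable steps."""
    return step_success ** steps


def step_success_with_retries(step_success: float, attempts: int) -> float:
    """Chance that a step succeeds at least once in the given number of attempts."""
    return 1.0 - (1.0 - step_success) ** attempts


if __name__ == "__main__":
    for p in (0.90, 0.99):
        rate = workflow_success(p, steps=10)
        print(f"per-step {p:.0%} -> 10-step workflow {rate:.1%}")

    # Allowing one retry per step (assumed independent) lifts 90% steps to 99%.
    retried = step_success_with_retries(0.90, attempts=2)
    print(f"90% step with one retry -> {retried:.0%}")
```

Small gains at the level of a single click or form fill therefore compound into large gains for the whole task, which is the reasoning behind building on highly reliable foundational actions.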


Conclusion:

Amazon Nova Act marks a significant move by Amazon into the rapidly growing field of agentic AI. Amazon wants to make browser automation more practical by focusing on reliable web interaction, an area where many current systems are weak.


Providing developers with the Nova Act SDK encourages experimentation and the creation of AI agents that could significantly impact everyday business productivity by automating routine web-based tasks.


This release from Amazon Science intensifies competition in the AI workflow automation space. While the journey toward truly autonomous, consistently performing AI agents continues, Nova Act is a notable step forward that highlights AI's potential not only to inform but also to act.
