oxylabs-ai-studio-py: a Python SDK for AI-powered web data extraction and agentic browsing via natural language prompts
What it solves
This SDK provides a simplified way for developers to integrate AI-powered web data extraction and browsing capabilities into their Python applications. It eliminates the need to manually handle complex scraping logic, proxy management, and JavaScript rendering, while allowing users to extract structured data from the web using natural language prompts.
How it works
The SDK acts as a wrapper for the Oxylabs AI Studio API, providing dedicated classes for different data extraction tasks:
- AiCrawler: Uses natural language prompts to find and extract data from multiple pages across a website.
- AiScraper: Extracts specific data from a single page, with the ability to generate JSON schemas for structured output.
- AiSearch: Performs AI-enhanced web searches and retrieves content in markdown format.
- BrowserAgent: An agentic browser that can interact with websites (e.g., using a search bar) to find and extract specific information based on a user prompt.
- AiMap: Maps a domain to identify specific types of URLs based on keywords or natural language prompts.
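One way to read the list above is as a decision table: which class fits depends on whether the task spans many pages, targets a single page, starts from a search, requires interaction, or only needs URLs. The sketch below encodes that reading in plain Python. It does not import the SDK; the `Task` enum and the `pick_class` helper are hypothetical names invented for illustration, and only the five class names come from the list.

```python
from enum import Enum


class Task(Enum):
    """Hypothetical task categories, paraphrased from the class descriptions."""

    MULTI_PAGE_EXTRACTION = "multi_page_extraction"
    SINGLE_PAGE_EXTRACTION = "single_page_extraction"
    WEB_SEARCH = "web_search"
    INTERACTIVE_BROWSING = "interactive_browsing"
    URL_DISCOVERY = "url_discovery"


# Class names are taken from the list above; the mapping itself is illustrative
# and is not part of the SDK.
CLASS_FOR_TASK = {
    Task.MULTI_PAGE_EXTRACTION: "AiCrawler",
    Task.SINGLE_PAGE_EXTRACTION: "AiScraper",
    Task.WEB_SEARCH: "AiSearch",
    Task.INTERACTIVE_BROWSING: "BrowserAgent",
    Task.URL_DISCOVERY: "AiMap",
}


def pick_class(task: Task) -> str:
    """Return the name of the class whose description matches the task."""
    return CLASS_FOR_TASK[task]


if __name__ == "__main__":
    for task in Task:
        print(f"{task.value}: {pick_class(task)}")
```

For example, a job that must type into a site's search bar before any data appears falls under `Task.INTERACTIVE_BROWSING`, which maps to BrowserAgent rather than AiScraper.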
Who it’s for
Developers who need to collect structured data from the web at scale, automate web browsing tasks, and integrate AI-driven search and discovery of web content into their software.
Highlights
- Natural Language Extraction: Use prompts to guide the AI in finding and extracting specific data points.
- Structured Output: Supports multiple formats including JSON, CSV, and Markdown, with schema generation for JSON outputs.
- Agentic Browsing: Includes a Browser Agent capable of performing actions on a page to reach a target goal.
- Global Proxy Support: Built-in support for geo-location targeting to bypass regional restrictions.
- Async Support: Every method has a corresponding asynchronous version for high-performance applications.
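Asynchronous methods pay off mainly when many pages are processed at once. The following is a minimal sketch of that pattern using only the standard library's `asyncio`. `FakeScraper` and its `scrape_async` method are hypothetical stand-ins written for this example, not the SDK's actual class or method names, and the returned dictionary is placeholder data rather than a real response shape.

```python
import asyncio


class FakeScraper:
    """Hypothetical stand-in for an SDK client; not the real API."""

    async def scrape_async(self, url: str, user_prompt: str) -> dict:
        # Simulate network latency instead of calling a real service.
        await asyncio.sleep(0.01)
        return {"url": url, "prompt": user_prompt, "data": None}


async def scrape_many(urls: list[str], user_prompt: str) -> list[dict]:
    """Run one scrape per URL concurrently and return results in input order."""
    scraper = FakeScraper()
    # Create all coroutines up front; gather schedules them concurrently
    # and preserves the order of its arguments in the result list.
    pending = [scraper.scrape_async(url, user_prompt) for url in urls]
    return await asyncio.gather(*pending)


if __name__ == "__main__":
    demo_urls = ["https://example.com/a", "https://example.com/b"]
    for result in asyncio.run(scrape_many(demo_urls, "Extract the page title")):
        print(result["url"])
```

With a real client, the stand-in class would be replaced by the SDK's own asynchronous call, while the `asyncio.gather` structure stays the same.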
Sources
- oxylabs/oxylabs-ai-studio-py