Skip to content
@apify

apify

We're making the web more programmable.

Apify Banner

Apify is the largest ecosystem where developers build, deploy, and publish data extraction and web automation tools. We call them Actors.

Learn About Apify 🧑‍🎓

  • Find hundreds of ready-made Actors for your web scraping or automation project on Apify Store.
  • Learn everything about web scraping and automation with our free courses that will turn you into an expert scraping developer.
  • Publish your web scrapers as paid Actors on the Apify platform, attract people who need these solutions, and get regular passive income.
  • View our livestreams and video content at the Apify YouTube channel.
  • Learn more through tutorials and thought leadership content about web scraping on Apify Blog and Crawlee Blog.

We are hiring! 🕸️

Check out the open positions at Apify and help us make the web more programmable.

Pinned Loading

  1. crawlee crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

    TypeScript 19.6k 1k

  2. apify-cli apify-cli Public

    Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.

    TypeScript 155 30

  3. fingerprint-suite fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    TypeScript 1.7k 150

  4. impit impit Public

    impit | rust library for browser impersonation

    Rust 171 12

  5. actor-whitepaper actor-whitepaper Public

    This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosop…

    Python 74 2

  6. crawlee-python crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

    Python 6.3k 440

Repositories

Showing 10 of 183 repositories
  • proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.

    apify/proxy-chain’s past year of commit activity
    JavaScript 944 Apache-2.0 159 13 (1 issue needs help) 13 Updated Sep 27, 2025
  • apify-mcp-server Public

    The Apify MCP server enables your AI agents to extract data from social media, search engines, maps, e-commerce sites, or any other website using thousands of ready-made scrapers, crawlers, and automation tools available on the Apify Store.

    apify/apify-mcp-server’s past year of commit activity
    TypeScript 409 MIT 51 17 5 Updated Sep 27, 2025
  • actor-whitepaper-web Public

    Documentation site for the Actor Programming Model – a fresh take on serverless microapps. Built with Astro.

    apify/actor-whitepaper-web’s past year of commit activity
    MDX 5 MIT 1 4 15 Updated Sep 27, 2025
  • mcp-servers Public

    A curated collection of awesome MCP servers, published and monetized on Apify

    apify/mcp-servers’s past year of commit activity
    TypeScript 5 MIT 1 0 8 Updated Sep 26, 2025
  • actor-templates Public

    This project is the 🏠 home of Apify Actor templates to help users quickly get started. Contributions welcome!

    apify/actor-templates’s past year of commit activity
    Python 35 29 15 10 Updated Sep 26, 2025
  • apify-docs Public

    This project is the home of Apify's documentation.

    apify/apify-docs’s past year of commit activity
    JavaScript 48 Apache-2.0 125 123 (2 issues need help) 28 Updated Sep 26, 2025
  • crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    apify/crawlee-python’s past year of commit activity
    Python 6,330 Apache-2.0 440 67 9 Updated Sep 26, 2025
  • crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

    apify/crawlee’s past year of commit activity
    TypeScript 19,573 Apache-2.0 1,007 167 (1 issue needs help) 21 Updated Sep 26, 2025
  • apify-actor-docker Public

    Base Docker images for Apify actors.

    apify/apify-actor-docker’s past year of commit activity
    Dockerfile 86 Apache-2.0 28 7 1 Updated Sep 26, 2025
  • n8n-nodes-apify-content-crawler Public

    singe Actor node of Website Content Crawler Actor on n8n

    apify/n8n-nodes-apify-content-crawler’s past year of commit activity
    TypeScript 1 MIT 1 4 2 Updated Sep 26, 2025