Skip to content
Change the repository type filter

All

    Repositories list

    • crawlee

      Public
      Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
      TypeScript
      Apache License 2.0
      66916k11714Updated Nov 19, 2024Nov 19, 2024
    • Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
      Python
      Apache License 2.0
      3194.6k828Updated Nov 19, 2024Nov 19, 2024
    • Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.
      TypeScript
      26000Updated Nov 19, 2024Nov 19, 2024
    • The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.
      Python
      Apache License 2.0
      11120141Updated Nov 19, 2024Nov 19, 2024
    • apify-cli

      Public
      Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.
      TypeScript
      19122353Updated Nov 19, 2024Nov 19, 2024
    • openapi

      Public
      An OpenAPI specification for the Apify API.
      JavaScript
      MIT License
      02173Updated Nov 19, 2024Nov 19, 2024
    • workflows

      Public
      Apify's reusable github workflows
      Python
      4744Updated Nov 19, 2024Nov 19, 2024
    • This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
      Apache License 2.0
      0274Updated Nov 19, 2024Nov 19, 2024
    • Utilities and constants shared across Apify projects.
      TypeScript
      Apache License 2.0
      111250Updated Nov 19, 2024Nov 19, 2024
    • Apify ESLint preset to be shared between projects
      JavaScript
      Apache License 2.0
      0210Updated Nov 18, 2024Nov 18, 2024
    • Apify API client for JavaScript / Node.js.
      JavaScript
      Apache License 2.0
      2768164Updated Nov 18, 2024Nov 18, 2024
    • RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.
      TypeScript
      Apache License 2.0
      0311Updated Nov 18, 2024Nov 18, 2024
    • Apify SDK monorepo
      TypeScript
      Apache License 2.0
      35123107Updated Nov 18, 2024Nov 18, 2024
    • Apify API client for Python
      Python
      Apache License 2.0
      114983Updated Nov 18, 2024Nov 18, 2024
    • This project is the home of Apify's documentation.
      API Blueprint
      Apache License 2.0
      76297022Updated Nov 18, 2024Nov 18, 2024
    • Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
      TypeScript
      Apache License 2.0
      1039871911Updated Nov 18, 2024Nov 18, 2024
    • Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.
      JavaScript
      Apache License 2.0
      145850711Updated Nov 17, 2024Nov 17, 2024
    • Apify integration for Zapier
      JavaScript
      Apache License 2.0
      1852Updated Nov 15, 2024Nov 15, 2024
    • Scrape list of available integrations from Make
      TypeScript
      0001Updated Nov 15, 2024Nov 15, 2024
    • Scrape list of Zapier integrations from Zapier website
      TypeScript
      0001Updated Nov 15, 2024Nov 15, 2024
    • Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)
      Python
      Apache License 2.0
      4400Updated Nov 14, 2024Nov 14, 2024
    • Python
      Apache License 2.0
      0300Updated Nov 13, 2024Nov 13, 2024
    • The Github action that makes sure that each PR is correctly set up and has a milestone set.
      TypeScript
      Apache License 2.0
      1110Updated Nov 13, 2024Nov 13, 2024
    • Base Docker images for Apify actors.
      Dockerfile
      Apache License 2.0
      227093Updated Nov 8, 2024Nov 8, 2024
    • This tool integrates with AWS to monitor service usage costs and posts a summary of these costs to a Slack channel. The summary includes costs for various AWS services along with a chart that provides a visual breakdown of the costs over time.
      TypeScript
      MIT License
      0001Updated Nov 5, 2024Nov 5, 2024
    • This project is the 🏠 home of Apify actor template projects to help users quickly get started.
      Python
      182681Updated Oct 25, 2024Oct 25, 2024
    • A Homebrew tap for Apify tools
      Ruby
      1804Updated Oct 25, 2024Oct 25, 2024
    • HTTP client made for scraping based on got.
      TypeScript
      44557151Updated Oct 23, 2024Oct 23, 2024
    • This action simplify creating of release PR
      JavaScript
      Apache License 2.0
      0000Updated Oct 23, 2024Oct 23, 2024
    • airbyte

      Public
      Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
      Python
      Other
      4.1k000Updated Oct 3, 2024Oct 3, 2024