Dec29, 2025

How to Integrate Helium with CapSolver for Seamless CAPTCHA Solving

Lucas Mitchell

Automation Engineer

1. Introduction

Web automation requires tools that are both powerful and easy to use. However, modern websites deploy sophisticated anti-bot measures and CAPTCHAs that can halt automation scripts.

The combination of Helium and CapSolver provides an elegant solution:

Helium: A lightweight Python library that makes Selenium easier to use with a simple, human-readable API
CapSolver: An AI-powered CAPTCHA solving service that handles Cloudflare Turnstile, reCAPTCHA and more

Together, these tools enable seamless web automation that handles CAPTCHA challenges automatically.

1.1. Integration Objectives

This guide will help you achieve three core goals:

Simplify Browser Automation - Use Helium's intuitive API for clean, readable code
Automatically Solve CAPTCHAs - Integrate CapSolver's API to handle CAPTCHA challenges without manual intervention
Maintain Flexibility - Access Selenium's full power when needed while keeping code simple

2. What is Helium?

Helium is a Python library that makes Selenium much easier to use. It provides a high-level API that lets you write browser automation in plain English.

2.1. Key Features

Simple Syntax - Write click("Submit") instead of complex XPath selectors
Auto-waiting - Automatically waits for elements to appear
Human-Readable - Code reads like instructions: write("Hello", into="Search")
Selenium Compatible - Access the underlying Selenium driver when needed
Lightweight - Minimal overhead on top of Selenium

2.2. Installation

bash Copy

# Install Helium
pip install helium

# Install requests library for CapSolver API
pip install requests

2.3. Basic Usage

python Copy

from helium import *

# Start browser and navigate
start_chrome("https://wikipedia.org")

# Type into search box
write("Python programming", into=S("input[name='search']"))

# Click search button
click(Button("Search"))

# Check if text exists
if Text("Python").exists():
    print("Found Python article!")

# Close browser
kill_browser()

3. What is CapSolver?

CapSolver is an AI-powered automatic CAPTCHA solving service that supports a wide range of CAPTCHA types. It provides a simple API that allows you to submit CAPTCHA challenges and receive solutions within seconds.

3.1. Supported CAPTCHA Types

Cloudflare Turnstile - The most common modern anti-bot challenge
reCAPTCHA v2 - Image-based and invisible variants
reCAPTCHA v3 - Score-based verification
AWS WAF - Amazon Web Services CAPTCHA
DataDome - Enterprise bot protection
And many more...

3.2. Getting Started with CapSolver

Sign up at capsolver.com
Add funds to your account
Get your API key from the dashboard

Bonus: Use code HELIUM when registering to receive bonus credits!

3.3. API Endpoints

Server A: https://api.capsolver.com
Server B: https://api-stable.capsolver.com

4. Pre-Integration Challenges

Before combining Helium with CapSolver, web automation faced several challenges:

Challenge	Impact
CAPTCHA challenges	Manual solving required, breaking automation
Complex selectors	Selenium requires verbose XPath/CSS selectors
Timing issues	Elements not ready when accessed
Code readability	Automation scripts become hard to maintain

The Helium + CapSolver integration solves these challenges with clean, readable code.

5. Integration Methods

5.1. API Integration (Recommended)

The API integration approach gives you full control over the CAPTCHA solving process and works with any CAPTCHA type.

5.1.1. Setup Requirements

bash Copy

pip install helium requests

5.1.2. Core Integration Pattern

python Copy

import time
import requests
from helium import *

CAPSOLVER_API_KEY = "YOUR_API_KEY"
CAPSOLVER_API = "https://api.capsolver.com"


def create_task(task_payload: dict) -> str:
    """Create a CAPTCHA solving task and return the task ID."""
    response = requests.post(
        f"{CAPSOLVER_API}/createTask",
        json={
            "clientKey": CAPSOLVER_API_KEY,
            "task": task_payload
        }
    )
    result = response.json()
    if result.get("errorId") != 0:
        raise Exception(f"CapSolver error: {result.get('errorDescription')}")
    return result["taskId"]


def get_task_result(task_id: str, max_attempts: int = 120) -> dict:
    """Poll for task result until solved or timeout."""
    for _ in range(max_attempts):
        response = requests.post(
            f"{CAPSOLVER_API}/getTaskResult",
            json={
                "clientKey": CAPSOLVER_API_KEY,
                "taskId": task_id
            }
        )
        result = response.json()

        if result.get("status") == "ready":
            return result["solution"]
        elif result.get("status") == "failed":
            raise Exception(f"Task failed: {result.get('errorDescription')}")

        time.sleep(1)

    raise TimeoutError("CAPTCHA solving timed out")


def solve_captcha(task_payload: dict) -> dict:
    """Complete CAPTCHA solving workflow."""
    task_id = create_task(task_payload)
    return get_task_result(task_id)

5.2. Browser Extension

You can also use the CapSolver browser extension with Helium for automatic CAPTCHA detection and solving.

5.2.1. Installation Steps

Download the CapSolver extension from capsolver.com/en/extension
Extract the extension files
Configure your API key in the extension's config.js file:

javascript Copy

// In the extension folder, edit: assets/config.js
var defined = {
    apiKey: "YOUR_CAPSOLVER_API_KEY",  // Replace with your actual API key
    enabledForBlacklistControl: false,
    blackUrlList: [],
    enabledForRecaptcha: true,
    enabledForRecaptchaV3: true,
    enabledForTurnstile: true,
    // ... other settings
}

Load it into Chrome via Helium:

python Copy

from helium import *
from selenium.webdriver import ChromeOptions

options = ChromeOptions()
options.add_argument('--load-extension=/path/to/capsolver-extension')

start_chrome(options=options)
# The extension will automatically detect and solve CAPTCHAs

Note: The extension must have a valid API key configured before it can solve CAPTCHAs automatically.

6. Code Examples

6.1. Solving reCAPTCHA v2

This example solves reCAPTCHA v2 on Google's demo page with automatic site key detection:

python Copy

import time
import requests
from helium import *
from selenium.webdriver import ChromeOptions

CAPSOLVER_API_KEY = "YOUR_API_KEY"
CAPSOLVER_API = "https://api.capsolver.com"


def solve_recaptcha_v2(site_key: str, page_url: str) -> str:
    """Solve reCAPTCHA v2 and return the token."""
    # Create the task
    response = requests.post(
        f"{CAPSOLVER_API}/createTask",
        json={
            "clientKey": CAPSOLVER_API_KEY,
            "task": {
                "type": "ReCaptchaV2TaskProxyLess",
                "websiteURL": page_url,
                "websiteKey": site_key,
            }
        }
    )
    result = response.json()

    if result.get("errorId") != 0:
        raise Exception(f"Error: {result.get('errorDescription')}")

    task_id = result["taskId"]
    print(f"Task created: {task_id}")

    # Poll for result
    while True:
        result = requests.post(
            f"{CAPSOLVER_API}/getTaskResult",
            json={
                "clientKey": CAPSOLVER_API_KEY,
                "taskId": task_id
            }
        ).json()

        if result.get("status") == "ready":
            return result["solution"]["gRecaptchaResponse"]
        elif result.get("status") == "failed":
            raise Exception(f"Failed: {result.get('errorDescription')}")

        print("  Waiting for solution...")
        time.sleep(1)


def main():
    target_url = "https://www.google.com/recaptcha/api2/demo"

    # Configure browser with anti-detection
    options = ChromeOptions()
    options.add_experimental_option('excludeSwitches', ['enable-automation'])
    options.add_experimental_option('useAutomationExtension', False)
    options.add_argument('--disable-blink-features=AutomationControlled')

    print("Starting browser...")
    start_chrome(target_url, options=options)
    driver = get_driver()

    try:
        time.sleep(2)

        # Auto-detect site key from page
        recaptcha_element = driver.find_element("css selector", ".g-recaptcha")
        site_key = recaptcha_element.get_attribute("data-sitekey")
        print(f"Detected site key: {site_key}")

        # Solve the CAPTCHA
        print("\nSolving reCAPTCHA v2 with CapSolver...")
        token = solve_recaptcha_v2(site_key, target_url)
        print(f"Got token: {token[:50]}...")

        # Inject the token
        print("\nInjecting token...")
        driver.execute_script(f'''
            var responseField = document.getElementById('g-recaptcha-response');
            responseField.style.display = 'block';
            responseField.value = '{token}';
        ''')
        print("Token injected!")

        # Submit using Helium's simple syntax
        print("\nSubmitting form...")
        click("Submit")

        time.sleep(3)

        # Check for success
        if "Verification Success" in driver.page_source:
            print("\n=== SUCCESS! ===")
            print("reCAPTCHA was solved and form was submitted!")

    finally:
        kill_browser()


if __name__ == "__main__":
    main()

Test it yourself:

bash Copy

python demo_recaptcha_v2.py

6.2. Solving Cloudflare Turnstile

Cloudflare Turnstile is one of the most common CAPTCHA challenges. Here's how to solve it:

python Copy

import time
import requests
from helium import *
from selenium.webdriver import ChromeOptions

CAPSOLVER_API_KEY = "YOUR_API_KEY"
CAPSOLVER_API = "https://api.capsolver.com"


def solve_turnstile(site_key: str, page_url: str) -> str:
    """Solve Cloudflare Turnstile and return the token."""
    response = requests.post(
        f"{CAPSOLVER_API}/createTask",
        json={
            "clientKey": CAPSOLVER_API_KEY,
            "task": {
                "type": "AntiTurnstileTaskProxyLess",
                "websiteURL": page_url,
                "websiteKey": site_key,
            }
        }
    )
    result = response.json()

    if result.get("errorId") != 0:
        raise Exception(f"Error: {result.get('errorDescription')}")

    task_id = result["taskId"]

    while True:
        result = requests.post(
            f"{CAPSOLVER_API}/getTaskResult",
            json={
                "clientKey": CAPSOLVER_API_KEY,
                "taskId": task_id
            }
        ).json()

        if result.get("status") == "ready":
            return result["solution"]["token"]
        elif result.get("status") == "failed":
            raise Exception(f"Failed: {result.get('errorDescription')}")

        time.sleep(1)


def main():
    target_url = "https://your-target-site.com"
    turnstile_site_key = "0x4XXXXXXXXXXXXXXXXX"  # Find in page source

    # Configure browser
    options = ChromeOptions()
    options.add_argument('--disable-blink-features=AutomationControlled')

    start_chrome(target_url, options=options)
    driver = get_driver()

    try:
        # Wait for Turnstile to load
        time.sleep(3)

        # Solve the CAPTCHA
        print("Solving Turnstile...")
        token = solve_turnstile(turnstile_site_key, target_url)
        print(f"Got token: {token[:50]}...")

        # Inject the token
        driver.execute_script(f'''
            document.querySelector('input[name="cf-turnstile-response"]').value = "{token}";

            // Trigger callback if present
            const callback = document.querySelector('[data-callback]');
            if (callback) {{
                const callbackName = callback.getAttribute('data-callback');
                if (window[callbackName]) {{
                    window[callbackName]('{token}');
                }}
            }}
        ''')

        # Submit the form using Helium
        if Button("Submit").exists():
            click("Submit")

        print("Turnstile bypassed!")

    finally:
        kill_browser()


if __name__ == "__main__":
    main()

6.3. Solving reCAPTCHA v3

reCAPTCHA v3 is score-based and doesn't require user interaction:

python Copy

import time
import requests
from helium import *
from selenium.webdriver import ChromeOptions

CAPSOLVER_API_KEY = "YOUR_API_KEY"
CAPSOLVER_API = "https://api.capsolver.com"


def solve_recaptcha_v3(
    site_key: str,
    page_url: str,
    action: str = "verify",
    min_score: float = 0.7
) -> str:
    """Solve reCAPTCHA v3 with specified action and minimum score."""
    response = requests.post(
        f"{CAPSOLVER_API}/createTask",
        json={
            "clientKey": CAPSOLVER_API_KEY,
            "task": {
                "type": "ReCaptchaV3TaskProxyLess",
                "websiteURL": page_url,
                "websiteKey": site_key,
                "pageAction": action,
                "minScore": min_score
            }
        }
    )
    result = response.json()

    if result.get("errorId") != 0:
        raise Exception(f"Error: {result.get('errorDescription')}")

    task_id = result["taskId"]

    while True:
        result = requests.post(
            f"{CAPSOLVER_API}/getTaskResult",
            json={
                "clientKey": CAPSOLVER_API_KEY,
                "taskId": task_id
            }
        ).json()

        if result.get("status") == "ready":
            return result["solution"]["gRecaptchaResponse"]
        elif result.get("status") == "failed":
            raise Exception(f"Failed: {result.get('errorDescription')}")

        time.sleep(1)


def main():
    target_url = "https://your-target-site.com"
    recaptcha_v3_key = "6LcXXXXXXXXXXXXXXXXXXXXXXXXX"

    # Setup headless browser for v3
    options = ChromeOptions()
    options.add_argument('--headless')

    start_chrome(target_url, options=options)
    driver = get_driver()

    try:
        # Solve reCAPTCHA v3 with "login" action
        print("Solving reCAPTCHA v3...")
        token = solve_recaptcha_v3(
            recaptcha_v3_key,
            target_url,
            action="login",
            min_score=0.9
        )

        # Inject the token
        driver.execute_script(f'''
            var responseField = document.querySelector('[name="g-recaptcha-response"]');
            if (responseField) {{
                responseField.value = '{token}';
            }}
            // Call callback if exists
            if (typeof onRecaptchaSuccess === 'function') {{
                onRecaptchaSuccess('{token}');
            }}
        ''')

        print("reCAPTCHA v3 bypassed!")

    finally:
        kill_browser()


if __name__ == "__main__":
    main()

7. Best Practices

7.1. Browser Configuration

Configure Chrome to appear more like a regular browser:

python Copy

from helium import *
from selenium.webdriver import ChromeOptions

options = ChromeOptions()
options.add_experimental_option('excludeSwitches', ['enable-automation'])
options.add_experimental_option('useAutomationExtension', False)
options.add_argument('--disable-blink-features=AutomationControlled')
options.add_argument('--window-size=1920,1080')

start_chrome(options=options)

7.2. Combining Helium with Selenium

Use Helium's simple syntax for most operations, but access Selenium when needed:

python Copy

from helium import *

start_chrome("https://target-site.com")

# Use Helium for simple interactions
write("username", into="Email")
write("password", into="Password")

# Access Selenium driver for complex operations
driver = get_driver()
driver.execute_script("window.scrollTo(0, document.body.scrollHeight)")

# Back to Helium
click("Login")

7.3. Rate Limiting

Avoid triggering rate limits by adding random delays:

python Copy

import random
import time

def human_delay(min_sec=1.0, max_sec=3.0):
    """Random delay to mimic human behavior."""
    time.sleep(random.uniform(min_sec, max_sec))

# Use between actions
click("Next")
human_delay()
write("data", into="Input")

7.4. Error Handling

Always implement proper error handling for CAPTCHA solving:

python Copy

def solve_with_retry(task_payload: dict, max_retries: int = 3) -> dict:
    """Solve CAPTCHA with retry logic."""
    for attempt in range(max_retries):
        try:
            return solve_captcha(task_payload)
        except TimeoutError:
            if attempt < max_retries - 1:
                print(f"Timeout, retrying... ({attempt + 1}/{max_retries})")
                time.sleep(5)
            else:
                raise
        except Exception as e:
            if "balance" in str(e).lower():
                raise  # Don't retry balance errors
            if attempt < max_retries - 1:
                time.sleep(2)
            else:
                raise

7.5. Headless Mode

Use headless mode for background automation:

python Copy

from helium import *
from selenium.webdriver import ChromeOptions

options = ChromeOptions()
options.add_argument('--headless')
options.add_argument('--disable-gpu')

start_chrome("https://target-site.com", options=options)

8. Helium vs Selenium: Quick Comparison

Operation	Selenium	Helium
Click button	`driver.find_element(By.XPATH, "//button[text()='Submit']").click()`	`click("Submit")`
Type text	`driver.find_element(By.NAME, "email").send_keys("test@test.com")`	`write("test@test.com", into="Email")`
Press Enter	`element.send_keys(Keys.ENTER)`	`press(ENTER)`
Check text exists	`"Welcome" in driver.page_source`	`Text("Welcome").exists()`

9. Conclusion

The integration of Helium and CapSolver creates an elegant toolkit for web automation:

Helium provides a clean, readable API for browser automation
CapSolver handles CAPTCHAs with AI-powered solving
Together they enable seamless automation with minimal code

Whether you're building web scrapers, automated testing systems, or data collection pipelines, this combination provides simplicity and power.

Bonus: Use code HELIUM when signing up at CapSolver to receive bonus credits!

10. FAQ

10.1. Why choose Helium over pure Selenium?

Helium makes Selenium easier to use:

Much simpler, human-readable syntax
Automatic waiting for elements
Less verbose code
Still has full Selenium access when needed
Faster development time

10.2. Which CAPTCHA types work best with this integration?

CapSolver supports all major CAPTCHA types. Cloudflare Turnstile and reCAPTCHA v2/v3 have the highest success rates. The integration works seamlessly with any CAPTCHA that CapSolver supports.

10.3. Can I use this in headless mode?

Yes! Helium supports headless mode via ChromeOptions. For reCAPTCHA v3 and token-based CAPTCHAs, headless mode works perfectly. For v2 visible CAPTCHAs, headed mode may provide better results.

10.4. How do I find the site key for a CAPTCHA?

Look in the page source for:

Turnstile: data-sitekey attribute or cf-turnstile elements
reCAPTCHA: data-sitekey attribute on g-recaptcha div

10.5. What if CAPTCHA solving fails?

Common solutions:

Verify your API key and balance
Ensure the site key is correct
Check that the page URL matches where the CAPTCHA appears
For v3, try adjusting the action parameter and minimum score
Implement retry logic with delays

10.6. Can I still use Selenium features with Helium?

Yes! Call get_driver() to access the underlying Selenium WebDriver for any operation Helium doesn't cover directly.

Web ScrapingApr 22, 2026

Rust Web Scraping Architecture for Scalable Data Extraction

Learn scalable Rust web scraping architecture with reqwest, scraper, async scraping, headless browser scraping, proxy rotation, and compliant CAPTCHA handling.

Lucas Mitchell

Web ScrapingApr 17, 2026

How to Scrape Job Listings Without Getting Blocked

Learn the best techniques to scrape job listings without getting blocked. Master Indeed scraping, Google Jobs API, and web scraping API with CapSolver.

Dec29, 2025

How to Integrate Helium with CapSolver for Seamless CAPTCHA Solving

Lucas Mitchell

Automation Engineer

1. Introduction

Web automation requires tools that are both powerful and easy to use. However, modern websites deploy sophisticated anti-bot measures and CAPTCHAs that can halt automation scripts.

The combination of Helium and CapSolver provides an elegant solution:

Helium: A lightweight Python library that makes Selenium easier to use with a simple, human-readable API
CapSolver: An AI-powered CAPTCHA solving service that handles Cloudflare Turnstile, reCAPTCHA and more

Together, these tools enable seamless web automation that handles CAPTCHA challenges automatically.

1.1. Integration Objectives

This guide will help you achieve three core goals:

Simplify Browser Automation - Use Helium's intuitive API for clean, readable code
Automatically Solve CAPTCHAs - Integrate CapSolver's API to handle CAPTCHA challenges without manual intervention
Maintain Flexibility - Access Selenium's full power when needed while keeping code simple

2. What is Helium?

Helium is a Python library that makes Selenium much easier to use. It provides a high-level API that lets you write browser automation in plain English.

2.1. Key Features

Simple Syntax - Write click("Submit") instead of complex XPath selectors
Auto-waiting - Automatically waits for elements to appear
Human-Readable - Code reads like instructions: write("Hello", into="Search")
Selenium Compatible - Access the underlying Selenium driver when needed
Lightweight - Minimal overhead on top of Selenium

2.2. Installation

bash Copy

# Install Helium
pip install helium

# Install requests library for CapSolver API
pip install requests

2.3. Basic Usage

python Copy

from helium import *

# Start browser and navigate
start_chrome("https://wikipedia.org")

# Type into search box
write("Python programming", into=S("input[name='search']"))

# Click search button
click(Button("Search"))

# Check if text exists
if Text("Python").exists():
    print("Found Python article!")

# Close browser
kill_browser()

3. What is CapSolver?

3.1. Supported CAPTCHA Types

Cloudflare Turnstile - The most common modern anti-bot challenge
reCAPTCHA v2 - Image-based and invisible variants
reCAPTCHA v3 - Score-based verification
AWS WAF - Amazon Web Services CAPTCHA
DataDome - Enterprise bot protection
And many more...

3.2. Getting Started with CapSolver

Sign up at capsolver.com
Add funds to your account
Get your API key from the dashboard

Bonus: Use code HELIUM when registering to receive bonus credits!

3.3. API Endpoints

Server A: https://api.capsolver.com
Server B: https://api-stable.capsolver.com

4. Pre-Integration Challenges

Before combining Helium with CapSolver, web automation faced several challenges:

Challenge	Impact
CAPTCHA challenges	Manual solving required, breaking automation
Complex selectors	Selenium requires verbose XPath/CSS selectors
Timing issues	Elements not ready when accessed
Code readability	Automation scripts become hard to maintain

The Helium + CapSolver integration solves these challenges with clean, readable code.

5. Integration Methods

5.1. API Integration (Recommended)

The API integration approach gives you full control over the CAPTCHA solving process and works with any CAPTCHA type.

5.1.1. Setup Requirements

bash Copy

pip install helium requests

5.1.2. Core Integration Pattern

python Copy

import time
import requests
from helium import *

CAPSOLVER_API_KEY = "YOUR_API_KEY"
CAPSOLVER_API = "https://api.capsolver.com"


def create_task(task_payload: dict) -> str:
    """Create a CAPTCHA solving task and return the task ID."""
    response = requests.post(
        f"{CAPSOLVER_API}/createTask",
        json={
            "clientKey": CAPSOLVER_API_KEY,
            "task": task_payload
        }
    )
    result = response.json()
    if result.get("errorId") != 0:
        raise Exception(f"CapSolver error: {result.get('errorDescription')}")
    return result["taskId"]


def get_task_result(task_id: str, max_attempts: int = 120) -> dict:
    """Poll for task result until solved or timeout."""
    for _ in range(max_attempts):
        response = requests.post(
            f"{CAPSOLVER_API}/getTaskResult",
            json={
                "clientKey": CAPSOLVER_API_KEY,
                "taskId": task_id
            }
        )
        result = response.json()

        if result.get("status") == "ready":
            return result["solution"]
        elif result.get("status") == "failed":
            raise Exception(f"Task failed: {result.get('errorDescription')}")

        time.sleep(1)

    raise TimeoutError("CAPTCHA solving timed out")


def solve_captcha(task_payload: dict) -> dict:
    """Complete CAPTCHA solving workflow."""
    task_id = create_task(task_payload)
    return get_task_result(task_id)

5.2. Browser Extension

You can also use the CapSolver browser extension with Helium for automatic CAPTCHA detection and solving.

5.2.1. Installation Steps

Download the CapSolver extension from capsolver.com/en/extension
Extract the extension files
Configure your API key in the extension's config.js file:

javascript Copy

// In the extension folder, edit: assets/config.js
var defined = {
    apiKey: "YOUR_CAPSOLVER_API_KEY",  // Replace with your actual API key
    enabledForBlacklistControl: false,
    blackUrlList: [],
    enabledForRecaptcha: true,
    enabledForRecaptchaV3: true,
    enabledForTurnstile: true,
    // ... other settings
}

Load it into Chrome via Helium:

python Copy

from helium import *
from selenium.webdriver import ChromeOptions

options = ChromeOptions()
options.add_argument('--load-extension=/path/to/capsolver-extension')

start_chrome(options=options)
# The extension will automatically detect and solve CAPTCHAs

Note: The extension must have a valid API key configured before it can solve CAPTCHAs automatically.

6. Code Examples

6.1. Solving reCAPTCHA v2

This example solves reCAPTCHA v2 on Google's demo page with automatic site key detection:

python Copy

import time
import requests
from helium import *
from selenium.webdriver import ChromeOptions

CAPSOLVER_API_KEY = "YOUR_API_KEY"
CAPSOLVER_API = "https://api.capsolver.com"


def solve_recaptcha_v2(site_key: str, page_url: str) -> str:
    """Solve reCAPTCHA v2 and return the token."""
    # Create the task
    response = requests.post(
        f"{CAPSOLVER_API}/createTask",
        json={
            "clientKey": CAPSOLVER_API_KEY,
            "task": {
                "type": "ReCaptchaV2TaskProxyLess",
                "websiteURL": page_url,
                "websiteKey": site_key,
            }
        }
    )
    result = response.json()

    if result.get("errorId") != 0:
        raise Exception(f"Error: {result.get('errorDescription')}")

    task_id = result["taskId"]
    print(f"Task created: {task_id}")

    # Poll for result
    while True:
        result = requests.post(
            f"{CAPSOLVER_API}/getTaskResult",
            json={
                "clientKey": CAPSOLVER_API_KEY,
                "taskId": task_id
            }
        ).json()

        if result.get("status") == "ready":
            return result["solution"]["gRecaptchaResponse"]
        elif result.get("status") == "failed":
            raise Exception(f"Failed: {result.get('errorDescription')}")

        print("  Waiting for solution...")
        time.sleep(1)


def main():
    target_url = "https://www.google.com/recaptcha/api2/demo"

    # Configure browser with anti-detection
    options = ChromeOptions()
    options.add_experimental_option('excludeSwitches', ['enable-automation'])
    options.add_experimental_option('useAutomationExtension', False)
    options.add_argument('--disable-blink-features=AutomationControlled')

    print("Starting browser...")
    start_chrome(target_url, options=options)
    driver = get_driver()

    try:
        time.sleep(2)

        # Auto-detect site key from page
        recaptcha_element = driver.find_element("css selector", ".g-recaptcha")
        site_key = recaptcha_element.get_attribute("data-sitekey")
        print(f"Detected site key: {site_key}")

        # Solve the CAPTCHA
        print("\nSolving reCAPTCHA v2 with CapSolver...")
        token = solve_recaptcha_v2(site_key, target_url)
        print(f"Got token: {token[:50]}...")

        # Inject the token
        print("\nInjecting token...")
        driver.execute_script(f'''
            var responseField = document.getElementById('g-recaptcha-response');
            responseField.style.display = 'block';
            responseField.value = '{token}';
        ''')
        print("Token injected!")

        # Submit using Helium's simple syntax
        print("\nSubmitting form...")
        click("Submit")

        time.sleep(3)

        # Check for success
        if "Verification Success" in driver.page_source:
            print("\n=== SUCCESS! ===")
            print("reCAPTCHA was solved and form was submitted!")

    finally:
        kill_browser()


if __name__ == "__main__":
    main()

Test it yourself:

bash Copy

python demo_recaptcha_v2.py

6.2. Solving Cloudflare Turnstile

Cloudflare Turnstile is one of the most common CAPTCHA challenges. Here's how to solve it:

python Copy

import time
import requests
from helium import *
from selenium.webdriver import ChromeOptions

CAPSOLVER_API_KEY = "YOUR_API_KEY"
CAPSOLVER_API = "https://api.capsolver.com"


def solve_turnstile(site_key: str, page_url: str) -> str:
    """Solve Cloudflare Turnstile and return the token."""
    response = requests.post(
        f"{CAPSOLVER_API}/createTask",
        json={
            "clientKey": CAPSOLVER_API_KEY,
            "task": {
                "type": "AntiTurnstileTaskProxyLess",
                "websiteURL": page_url,
                "websiteKey": site_key,
            }
        }
    )
    result = response.json()

    if result.get("errorId") != 0:
        raise Exception(f"Error: {result.get('errorDescription')}")

    task_id = result["taskId"]

    while True:
        result = requests.post(
            f"{CAPSOLVER_API}/getTaskResult",
            json={
                "clientKey": CAPSOLVER_API_KEY,
                "taskId": task_id
            }
        ).json()

        if result.get("status") == "ready":
            return result["solution"]["token"]
        elif result.get("status") == "failed":
            raise Exception(f"Failed: {result.get('errorDescription')}")

        time.sleep(1)


def main():
    target_url = "https://your-target-site.com"
    turnstile_site_key = "0x4XXXXXXXXXXXXXXXXX"  # Find in page source

    # Configure browser
    options = ChromeOptions()
    options.add_argument('--disable-blink-features=AutomationControlled')

    start_chrome(target_url, options=options)
    driver = get_driver()

    try:
        # Wait for Turnstile to load
        time.sleep(3)

        # Solve the CAPTCHA
        print("Solving Turnstile...")
        token = solve_turnstile(turnstile_site_key, target_url)
        print(f"Got token: {token[:50]}...")

        # Inject the token
        driver.execute_script(f'''
            document.querySelector('input[name="cf-turnstile-response"]').value = "{token}";

            // Trigger callback if present
            const callback = document.querySelector('[data-callback]');
            if (callback) {{
                const callbackName = callback.getAttribute('data-callback');
                if (window[callbackName]) {{
                    window[callbackName]('{token}');
                }}
            }}
        ''')

        # Submit the form using Helium
        if Button("Submit").exists():
            click("Submit")

        print("Turnstile bypassed!")

    finally:
        kill_browser()


if __name__ == "__main__":
    main()

6.3. Solving reCAPTCHA v3

reCAPTCHA v3 is score-based and doesn't require user interaction:

python Copy

import time
import requests
from helium import *
from selenium.webdriver import ChromeOptions

CAPSOLVER_API_KEY = "YOUR_API_KEY"
CAPSOLVER_API = "https://api.capsolver.com"


def solve_recaptcha_v3(
    site_key: str,
    page_url: str,
    action: str = "verify",
    min_score: float = 0.7
) -> str:
    """Solve reCAPTCHA v3 with specified action and minimum score."""
    response = requests.post(
        f"{CAPSOLVER_API}/createTask",
        json={
            "clientKey": CAPSOLVER_API_KEY,
            "task": {
                "type": "ReCaptchaV3TaskProxyLess",
                "websiteURL": page_url,
                "websiteKey": site_key,
                "pageAction": action,
                "minScore": min_score
            }
        }
    )
    result = response.json()

    if result.get("errorId") != 0:
        raise Exception(f"Error: {result.get('errorDescription')}")

    task_id = result["taskId"]

    while True:
        result = requests.post(
            f"{CAPSOLVER_API}/getTaskResult",
            json={
                "clientKey": CAPSOLVER_API_KEY,
                "taskId": task_id
            }
        ).json()

        if result.get("status") == "ready":
            return result["solution"]["gRecaptchaResponse"]
        elif result.get("status") == "failed":
            raise Exception(f"Failed: {result.get('errorDescription')}")

        time.sleep(1)


def main():
    target_url = "https://your-target-site.com"
    recaptcha_v3_key = "6LcXXXXXXXXXXXXXXXXXXXXXXXXX"

    # Setup headless browser for v3
    options = ChromeOptions()
    options.add_argument('--headless')

    start_chrome(target_url, options=options)
    driver = get_driver()

    try:
        # Solve reCAPTCHA v3 with "login" action
        print("Solving reCAPTCHA v3...")
        token = solve_recaptcha_v3(
            recaptcha_v3_key,
            target_url,
            action="login",
            min_score=0.9
        )

        # Inject the token
        driver.execute_script(f'''
            var responseField = document.querySelector('[name="g-recaptcha-response"]');
            if (responseField) {{
                responseField.value = '{token}';
            }}
            // Call callback if exists
            if (typeof onRecaptchaSuccess === 'function') {{
                onRecaptchaSuccess('{token}');
            }}
        ''')

        print("reCAPTCHA v3 bypassed!")

    finally:
        kill_browser()


if __name__ == "__main__":
    main()

7. Best Practices

7.1. Browser Configuration

Configure Chrome to appear more like a regular browser:

python Copy

from helium import *
from selenium.webdriver import ChromeOptions

options = ChromeOptions()
options.add_experimental_option('excludeSwitches', ['enable-automation'])
options.add_experimental_option('useAutomationExtension', False)
options.add_argument('--disable-blink-features=AutomationControlled')
options.add_argument('--window-size=1920,1080')

start_chrome(options=options)

7.2. Combining Helium with Selenium

Use Helium's simple syntax for most operations, but access Selenium when needed:

python Copy

from helium import *

start_chrome("https://target-site.com")

# Use Helium for simple interactions
write("username", into="Email")
write("password", into="Password")

# Access Selenium driver for complex operations
driver = get_driver()
driver.execute_script("window.scrollTo(0, document.body.scrollHeight)")

# Back to Helium
click("Login")

7.3. Rate Limiting

Avoid triggering rate limits by adding random delays:

python Copy

import random
import time

def human_delay(min_sec=1.0, max_sec=3.0):
    """Random delay to mimic human behavior."""
    time.sleep(random.uniform(min_sec, max_sec))

# Use between actions
click("Next")
human_delay()
write("data", into="Input")

7.4. Error Handling

Always implement proper error handling for CAPTCHA solving:

python Copy

def solve_with_retry(task_payload: dict, max_retries: int = 3) -> dict:
    """Solve CAPTCHA with retry logic."""
    for attempt in range(max_retries):
        try:
            return solve_captcha(task_payload)
        except TimeoutError:
            if attempt < max_retries - 1:
                print(f"Timeout, retrying... ({attempt + 1}/{max_retries})")
                time.sleep(5)
            else:
                raise
        except Exception as e:
            if "balance" in str(e).lower():
                raise  # Don't retry balance errors
            if attempt < max_retries - 1:
                time.sleep(2)
            else:
                raise

7.5. Headless Mode

Use headless mode for background automation:

python Copy

from helium import *
from selenium.webdriver import ChromeOptions

options = ChromeOptions()
options.add_argument('--headless')
options.add_argument('--disable-gpu')

start_chrome("https://target-site.com", options=options)

8. Helium vs Selenium: Quick Comparison

Operation	Selenium	Helium
Click button	`driver.find_element(By.XPATH, "//button[text()='Submit']").click()`	`click("Submit")`
Type text	`driver.find_element(By.NAME, "email").send_keys("test@test.com")`	`write("test@test.com", into="Email")`
Press Enter	`element.send_keys(Keys.ENTER)`	`press(ENTER)`
Check text exists	`"Welcome" in driver.page_source`	`Text("Welcome").exists()`

9. Conclusion

The integration of Helium and CapSolver creates an elegant toolkit for web automation:

Helium provides a clean, readable API for browser automation
CapSolver handles CAPTCHAs with AI-powered solving
Together they enable seamless automation with minimal code

Whether you're building web scrapers, automated testing systems, or data collection pipelines, this combination provides simplicity and power.

Bonus: Use code HELIUM when signing up at CapSolver to receive bonus credits!

10. FAQ

10.1. Why choose Helium over pure Selenium?

Helium makes Selenium easier to use:

Much simpler, human-readable syntax
Automatic waiting for elements
Less verbose code
Still has full Selenium access when needed
Faster development time

10.2. Which CAPTCHA types work best with this integration?

CapSolver supports all major CAPTCHA types. Cloudflare Turnstile and reCAPTCHA v2/v3 have the highest success rates. The integration works seamlessly with any CAPTCHA that CapSolver supports.

10.3. Can I use this in headless mode?

Yes! Helium supports headless mode via ChromeOptions. For reCAPTCHA v3 and token-based CAPTCHAs, headless mode works perfectly. For v2 visible CAPTCHAs, headed mode may provide better results.

10.4. How do I find the site key for a CAPTCHA?

Look in the page source for:

Turnstile: data-sitekey attribute or cf-turnstile elements
reCAPTCHA: data-sitekey attribute on g-recaptcha div

10.5. What if CAPTCHA solving fails?

Common solutions:

Verify your API key and balance
Ensure the site key is correct
Check that the page URL matches where the CAPTCHA appears
For v3, try adjusting the action parameter and minimum score
Implement retry logic with delays

10.6. Can I still use Selenium features with Helium?

Yes! Call get_driver() to access the underlying Selenium WebDriver for any operation Helium doesn't cover directly.

Web ScrapingApr 22, 2026

Rust Web Scraping Architecture for Scalable Data Extraction

Learn scalable Rust web scraping architecture with reqwest, scraper, async scraping, headless browser scraping, proxy rotation, and compliant CAPTCHA handling.

Lucas Mitchell

Web ScrapingApr 17, 2026

How to Scrape Job Listings Without Getting Blocked

Learn the best techniques to scrape job listings without getting blocked. Master Indeed scraping, Google Jobs API, and web scraping API with CapSolver.

How to Integrate Helium with CapSolver for Seamless CAPTCHA Solving

1. Introduction

1.1. Integration Objectives

2. What is Helium?

2.1. Key Features

2.2. Installation

2.3. Basic Usage

3. What is CapSolver?

3.1. Supported CAPTCHA Types

3.2. Getting Started with CapSolver

3.3. API Endpoints

4. Pre-Integration Challenges

5. Integration Methods

5.1. API Integration (Recommended)

5.1.1. Setup Requirements

5.1.2. Core Integration Pattern

5.2. Browser Extension

5.2.1. Installation Steps

6. Code Examples

6.1. Solving reCAPTCHA v2

6.2. Solving Cloudflare Turnstile

6.3. Solving reCAPTCHA v3

7. Best Practices

7.1. Browser Configuration

7.2. Combining Helium with Selenium

7.3. Rate Limiting

7.4. Error Handling

7.5. Headless Mode

8. Helium vs Selenium: Quick Comparison

9. Conclusion

10. FAQ

10.1. Why choose Helium over pure Selenium?

10.2. Which CAPTCHA types work best with this integration?

10.3. Can I use this in headless mode?

10.4. How do I find the site key for a CAPTCHA?

10.5. What if CAPTCHA solving fails?

10.6. Can I still use Selenium features with Helium?

More

Rust Web Scraping Architecture for Scalable Data Extraction

How to Scrape Job Listings Without Getting Blocked

How to Integrate Helium with CapSolver for Seamless CAPTCHA Solving

1. Introduction

1.1. Integration Objectives

2. What is Helium?

2.1. Key Features

2.2. Installation

2.3. Basic Usage

3. What is CapSolver?

3.1. Supported CAPTCHA Types

3.2. Getting Started with CapSolver

3.3. API Endpoints

4. Pre-Integration Challenges

5. Integration Methods

5.1. API Integration (Recommended)

5.1.1. Setup Requirements

5.1.2. Core Integration Pattern

5.2. Browser Extension

5.2.1. Installation Steps

6. Code Examples

6.1. Solving reCAPTCHA v2

6.2. Solving Cloudflare Turnstile

6.3. Solving reCAPTCHA v3

7. Best Practices

7.1. Browser Configuration

7.2. Combining Helium with Selenium

7.3. Rate Limiting

7.4. Error Handling

7.5. Headless Mode

8. Helium vs Selenium: Quick Comparison

9. Conclusion

10. FAQ

10.1. Why choose Helium over pure Selenium?

10.2. Which CAPTCHA types work best with this integration?

10.3. Can I use this in headless mode?

10.4. How do I find the site key for a CAPTCHA?

10.5. What if CAPTCHA solving fails?

10.6. Can I still use Selenium features with Helium?

More

Rust Web Scraping Architecture for Scalable Data Extraction

How to Scrape Job Listings Without Getting Blocked