Imprint Extractor

Extract structured company data from German Impressum (legal notice) pages, including register identifiers, tax identifiers, website URL, and social profile links.

[!NOTE] See the API Guide for the shared request lifecycle (sync, async polling, async webhook), error envelope, rate limits, custom_vars, and deduplication. This page documents only the fields and behaviour unique to this endpoint.

Pricing

Endpoint

Input Parameters

Examples

Output Fields

The extractor preserves the original fields (firmenname, adresse, telefon, email, rechtlicher_verantwortlicher, geschaeftsfuehrer, and impressum_url) and also returns:

ust_id — German VAT ID such as DE123456789, or an empty string.
steuernummer — German tax number as listed, or an empty string.
handelsregister_number — register number such as HRB 123456 or HRA 350654, or an empty string.
handelsregister_court — register court, or an empty string.
website — canonical website URL when available.
social — object with linkedin, twitter, facebook, instagram, youtube, and tiktok keys. Missing profiles are null.
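As a rough illustration of how a client might consume these fields, the sketch below converts empty-string identifiers to None and guarantees that every social key is present. The helper names (normalize_output, has_valid_ust_id) are hypothetical and not part of the API. The VAT ID pattern assumes the usual German format of DE followed by nine digits, as in the example above.

```python
import re

# Social keys as listed in the output field description.
SOCIAL_KEYS = ("linkedin", "twitter", "facebook", "instagram", "youtube", "tiktok")

# Identifier fields documented as "value or an empty string".
IDENTIFIER_KEYS = (
    "ust_id",
    "steuernummer",
    "handelsregister_number",
    "handelsregister_court",
)

# Assumption: a German VAT ID is "DE" followed by nine digits.
UST_ID_PATTERN = re.compile(r"^DE\d{9}$")


def normalize_output(output: dict) -> dict:
    """Hypothetical helper: map empty identifiers to None and fill social keys."""
    result = dict(output)
    for key in IDENTIFIER_KEYS:
        value = result.get(key)
        if isinstance(value, str) and value.strip():
            result[key] = value.strip()
        else:
            result[key] = None
    # Missing profiles are null in the response; missing keys are treated the same way.
    social = result.get("social") or {}
    result["social"] = {name: social.get(name) for name in SOCIAL_KEYS}
    return result


def has_valid_ust_id(output: dict) -> bool:
    """Hypothetical check: does ust_id look like a German VAT ID?"""
    ust_id = output.get("ust_id") or ""
    return bool(UST_ID_PATTERN.match(ust_id.replace(" ", "")))
```

Normalizing once at the boundary lets downstream code test identifiers with `is None` instead of handling both empty strings and missing keys.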

Outbound website fetches are protected by shared SSRF-safe URL validation. Redirect hops are validated before following, and unsafe hosts, IP ranges, ports, userinfo, and non-HTTP(S) URLs are rejected.
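The checks described above can be sketched roughly as follows. This is not the service's actual implementation: the function names, the allowed-port set, and the use of `ipaddress.is_global` as the test for unsafe IP ranges are all assumptions made for illustration.

```python
import ipaddress
import socket
from urllib.parse import urlsplit

ALLOWED_SCHEMES = {"http", "https"}
ALLOWED_PORTS = {80, 443}  # assumption: the real port allow-list is not documented here


def resolve_host(host):
    """Resolve a hostname to IP address strings using the system resolver."""
    return [info[4][0] for info in socket.getaddrinfo(host, None)]


def is_safe_url(url, resolver=resolve_host):
    """Return True only if the URL passes every check in this sketch."""
    try:
        parts = urlsplit(url)
        port = parts.port  # raises ValueError for malformed or out-of-range ports
    except ValueError:
        return False
    if parts.scheme not in ALLOWED_SCHEMES:
        return False  # non-HTTP(S) URL
    if parts.username is not None or parts.password is not None:
        return False  # userinfo in the authority part
    if not parts.hostname:
        return False
    if port is not None and port not in ALLOWED_PORTS:
        return False
    try:
        addresses = resolver(parts.hostname)
    except OSError:
        return False  # resolution failed
    if not addresses:
        return False
    for address in addresses:
        try:
            ip = ipaddress.ip_address(address)
        except ValueError:
            return False
        # Rejects loopback, private, link-local, and other non-public ranges.
        if not ip.is_global:
            return False
    return True
```

To validate redirect hops before following them, a client would typically disable automatic redirects and call the same check on each Location target. A production implementation would also need to connect to the exact address it validated, which this sketch does not attempt.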

Error Responses

Validation Error (422 Unprocessable Entity)

{
    "message": "The domain field is required.",
    "errors": {
        "domain": ["The domain field is required."]
    }
}

Failed Enrichment (422 Unprocessable Entity)

When enrichment fails, the response also uses HTTP status 422. The body contains "status": "failed", a null output, and an error_message describing the failure:

{
    "id": 123,
    "type": "imprint_extractor",
    "status": "failed",
    "input": {
        "domain": "invalid-domain"
    },
    "output": null,
    "error_message": "Unable to extract impressum data"
}
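Because both error types share status code 422, a client has to inspect the body to tell them apart. The sketch below shows one way to do that, based only on the two body shapes shown above. The function names are hypothetical, and the "unknown" fallback is an assumption about how to treat any other shape.

```python
def classify_422(body: dict) -> str:
    """Distinguish the two documented 422 body shapes."""
    if body.get("status") == "failed":
        return "failed_enrichment"
    if isinstance(body.get("errors"), dict):
        return "validation_error"
    return "unknown"


def describe_422(body: dict) -> str:
    """Build a single human-readable message from a 422 body."""
    kind = classify_422(body)
    if kind == "failed_enrichment":
        return body.get("error_message") or "Enrichment failed"
    if kind == "validation_error":
        # Flatten the per-field message lists into one string.
        messages = [msg for msgs in body["errors"].values() for msg in msgs]
        return "; ".join(messages) or body.get("message", "Validation failed")
    return "Unrecognized 422 response"
```

A validation error usually means the request should be corrected before it is resent, while a failed enrichment means the request was accepted but no Impressum data could be extracted for that domain.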

Typical Response Times

Average: 5-10 seconds