Imprint Extractor
Imprint Extractor
Extract structured company data from German Impressum (legal notice) pages, including register identifiers, tax identifiers, website URL, and social profile links.
[!NOTE]
See the API Guide for the shared request lifecycle (sync, async polling, async webhook), error envelope, rate limits, custom_vars, and deduplication. This page documents only the fields and behaviour unique to this endpoint.
Try it
Pricing
Endpoint
Examples
Output Fields
The extractor preserves the original fields (firmenname, adresse, telefon, email, rechtlicher_verantwortlicher, geschaeftsfuehrer, and impressum_url) and also returns:
ust_id — German VAT ID such as DE123456789, or an empty string.
steuernummer — German tax number as listed, or an empty string.
handelsregister_number — register number such as HRB 123456 or HRA 350654, or an empty string.
handelsregister_court — register court, or an empty string.
website — canonical website URL when available.
social — object with linkedin, twitter, facebook, instagram, youtube, and tiktok keys. Missing profiles are null.
Outbound website fetches are protected by shared SSRF-safe URL validation. Redirect hops are validated before following, and unsafe hosts, IP ranges, ports, userinfo, and non-HTTP(S) URLs are rejected.
Error Responses
Validation Error (422)
{
"message": "The domain field is required.",
"errors": {
"domain": ["The domain field is required."]
}
}
Failed Enrichment (422 Unprocessable Entity)
When the enrichment fails, the HTTP status code is 422 and the response body contains "status": "failed" with an error_message:
{
"id": 123,
"type": "imprint_extractor",
"status": "failed",
"input": {
"domain": "invalid-domain"
},
"output": null,
"error_message": "Unable to extract impressum data"
}
Typical Response Times