Deploy the Connector

Run the Pauhu® data connector on your own infrastructure using Docker, Kubernetes, or Azure Arc.

Prerequisites

1. Subscribe

Get your API key from staging.pauhu.eu/pricing or Azure Marketplace. Your key will be emailed after checkout.

2. Runtime

Docker 20.10+, Podman, or any OCI-compatible container runtime.

3. Resources

4 GB RAM minimum (8 GB recommended). 50 GB storage for the full annotated dataset.

Quick Start

Pull and run

# Pull the connector image
docker pull pauhu/data:latest

# Run with your API key
docker run -d \
  --name pauhu-connector \
  -p 8003:8003 \
  -e PAUHU_API_KEY="pk_..." \
  -e PAUHU_SOURCES="eurlex,ted,iate" \
  -e PAUHU_LANGUAGES="en,fi,sv,de,fr" \
  -v pauhu-data:/data \
  pauhu/data:latest

Verify

# Health check
curl http://localhost:8003/health

# Query a source
# Quote the URL so the shell does not treat "?" as a glob character
curl "http://localhost:8003/v1/eurlex/latest?limit=5"
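
The container may need a moment before it answers requests, so a script that queries it right after `docker run` can fail. A minimal sketch of a readiness loop follows. It assumes that `/health` returns a 2xx status once the connector is ready; `wait_for_connector` is a hypothetical helper name, and the attempt count and delay are arbitrary defaults.

```shell
# Poll the connector's /health endpoint until it answers or the attempts run out.
# wait_for_connector is a hypothetical helper; only /health and port 8003 come
# from this page. A 2xx response is assumed to mean "ready".
wait_for_connector() {
  base_url="${1:-http://localhost:8003}"
  max_attempts="${2:-30}"
  delay_seconds="${3:-2}"
  attempt=1
  while [ "$attempt" -le "$max_attempts" ]; do
    # -f makes curl exit non-zero on HTTP errors; -s silences progress output.
    if curl -fs -o /dev/null "$base_url/health"; then
      echo "connector is healthy after $attempt attempt(s)"
      return 0
    fi
    sleep "$delay_seconds"
    attempt=$((attempt + 1))
  done
  echo "connector did not become healthy after $max_attempts attempts" >&2
  return 1
}
```

Calling `wait_for_connector` with no arguments polls `http://localhost:8003` up to 30 times, two seconds apart, and returns non-zero if the connector never responds.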

Configuration

Variable             Required  Description
PAUHU_API_KEY        Yes       Your API key from checkout (pk_...)
PAUHU_SOURCES        No        Comma-separated source list. Default: all 20 sources.
PAUHU_LANGUAGES      No        Comma-separated language codes. Default: all 24 EU languages.
PAUHU_SYNC_INTERVAL  No        Sync frequency in minutes. Default: 15.
PAUHU_PORT           No        HTTP port. Default: 8003.
PAUHU_DATA_DIR       No        Data directory. Default: /data.
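
With several variables in play, keeping them in a Docker env file can be easier than repeating `-e` flags. The sketch below writes such a file from the variables listed above. `write_pauhu_env` and the file name are hypothetical; the fallback values mirror the documented defaults, and the source and language variables are left out when unset so that the connector applies its own defaults.

```shell
# Write a Docker env file with the connector settings listed above.
# write_pauhu_env is a hypothetical helper name, not part of the connector.
write_pauhu_env() {
  env_file="$1"
  api_key="$2"
  # Refuse to write a file without the one required variable.
  if [ -z "$api_key" ]; then
    echo "PAUHU_API_KEY is required" >&2
    return 1
  fi
  # Docker env files take raw values: quotes would become part of the value.
  {
    echo "PAUHU_API_KEY=$api_key"
    echo "PAUHU_SYNC_INTERVAL=${PAUHU_SYNC_INTERVAL:-15}"
    echo "PAUHU_PORT=${PAUHU_PORT:-8003}"
    echo "PAUHU_DATA_DIR=${PAUHU_DATA_DIR:-/data}"
    # Omit these when unset so the connector falls back to all sources and languages.
    if [ -n "${PAUHU_SOURCES:-}" ]; then echo "PAUHU_SOURCES=$PAUHU_SOURCES"; fi
    if [ -n "${PAUHU_LANGUAGES:-}" ]; then echo "PAUHU_LANGUAGES=$PAUHU_LANGUAGES"; fi
  } > "$env_file"
  # The file holds a secret, so restrict it to the current user.
  chmod 600 "$env_file"
}
```

After `write_pauhu_env pauhu.env "pk_..."`, the file can be passed to the container with `docker run -d --name pauhu-connector -p 8003:8003 --env-file pauhu.env -v pauhu-data:/data pauhu/data:latest`. If `PAUHU_PORT` is changed, the `-p` mapping has to change with it.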

Available Sources

Use any combination of these source identifiers in PAUHU_SOURCES:

eurlex        EUR-Lex regulations, directives, decisions
ted           Tenders Electronic Daily (public procurement)
iate          IATE terminology (2.4M multilingual terms)
eurostat      Eurostat statistical datasets
curia         Court of Justice case law
ecb           European Central Bank decisions
commission    European Commission documents
consilium     Council of the EU documents
europarl      European Parliament resolutions
publications  EU Publications Office catalog
dataeuropa    EU open data portal
echa          European Chemicals Agency
ema           European Medicines Agency
epo           European Patent Office
oeil          Legislative Observatory
cordis        EU research projects
whoiswho      EU institutional directory
dpp           Digital Product Passport classifications
lex           National law databases (28 countries)
wiki          Wikidata EU entity graph
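
A typo in `PAUHU_SOURCES` is easy to miss, and this page does not say how the connector reacts to an unknown identifier. As a precaution, the value can be checked against the list above before the container starts. This is a minimal sketch; `check_pauhu_sources` and `PAUHU_KNOWN_SOURCES` are hypothetical names used only in this example.

```shell
# Known source identifiers, copied from the list above.
PAUHU_KNOWN_SOURCES="eurlex ted iate eurostat curia ecb commission consilium europarl publications dataeuropa echa ema epo oeil cordis whoiswho dpp lex wiki"

# Report unknown identifiers in a comma-separated list on stderr.
# Returns 0 if every identifier is known, 1 otherwise.
# check_pauhu_sources is a hypothetical helper, not part of the connector.
check_pauhu_sources() {
  status=0
  # Turn commas into spaces so the shell splits the list into words.
  for src in $(echo "$1" | tr ',' ' '); do
    case " $PAUHU_KNOWN_SOURCES " in
      *" $src "*) ;;
      *) echo "unknown source: $src" >&2; status=1 ;;
    esac
  done
  return "$status"
}
```

For example, `check_pauhu_sources "eurlex,ted,iate" && docker run ...` only starts the container when all three identifiers appear in the list.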

Eclipse Dataspace Connector

For connector-to-connector data exchange via the European Language Data Space (LDS), the Pauhu connector supports the Eclipse Dataspace Protocol.

# Enable LDS protocol
docker run -d \
  --name pauhu-connector \
  -p 8003:8003 \
  -p 8282:8282 \
  -e PAUHU_API_KEY="pk_..." \
  -e PAUHU_LDS_ENABLED="true" \
  -e PAUHU_LDS_PARTICIPANT_ID="your-lds-participant-id" \
  -v pauhu-data:/data \
  pauhu/data:latest

Port 8282 serves the Eclipse Dataspace Protocol endpoints. Register your connector in the LDS Participant Registry to enable contract negotiation with other participants.

Kubernetes

apiVersion: apps/v1
kind: Deployment
metadata:
  name: pauhu-connector
spec:
  replicas: 1
  selector:
    matchLabels:
      app: pauhu-connector
  template:
    metadata:
      labels:
        app: pauhu-connector
    spec:
      containers:
      - name: connector
        image: pauhu/data:latest
        ports:
        - containerPort: 8003
        env:
        - name: PAUHU_API_KEY
          valueFrom:
            secretKeyRef:
              name: pauhu-secrets
              key: api-key
        - name: PAUHU_SOURCES
          value: "eurlex,ted,iate"
        volumeMounts:
        - name: data
          mountPath: /data
      volumes:
      - name: data
        persistentVolumeClaim:
          claimName: pauhu-data-pvc
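
The Deployment above refers to a Secret named pauhu-secrets (key api-key) and a PersistentVolumeClaim named pauhu-data-pvc, both of which must exist before the pod can start. A minimal sketch for creating them with kubectl follows. `create_pauhu_prereqs` is a hypothetical helper; the 50Gi request follows the storage prerequisite, while the ReadWriteOnce access mode and the cluster's default StorageClass are assumptions.

```shell
# Create the Secret and PersistentVolumeClaim that the Deployment refers to.
# create_pauhu_prereqs is a hypothetical helper; 50Gi follows the storage
# prerequisite, and the cluster's default StorageClass is assumed.
create_pauhu_prereqs() {
  api_key="$1"
  namespace="${2:-default}"
  # Secret name and key must match the secretKeyRef in the Deployment.
  kubectl create secret generic pauhu-secrets \
    --namespace "$namespace" \
    --from-literal=api-key="$api_key" || return 1
  # Claim name must match claimName in the Deployment's volumes section.
  kubectl apply --namespace "$namespace" -f - <<'EOF'
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: pauhu-data-pvc
spec:
  accessModes:
    - ReadWriteOnce
  resources:
    requests:
      storage: 50Gi
EOF
}
```

Run `create_pauhu_prereqs "pk_..."` once per namespace, then apply the Deployment manifest to the same namespace with `kubectl apply -f`.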

Support

Questions about deployment? Contact support@pauhu.eu or check the API documentation.
