Installing Data Quality & Observability Classic on self-hosted Spark Standalone

Data Quality & Observability Classic can be installed and operated on a single standalone host, which is useful when large-scale, high-concurrency checks are not required. In this mode, Data Quality & Observability Classic uses a Spark Standalone pseudo-cluster in which the master and workers run on, and draw resources from, the same server.

The DQ Standalone application consists of the following components:

  • DQ Web
  • DQ Agent
  • DQ Metastore (PostgreSQL database)
  • Spark (Pullup only)

Fig 1: Architecture overview of Full Standalone Installation mode

During installation, you can either include the PostgreSQL metastore on the same host or connect to an external PostgreSQL metastore (recommended).
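As a rough illustration of the layout described above, the following Python sketch models the network-facing components of a standalone host and checks whether each one accepts TCP connections. It is not part of the product. The ports are assumptions: 5432 and 7077 are the stock defaults of PostgreSQL and the Spark standalone master, the DQ Web port of 9000 is hypothetical, and the helper names (`Component`, `standalone_components`, `is_listening`) are invented for this example. DQ Agent is left out because this page does not say whether it exposes a port.

```python
import socket
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Component:
    """One network-facing piece of a standalone installation."""
    name: str
    host: str
    port: int


def standalone_components(
    host: str = "localhost",
    external_metastore_host: Optional[str] = None,
) -> List[Component]:
    """Describe where each component is expected to listen.

    With no external metastore, everything lives on the same host.
    All ports below are assumptions; adjust them to your installation.
    """
    metastore_host = external_metastore_host or host
    return [
        Component("DQ Web", host, 9000),  # hypothetical port
        Component("DQ Metastore (PostgreSQL)", metastore_host, 5432),  # PostgreSQL default
        Component("Spark master", host, 7077),  # Spark standalone default
    ]


def is_listening(component: Component, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to the component succeeds."""
    try:
        with socket.create_connection(
            (component.host, component.port), timeout=timeout
        ):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    for c in standalone_components():
        state = "reachable" if is_listening(c) else "not reachable"
        print(f"{c.name} at {c.host}:{c.port}: {state}")
```

Passing `external_metastore_host` models the recommended setup, where only the metastore moves to another server while DQ Web and Spark stay on the standalone host.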

What's next

Before you install on self-hosted Spark Standalone