Volve Database Logo

VOLVE DATABASE

Equinor Volve Field Dataset · Parquet Format

Download Parquet Files

High-performance columnar storage · Ready for analysis · Compressed & optimized

About This Dataset

This repository provides the Equinor Volve field data in Parquet format, converted from the original dataset. The Volve field was Norway's first fully disclosed oil field dataset, offering real-world data for data engineering and analysis workflows.

The data has been structured into three normalized tables: daily metrics, monthly aggregations, and well metadata. Perfect for learning SQL, data engineering, or building analytical pipelines.

Quick Start with DuckDB

Query the Parquet files directly without loading into a database. Here's a simple example using DuckDB:

import duckdb

# Connect to DuckDB (in-memory)
con = duckdb.connect()

# Query daily production data
result = con.execute("""
    SELECT
        w.wellbore_name,
        SUM(d.oil_volume) as total_oil,
        SUM(d.gas_volume) as total_gas,
        SUM(d.water_volume) as total_water
    FROM 'daily_production.parquet' d
    JOIN 'wells.parquet' w
        ON d.npd_wellbore_code = w.npd_wellbore_code
    WHERE d.date BETWEEN '2008-01-01' AND '2008-12-31'
    GROUP BY w.wellbore_name
    ORDER BY total_oil DESC
    LIMIT 10
""").fetchall()

for row in result:
    print(row)

No database setup required. DuckDB reads Parquet files directly and efficiently, making it perfect for exploratory analysis and prototyping.

Database Schema

The dataset consists of three interconnected tables tracking well production metrics at different time granularities.

wells

Well metadata and facility information

npd_wellbore_code
wellbore_code
wellbore_name
npd_field_code
npd_field_name
npd_facility_code
npd_facility_name

daily_production

Daily well production metrics and parameters

date
npd_wellbore_code
on_stream_hours
avg_downhole_pressure
avg_dp_tubing
avg_annulus_pressure
avg_wellhead_pressure
avg_downhole_temperature
avg_wellhead_temperature
avg_choke_size_percent
avg_choke_unit
dp_choke_size
oil_volume
gas_volume
water_volume
water_injection_volume
flow_kind
well_type
→ npd_wellbore_code references wells.npd_wellbore_code

monthly_production

Aggregated monthly production volumes

date
npd_wellbore_code
on_stream_hours
oil_volume_sm3
gas_volume_sm3
water_volume_sm3
gas_injection_sm3
water_injection_sm3
→ npd_wellbore_code references wells.npd_wellbore_code