Analyzing Bluesky data
note
To see video of this example
In this example, you'll build a pipeline with Dagster that:
- Ingestion of data-related Bluesky posts
- Modeling data using dbt
- Representing data in a dashboard with PowerBI
Prerequisites
To follow the steps in this guide, you'll need:
- Basic Python knowledge
- Python 3.9+ installed on your system. Refer to the Installation guide for information.
- Understanding of data pipelines and the extract, transform, and load process (ETL).
- Familiar with dbt and data transformation.
- Usage of BI tools for dashboards.
Step 1: Set up your Dagster environment
First, set up a new Dagster project.
- 
Clone the Dagster repo and navigate to the project: cd examples/docs_project/project_atproto_dashboard
- 
Install the required dependencies with uv:uv sync
- 
Activate the virtual environment: - MacOS
- Windows
 source .venv/bin/activate.venv\Scripts\activate
- 
Ensure the following environments have been populated in your .env file. Start by copying the template: cp .env.example .envAnd then populate the fields. 
Step 2: Launch the Dagster webserver
To make sure Dagster and its dependencies were installed correctly, navigate to the project root directory and start the Dagster webserver:
followed by a bash code snippet for
dg dev
Next steps
- Continue this example with ingestion