General Purpose Pipeline Class

The Pipeline class can be used to construct data pipelines by registering functions as tasks and declaring the dependencies between them.

Example usage

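from pipeline.pipeline import Pipeline

pipeline = Pipeline()

# Register a task with no dependencies.
@pipeline.task()
def first_task(x):
    return x + 10

# Register a task that depends on first_task.
@pipeline.task(depends_on=first_task)
def second_task(x):
    return x * 2

# Register a task that depends on second_task.
@pipeline.task(depends_on=second_task)
def third_task(x):
    return x - 5

# Run the pipeline with 5 as its input and collect the results.
output = pipeline.run(5)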

Results

pipeline.run() returns a dictionary containing the result of each task function in the pipeline.
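To make the shape of that return value concrete, here is a minimal, self-contained sketch of how a decorator-based pipeline of this kind could work. It is not the code in this repository: SketchPipeline is a hypothetical stand-in, and it assumes that the dictionary is keyed by the task functions themselves, that a task with no dependency receives the value passed to run(), and that every other task receives the result of the task it depends on.

```python
class SketchPipeline:
    """Hypothetical stand-in for Pipeline; not the repository's implementation."""

    def __init__(self):
        self.tasks = []    # task functions, in registration order
        self.parent = {}   # task function -> the task it depends on (or None)

    def task(self, depends_on=None):
        """Return a decorator that registers a function as a task."""
        if depends_on is not None and depends_on not in self.parent:
            raise ValueError("depends_on must be an already registered task")

        def decorator(func):
            self.tasks.append(func)
            self.parent[func] = depends_on
            return func  # leave the function usable on its own

        return decorator

    def run(self, initial_input):
        """Run every task and return a dict mapping task function -> result."""
        results = {}
        # A dependency must be registered before its dependents, so
        # registration order is already a valid execution order.
        for func in self.tasks:
            dep = self.parent[func]
            # Assumption: root tasks get the run() argument, others get
            # the result of the task they depend on.
            arg = initial_input if dep is None else results[dep]
            results[func] = func(arg)
        return results


pipeline = SketchPipeline()


@pipeline.task()
def first_task(x):
    return x + 10


@pipeline.task(depends_on=first_task)
def second_task(x):
    return x * 2


@pipeline.task(depends_on=second_task)
def third_task(x):
    return x - 5


output = pipeline.run(5)
for func, result in output.items():
    print(func.__name__, result)
```

Under these assumptions the sketch produces 15 for first_task, 30 for second_task and 25 for third_task. The actual Pipeline class may key its results differently or pass inputs in another way, so check pipeline/pipeline.py before relying on this behavior.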

Examples of Pipelines

hn_top_keywords_pipeline.py - a pipeline that ingests JSON data, cleans it, analyzes it, and writes the top-keyword analysis results to a CSV file.
csv_to_postgres_pipeline.py - a pipeline that loads a CSV file into a staging table, applies some data transformations, and loads the data into a final Postgres table.
