Synapse Framework

Overview

Synapse is a highly efficient, pluggable, open-source crawling and scraping framework for both local and distributed workloads.

There are two integration paths, depending on the level of control you need:

High-Level API: Built for standard crawling workloads. Extend it with built-in plugins and get moving immediately, without fiddling with the underlying mechanics. [TODO]
Low-Level API: For architecting custom scrapers and crawlers with (sub)component-level control. Extend it with your own implementations. [WIP]
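
To make the idea of component-level control concrete, here is a minimal sketch of how swappable components could be wired together. It is not Synapse's actual API: the `Fetcher` and `Extractor` interfaces, the `Crawl` function, and the in-memory stand-ins are all hypothetical names chosen for illustration, and the fetcher reads from a map rather than the network.

```go
package main

import (
	"errors"
	"fmt"
	"strings"
)

// Fetcher retrieves the raw body for a URL.
// Hypothetical interface for illustration; not Synapse's real API.
type Fetcher interface {
	Fetch(url string) (string, error)
}

// Extractor pulls outgoing links out of a fetched body.
// Hypothetical interface for illustration; not Synapse's real API.
type Extractor interface {
	Links(body string) []string
}

// mapFetcher is an in-memory stand-in for an HTTP fetcher.
type mapFetcher map[string]string

func (m mapFetcher) Fetch(url string) (string, error) {
	body, ok := m[url]
	if !ok {
		return "", errors.New("not found: " + url)
	}
	return body, nil
}

// fieldExtractor treats every whitespace-separated token that
// starts with "/" as a link.
type fieldExtractor struct{}

func (fieldExtractor) Links(body string) []string {
	var links []string
	for _, tok := range strings.Fields(body) {
		if strings.HasPrefix(tok, "/") {
			links = append(links, tok)
		}
	}
	return links
}

// Crawl walks breadth-first from seed, visiting each URL at most once,
// and returns the successfully fetched URLs in visit order.
func Crawl(seed string, f Fetcher, e Extractor) []string {
	seen := map[string]bool{seed: true}
	queue := []string{seed}
	var visited []string
	for len(queue) > 0 {
		url := queue[0]
		queue = queue[1:]
		body, err := f.Fetch(url)
		if err != nil {
			continue // skip pages that cannot be fetched
		}
		visited = append(visited, url)
		for _, link := range e.Links(body) {
			if !seen[link] {
				seen[link] = true
				queue = append(queue, link)
			}
		}
	}
	return visited
}

func main() {
	site := mapFetcher{
		"/":  "home /a /b",
		"/a": "page a /b /missing",
		"/b": "page b /",
	}
	fmt.Println(Crawl("/", site, fieldExtractor{})) // prints [/ /a /b]
}
```

Because `Crawl` depends only on the two interfaces, either component can be replaced with a custom implementation without touching the crawl loop.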

Status

The distributed architecture is [WIP]; at this point it amounts to experimenting with a distributed state machine. The project is currently in an experimental phase, so expect breaking changes as the architecture evolves.

Documentation

Effort is currently focused on solid core abstractions rather than polished public documentation. Implementation-specific details are available within each component's directory for developers diving into the internals.

Development

Contributions are welcome!

Start by checking the contribution guidelines.
If you have any questions, ask in Discussions.

Why this naming?

In neurobiology, a synapse is the junction for signal transmission between neurons. This framework serves as the interface between the web and application-specific logic, decoupling data acquisition from downstream processing.

Ethical Considerations

Synapse is not intended for any malicious or unethical web scraping or crawling activities. Please ensure you comply with each website's robots.txt directives and terms of service (ToS) before crawling or scraping.
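
As a rough illustration of what a robots.txt check can look like, the sketch below tests a path against the `Disallow` rules that apply to all user agents. It is deliberately simplified and is not part of Synapse: the function names are hypothetical, and it ignores `Allow` rules, wildcards, and agent-specific groups, so a production crawler should use a complete parser.

```go
package main

import (
	"bufio"
	"fmt"
	"strings"
)

// disallowedPrefixes collects the Disallow paths from groups that
// target every agent ("User-agent: *"). Hypothetical helper.
func disallowedPrefixes(robots string) []string {
	var prefixes []string
	applies := false  // does the current group target "*"?
	inAgents := false // are we inside a run of User-agent lines?
	sc := bufio.NewScanner(strings.NewReader(robots))
	for sc.Scan() {
		line := sc.Text()
		if i := strings.Index(line, "#"); i >= 0 {
			line = line[:i] // drop comments
		}
		key, val, ok := strings.Cut(line, ":")
		if !ok {
			continue // blank or malformed line
		}
		key = strings.ToLower(strings.TrimSpace(key))
		val = strings.TrimSpace(val)
		switch key {
		case "user-agent":
			if !inAgents {
				applies = false // a new group starts here
			}
			inAgents = true
			if val == "*" {
				applies = true
			}
		case "disallow":
			inAgents = false
			if applies && val != "" {
				prefixes = append(prefixes, val)
			}
		default:
			inAgents = false
		}
	}
	return prefixes
}

// Allowed reports whether path is free of any matching Disallow prefix.
func Allowed(robots, path string) bool {
	for _, p := range disallowedPrefixes(robots) {
		if strings.HasPrefix(path, p) {
			return false
		}
	}
	return true
}

func main() {
	const robots = `# example file
User-agent: *
Disallow: /private/
Disallow: /tmp

User-agent: otherbot
Disallow: /
`
	fmt.Println(Allowed(robots, "/index.html"))   // true
	fmt.Println(Allowed(robots, "/private/data")) // false
}
```

Passing this check does not replace reading a site's terms of service; it only covers the machine-readable part.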

Repository: vyrelabs/synapse
