Wiznote Data Extractor

English | 中文

A tool for extracting and saving notes from Wiznote (note.wiz.cn) as Markdown files with proper formatting.

Features

default.mp4

Extracts note title, content, tags, and last modified date
Converts HTML content to Markdown format
Downloads and saves embedded images locally
Properly formats code blocks
Preserves metadata in YAML frontmatter

Requirements

Python 3.6+
Chrome browser
ChromeDriver
Required Python packages (see Installation)

Installation

Clone this repository:

git clone https://github.com/yourusername/WiznoteExtractor.git
cd WiznoteExtractor

Install required Python packages:

pip install selenium beautifulsoup4 html2text requests

Install ChromeDriver:
- Download the ChromeDriver version that matches your Chrome browser from ChromeDriver official site
- For Windows:
  - Extract the downloaded zip file
  - Add the ChromeDriver location to your PATH environment variable, or
  - Place chromedriver.exe in a directory that's already in your PATH
- For macOS:
```
brew install chromedriver
```
- For Linux:
```
sudo apt install chromium-chromedriver
```
  or download the Linux version from the official site and add it to your PATH
Configure the user data directory:
- Open WiznoteDataExtractor.py and modify the user_data_dir path in the init_driver() function to a location on your system

Usage

Run the script:
```
python WiznoteDataExtractor.py
```
A Chrome browser window will open. You need to manually log in to your Wiznote account when prompted.
After logging in, navigate to the note you want to collect.
Press Enter in the terminal to collect the current page.
The script will extract the note content, convert it to Markdown, download any embedded images, and save everything to a Markdown file in the same directory as the script.
Continue navigating to other notes and press Enter to collect them, or type 'exit' to quit the program.

Notes

The script creates a user profile for Chrome to maintain login sessions.
Images are saved in an index_files folder adjacent to the Markdown files.
If image download fails using the browser method, it will fall back to using the requests library.

License

[Add your license information here]

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_CN.md		README_CN.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Wiznote Data Extractor

Features

Requirements

Installation

Usage

Notes

License

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

C4jiang/WiznoteDataExtractor

Folders and files

Latest commit

History

Repository files navigation

Wiznote Data Extractor

Features

Requirements

Installation

Usage

Notes

License

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages