Skip to content

Commit a9251d8

Browse files
authored
Merge branch 'OthersideAI:main' into installer-script
2 parents 45ba6e6 + 51d9993 commit a9251d8

File tree

4 files changed

+273
-65
lines changed

4 files changed

+273
-65
lines changed

.gitignore

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -165,7 +165,10 @@ cython_debug/
165165
screenshot.png
166166
screenshot_with_grid.png
167167
screenshot_with_labeled_grid.png
168+
screenshot_mini.png
169+
screenshot_mini_with_grid.png
168170
grid_screenshot.png
169171
grid_reflection_screenshot.png
170172
reflection_screenshot.png
171-
summary_screenshot.png
173+
summary_screenshot.png
174+
operate/screenshots/

CONTRIBUTING.md

Lines changed: 25 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,25 @@
1+
# Contributing
2+
We appreciate your contributions!
3+
4+
## Process
5+
1. Fork it
6+
2. Create your feature branch (`git checkout -b my-new-feature`)
7+
3. Commit your changes (`git commit -am 'Add some feature'`)
8+
4. Push to the branch (`git push origin my-new-feature`)
9+
5. Create new Pull Request
10+
11+
## Contribution Ideas
12+
- **Remove necessity for `pip install .`**: I think by uploading packages to PyPi we can reduce the installation code steps by consolidating `pip install -r requirements.txt` and `pip install .`. If that's possible that'd be great.
13+
- **Improve performance by finding optimal screenshot grid**: A primary element of the framework is that it overlays a percentage grid on the screenshot which GPT-4v uses to estimate click locations. If someone is able to find the optimal grid and some evaluation metrics to confirm it is an improvement on the current method then we will merge that PR.
14+
- **Improve the `SUMMARY_PROMPT`**
15+
- **Create an evaluation system**
16+
- **Improve Linux and Windows compatibility**: There are still some issues with Linux and Windows compatibility. PRs to fix the issues are encouraged.
17+
- **Enabling New Mouse Capabilities**: (drag, hover, etc.)
18+
- **Adding New Multimodal Models**: Integration of new multimodal models is welcomed. If you have a specific model in mind that you believe would be a valuable addition, please feel free to integrate it and submit a PR.
19+
- **Framework Architecture Improvements**: Think you can enhance the framework architecture described in the intro? We welcome suggestions and PRs.
20+
21+
## Guidelines
22+
This will primarily be a [Software 2.0](https://karpathy.medium.com/software-2-0-a64152b37c35) project. For this reason:
23+
24+
- Let's try to hold off refactors into separate files until `main.py` is more than 1000 lines
25+

README.md

Lines changed: 18 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -46,8 +46,15 @@ git clone https://github.com/OthersideAI/self-operating-computer.git
4646
cd self-operating-computer
4747
```
4848

49+
3. **Create a Python virtual environment with Poetry**.
50+
51+
```
52+
cat requirements.txt | xargs poetry add
53+
```
54+
4955
3. **Create a Python virtual environment**. [Learn more about Python virtual environment](https://docs.python.org/3/library/venv.html).
5056

57+
5158
```
5259
python3 -m venv venv
5360
```
@@ -82,13 +89,13 @@ operate
8289
<img src="https://github.com/OthersideAI/self-operating-computer/blob/main/readme/terminal-access-2.png" width="300" style="margin: 10px;"/>
8390
</div>
8491

85-
### Contributions are Welcomed! Some Ideas:
86-
- **Prompt Improvements**: Noticed any areas for prompt improvements? Feel free to make suggestions or submit a pull request (PR).
87-
- **Enabling New Mouse Capabilities** (drag, hover, etc.)
88-
- **Adding New Multimodal Models**: Integration of new multimodal models is welcomed. If you have a specific model in mind that you believe would be a valuable addition, please feel free to integrate it and submit a PR.
89-
- **Framework Architecture Improvements**: Think you can enhance the framework architecture described in the intro? We welcome suggestions and PRs.
92+
### Contributions are Welcomed!:
9093

91-
For any input on improving this project, feel free to reach out to [Josh](https://twitter.com/josh_bickett) on Twitter.
94+
If you want to contribute yourself, see [CONTRIBUTING.md](https://github.com/OthersideAI/self-operating-computer/blob/main/CONTRIBUTING.md).
95+
96+
### Feedback
97+
98+
For any input on improving this project, feel free to reach out to [Josh](https://twitter.com/josh_bickett) on Twitter.
9299

93100
### Follow HyperWriteAI for More Updates
94101

@@ -97,4 +104,8 @@ Stay updated with the latest developments:
97104
- Follow HyperWriteAI on [LinkedIn](https://www.linkedin.com/company/othersideai/).
98105

99106
### Compatibility
100-
- This project is compatible with Mac OS, Windows, and Linux (with X server installed).
107+
- This project is compatible with Mac OS, Windows, and Linux (with X server installed).
108+
109+
### Star History
110+
111+
[![Star History Chart](https://api.star-history.com/svg?repos=OthersideAI/self-operating-computer&type=Timeline)](https://star-history.com/#OthersideAI/self-operating-computer&Timeline)

0 commit comments

Comments
 (0)