You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CONTRIBUTING.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -24,7 +24,7 @@ Upon completion of each objective, GPT-4v will give an evaluation and determine
24
24
25
25
`evaluate.py` will print out if each test case `[PASSED]` or `[FAILED]`. In addition, a justification will be given on why the pass/fail was given.
26
26
27
-
It is **strongly**recommended that a screenshot of the `evaluate.py` output is included in any PR which could impact the performance of SOC.
27
+
It is recommended that a screenshot of the `evaluate.py` output is included in any PR which could impact the performance of SOC.
28
28
29
29
## Contribution Ideas
30
30
-**Improve performance by finding optimal screenshot grid**: A primary element of the framework is that it overlays a percentage grid on the screenshot which GPT-4v uses to estimate click locations. If someone is able to find the optimal grid and some evaluation metrics to confirm it is an improvement on the current method then we will merge that PR.
0 commit comments