Commit 34823ba

Update README.md
1 parent 94a016c commit 34823ba

1 file changed: +15 -9 lines changed

README.md

Lines changed: 15 additions & 9 deletions
@@ -28,11 +28,10 @@
 <li>
 <a href="#getting-started">Getting Started</a>
 <ul>
-<li><a href="#composition">File Composiiton</a></li>
-<li><a href="#installation">Installation</a></li>
+<li><a href="#dataset-structure">File Composiiton</a></li>
+<li><a href="#citation">Citation</a></li>
 </ul>
 </li>
-<li><a href="#usage">Usage</a></li>
 <li><a href="#contact">Contact</a></li>
 <li><a href="#acknowledgments">Acknowledgments</a></li>
 </ol>
@@ -62,29 +61,36 @@ Of course, there are limitations to this dataset as code classification by an LL
 <!-- GETTING STARTED -->
 ## Getting Started
 
-### Composition
+### Dataset Structure
 
 Here's a breakdown of the files in this dataset:
 * 976 total files
 * 666 files of original authors
 * 108 rewritten files using Bing GPT-4 (61 formatted, 47 non-formatted)
 * 202 rewritten files using ChatGPT-3.5 (59 formatted, 143 non-formatted)
 
-### Installation
-
-To download this dataset, simply download it as a zip file and extract it from this GitHub page.
 
 <p align="right">(<a href="#readme-top">back to top</a>)</p>
 
+## Citation
+If you use this dataset, please cite:
+
+```bibtex
+@misc{P24_Java,
+author = {Paek, Timothy},
+title = {GPT Java Dataset: A Dataset for LLM-Generated Code Detection},
+year = {2024},
+howpublished = {GitHub Repository},
+url = {https://github.com/tipaek/GPT-Java-Dataset}
+}
+```
 
 
 <!-- CONTACT -->
 ## Contact
 
 Timothy Paek - [Linked-In](https://www.linkedin.com/in/timothy-paek/) - tipaek@syr.edu
 
-Project Link: [https://github.com/tipaek/GPTJavaDataset](https://github.com/tipaek/GPTJavaDataset)
-
 <p align="right">(<a href="#readme-top">back to top</a>)</p>
 
 