Skip to content

Commit eeae6a3

Browse files
Update README.md
1 parent 6c99616 commit eeae6a3

File tree

1 file changed

+7
-2
lines changed

1 file changed

+7
-2
lines changed

README.md

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,11 @@ pip install schema-matching
3434
```
3535
from schema_matching import schema_matching
3636
37-
df_pred,df_pred_labels,predicted_pairs = schema_matching("Test Data/authors")
37+
df_pred,df_pred_labels,predicted_pairs = schema_matching("Test Data/QA/Table1.json","Test Data/QA/Table2.json")
38+
print(df_pred)
39+
print(df_pred_labels)
40+
for pair_tuple in predicted_pairs:
41+
print(pair_tuple)
3842
```
3943

4044
#### Return:
@@ -43,7 +47,8 @@ df_pred,df_pred_labels,predicted_pairs = schema_matching("Test Data/authors")
4347
- predicted_pairs: Predict label == 1 column pairs, in tuple format.
4448

4549
#### Parameters:
46-
- pth: Path to test data folder, must contain **"Table1.csv" and "Table2.csv" or "Table1.json" and "Table2.json"**.
50+
- table1_pth: Path to your first **csv, json or jsonl file**.
51+
- table2_pth: Path to your second **csv, json or jsonl file**.
4752
- threshold: Threshold, you can use this parameter to specify threshold value, suggest 0.9 for easy matching(column name very similar). Default value is calculated from training data, which is around 0.15-0.2. This value is used for difficult matching(column name masked or very different).
4853
- strategy: Strategy, there are three options: "one-to-one", "one-to-many" and "many-to-many". "one-to-one" means that one column can only be matched to one column. "one-to-many" means that columns in Table1 can only be matched to one column in Table2. "many-to-many" means that there is no restrictions. Default is "many-to-many".
4954
- model_pth: Path to trained model folder, which must contain at least one pair of ".model" file and ".threshold" file. You don't need to specify this parameter.

0 commit comments

Comments
 (0)