How to convert spreadsheets to your JSON format? #3

Open
eloukas opened this issue Nov 5, 2021 · 2 comments
Comments

@eloukas

eloukas commented Nov 5, 2021

Hi and thanks for uploading your code repo.

How can someone preprocess their spreadsheet and generate a JSON for it according to your format?
https://github.com/microsoft/TUTA_table_understanding/blob/main/data/pretrain/spreadsheet/spreadsheet-sample.json

@eloukas eloukas changed the title How to convert spreadsheets to your JSON format How to convert spreadsheets to your JSON format? Nov 5, 2021
@HaoAreYuDong
Contributor

For spreadsheet tables, table detection is needed. You can train a table detection model using https://github.com/microsoft/TableSense. You can also watch this repo and we will publish the code in the near future.
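As a rough illustration of what could follow table detection, here is a minimal sketch that crops a detected table region out of a sheet and serializes it to JSON. It assumes the detection step (for example a TableSense-style model) has already produced the table bounds. The function name `table_to_json` and the keys `range`, `num_rows`, `num_cols`, and `cells` are hypothetical placeholders, not the schema used in spreadsheet-sample.json, so the output would still need to be mapped onto the fields that file actually contains.

```python
import json


def table_to_json(grid, top, left, bottom, right):
    """Crop a detected table region out of a sheet grid and serialize it.

    grid: list of rows, each a list of cell values (None for empty cells).
    top, left, bottom, right: inclusive 0-based bounds of the table,
    assumed to come from a separate table detection step.
    """
    rows = []
    for r in range(top, bottom + 1):
        row = grid[r] if r < len(grid) else []
        cells = []
        for c in range(left, right + 1):
            value = row[c] if c < len(row) else None
            # Store every cell as text; empty cells become empty strings.
            cells.append("" if value is None else str(value))
        rows.append(cells)

    # Placeholder keys: NOT the schema of spreadsheet-sample.json.
    record = {
        "range": {"top": top, "left": left, "bottom": bottom, "right": right},
        "num_rows": len(rows),
        "num_cols": right - left + 1,
        "cells": rows,
    }
    return json.dumps(record, ensure_ascii=False)


# A toy sheet: a title row, a blank row, then a 2x3 table.
sheet = [
    ["Report", None, None],
    [None, None, None],
    ["Region", "Q1", "Q2"],
    ["North", 10, 12],
]
print(table_to_json(sheet, top=2, left=0, bottom=3, right=2))
```

The sketch keeps detection and serialization separate on purpose, so the bounds can come from any detector and the placeholder keys can be replaced once the target format is known.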

@izavits

izavits commented Nov 30, 2021

Hi, and thanks for sharing this work!
Is this spreadsheet data available somewhere? If not, could you upload a few files, both the originals and their processed (JSON) versions? This would help in understanding how to produce the JSON. (I have also checked spreadsheet-sample.json, but some additional samples would help.)
Thanks
