Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Implement Source2Synth for data generation #1218

Open
1 of 2 tasks
Wendong-Fan opened this issue Nov 25, 2024 · 4 comments · May be fixed by #1289
Open
1 of 2 tasks

[Feature Request] Implement Source2Synth for data generation #1218

Wendong-Fan opened this issue Nov 25, 2024 · 4 comments · May be fixed by #1289
Assignees
Labels
Data Related to camel data processing research Task related to research
Milestone

Comments

@Wendong-Fan
Copy link
Member

Required prerequisites

Motivation

Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources

https://arxiv.org/html/2409.08239v1

Solution

No response

Alternatives

No response

Additional context

No response

@Wendong-Fan Wendong-Fan added Data Related to camel data processing research Task related to research labels Nov 25, 2024
@Wendong-Fan Wendong-Fan added this to the Sprint 17.5 milestone Nov 25, 2024
@zjrwtx
Copy link
Collaborator

zjrwtx commented Nov 27, 2024

Do the preliminary experiment, and opensource repository: https://github.com/zjrwtx/Source2Synth_refine

the experiment is based on the open source library improvements and increase more functions: https://github.com/sanowl/Source2Synth

still doing the experiment,but some test result like this:

image

@zjrwtx
Copy link
Collaborator

zjrwtx commented Nov 28, 2024

@zjrwtx
Copy link
Collaborator

zjrwtx commented Nov 28, 2024

image

@CaelumF
Copy link
Collaborator

CaelumF commented Nov 28, 2024

I'll work on moving https://github.com/zjrwtx/Source2Synth_refactor into CAMEL.

  • Update dependencies (python 3.8, hopefully there isn't any dependency hell)
  • Define control interfaces
  • Define output interfaces
  • Use passed in camel agents or from passed in config (I appreciate the inference is already done using agents!)
  • Code standards (english, formatting etc.)

@Wendong-Fan Wendong-Fan linked a pull request Dec 9, 2024 that will close this issue
9 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Data Related to camel data processing research Task related to research
Projects
Status: No status
Development

Successfully merging a pull request may close this issue.

3 participants