[DOC]: Can you provide the code that used Python faker to make the Azure DFP datasets? #1410

Closed
2 tasks done
nyck33 opened this issue Dec 6, 2023 · 2 comments
Labels
doc Improvements or additions to documentation external This issue was filed by someone outside of the Morpheus team

Comments

nyck33 commented Dec 6, 2023

How would you describe the priority of this documentation request

Low (would be nice)

Describe the future/missing documentation

The notebook dfp-feature-selection-demo.ipynb says: "This notebook includes an introduction to DFP, the suggested steps for DFP feature selection, and the code segments demonstrating how each step can be done in practice.

Note. This notebook uses Azure AD logs as an example but the same process can be applied to any data sources.

Disclaimer. The data was generated using the python faker package. If there is any resemblance to real individuals, it is purely coincidental."

I want to generate more data to train on before I deploy to my own Azure cloud, with a web app connected to Azure OpenAI services and Snowflake.

Where have you looked?

I did Shift-Ctrl-F in VS Code for "faker" and only see it there and in cuda11.8_dev.yml

Code of Conduct

  • I agree to follow this project's Code of Conduct
  • I have searched the open documentation issues and have found no duplicates for this bug report
@nyck33 nyck33 added the doc Improvements or additions to documentation label Dec 6, 2023
@jarmak-nv jarmak-nv added Needs Triage Need team to review and classify external This issue was filed by someone outside of the Morpheus team labels Dec 6, 2023
@jarmak-nv
Contributor

Hi @nyck33!

Thanks for submitting this issue - our team has been notified and we'll get back to you as soon as we can!
In the meantime, feel free to add any relevant information to this issue.

@efajardo-nv efajardo-nv removed the Needs Triage Need team to review and classify label Aug 16, 2024
@efajardo-nv
Contributor

Sorry @nyck33. The code is not available, but it should not be too difficult to reproduce using faker.

Here's more info about the dataset:
https://github.com/nv-morpheus/Morpheus/blob/branch-24.10/models/datasets/README.md#dfp-azure-logs

Steps to download it from S3:
https://github.com/nv-morpheus/Morpheus/blob/branch-24.10/examples/digital_fingerprinting/production/README.md#steps-to-run-example-pipeline
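
For anyone trying to reproduce similar data, here is a minimal sketch of what a faker-based generator could look like. It is not the code used to build the Morpheus dataset. The field names are assumptions loosely modeled on Azure AD sign-in log fields, so check them against the dataset README linked above. The value pools, the 5% "unusual sign-in" rate, and the names `generate_signins` and `_StandInFaker` are hypothetical. The stand-in class exists only so the sketch still runs when faker is not installed.

```python
import json
import random
from datetime import datetime, timedelta, timezone

try:
    from faker import Faker  # third-party package: pip install faker
except ImportError:
    Faker = None  # fall back to the stdlib stand-in below


class _StandInFaker:
    """Tiny stdlib substitute exposing only the methods this sketch calls."""

    def __init__(self):
        self._rng = random.Random()

    def seed_instance(self, seed):
        self._rng.seed(seed)

    def email(self):
        return f"user{self._rng.randrange(10**6)}@example.com"

    def city(self):
        return self._rng.choice(["Springfield", "Riverton", "Lakeside"])

    def country(self):
        return self._rng.choice(["Canada", "Japan", "Germany"])

    def ipv4(self):
        return ".".join(str(self._rng.randrange(256)) for _ in range(4))


# Hypothetical value pools; swap in values that match your own tenant.
APPS = ["Outlook", "Teams", "SharePoint", "Azure Portal"]
CLIENTS = ["Browser", "Mobile Apps and Desktop clients"]
BROWSERS = ["Chrome", "Edge", "Firefox", "Safari"]


def generate_signins(n_users=5, n_events=100, seed=42):
    """Return n_events fake sign-in records (dicts) sorted by time."""
    fake = Faker() if Faker is not None else _StandInFaker()
    fake.seed_instance(seed)  # reproducible fake values
    rng = random.Random(seed)

    # Give each user a stable "home" profile so per-user behaviour is learnable.
    users = [
        {
            "userPrincipalName": fake.email(),
            "city": fake.city(),
            "country": fake.country(),
            "ip": fake.ipv4(),
            "app": rng.choice(APPS),
            "browser": rng.choice(BROWSERS),
        }
        for _ in range(n_users)
    ]

    start = datetime(2023, 1, 1, tzinfo=timezone.utc)
    events = []
    for _ in range(n_events):
        user = rng.choice(users)
        unusual = rng.random() < 0.05  # assumed rate of off-profile sign-ins
        when = start + timedelta(seconds=rng.randrange(30 * 24 * 3600))
        events.append({
            "time": when.isoformat(),
            "userPrincipalName": user["userPrincipalName"],
            "appDisplayName": rng.choice(APPS) if unusual else user["app"],
            "clientAppUsed": rng.choice(CLIENTS),
            "deviceDetailbrowser": user["browser"],
            "locationcity": fake.city() if unusual else user["city"],
            "locationcountryOrRegion": fake.country() if unusual else user["country"],
            "ipAddress": fake.ipv4() if unusual else user["ip"],
        })
    events.sort(key=lambda e: e["time"])
    return events


if __name__ == "__main__":
    # Print a few records as JSON lines.
    for event in generate_signins(n_users=3, n_events=5):
        print(json.dumps(event))
```

Seeding both faker and the `random.Random` instance keeps runs reproducible, which helps when comparing training runs on regenerated data.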

@github-project-automation github-project-automation bot moved this from Todo to Done in Morpheus Boards Aug 16, 2024