Jump to content

Draft:Gretel AI

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Mckornfield (talk | contribs) at 19:31, 9 December 2024 (Submitting using AfC-submit-wizard). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Gretel
FoundedJan 2020; 4 years ago (Jan 2020)
HeadquartersSan Diego, California, US
Area servedGlobal
Founder(s)
  • Ali Golshan
  • Alexander Watson
  • John Myers
CEOAli Golshan[1]
IndustrySoftware
Employees50-100[2]
URLgretel.ai
Developer(s)Gretel Labs
Initial releaseMarch 31, 2020; 4 years ago (2020-03-31)
Written inPython
PlatformAmazon Web Services, Microsoft Azure, Google Cloud Platform
LicenseSDK - Apache 2.0, Synthetics - Source-available software
Websitehttps://gretel.ai

Gretel (also known as Gretel Labs or Gretel AI) is a software startup focused around the idea of high quality and private Synthetic data generation. Its primary focus is on generating textual, JSON or tabular data. It accomplishes this using a mix of privacy preservation tools (transformations, differential privacy) in concert with data generation tools (Large language models, Generative adversarial networks and Fine-tuning (deep learning)).

Gretel's quality enforcement is accomplished by performing quality checks during data generation, thereby reducing the amount of low quality data in the final dataset. This also applies with privacy checks that can occur during data generation.


Gretel's Open Source Datasets

Gretel has released a set of open source datasets (licensed under Apache 2.0 on Hugging Face.[3]

These datasets reflect what can be created using Gretel itself, as well as to allow for use in training models, creating tools, or building other sorts of tools.

Gretel in Research

Gretel's synthetics offering and platform have been referenced in a few research/comparison articles. Examples include:

  • Comparison of Synthetic Data Generation Tools Using Internet of Things Data[4]
  • Gretel.ai: Open-Source Artificial Intelligence Tool To Generate New Synthetic Data_[5]
  • Experiments in Reducing NLP Bias and Identifiability for Large LMs[6]

References

  1. ^ "Ali Golshan". Open Data Science Conference. 9 December 2024. Retrieved 2024-12-09.{{cite news}}: CS1 maint: url-status (link)
  2. ^ "About Us (Gretel)". Gretel AI. Archived from the original on 25 November 2024. Retrieved 9 December 2024.
  3. ^ "gretelai (Gretel.ai)". Hugging Face. Archived from the original on 26 November 2024. Retrieved 9 December 2024.
  4. ^ M, Gayathri Hegde and Shenoy, P Deepa and R, Venugopal K (2022). "Performance Analysis of Real and Synthetic Data using Supervised ML Algorithms for Prediction of Chronic Kidney Disease}". 2022 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT): 1–6. doi:10.1109/CONECCT55679.2022.9865722.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  5. ^ Noruzman A, Ghani N, Zulkifli N (2021). "Gretel.ai: Open-Source Artificial Intelligence Tool To Generate New Synthetic Data". MALAYSIAN JOURNAL OF INNOVATION IN ENGINEERING AND APPLIED SOCIAL SCIENCES. 1 (1). Retrieved 9 December 2024.
  6. ^ Herrera J, Bernal D. "Experiments in Reducing NLP Bias and Identifiability for Large LMs". TheEyeCorpus.