Jump to content

Draft:Gretel AI

From Wikipedia, the free encyclopedia
  • Comment: The sources cited in this article are either connected to the subject (Gretel's own website), trivial passing mentions, or about other topics entirely. We need significant coverage about the company in secondary, reliable sources to have an article. If such sources can be found the article can be re-created. WeirdNAnnoyed (talk) 13:54, 15 December 2024 (UTC)

Gretel
FoundedJan 2020; 4 years ago (Jan 2020)
HeadquartersSan Diego, California, US
Area servedGlobal
Founder(s)
  • Ali Golshan
  • Alexander Watson
  • John Myers
CEOAli Golshan[1]
IndustrySoftware
Employees50-100[2]
URLgretel.ai
Developer(s)Gretel Labs
Initial releaseMarch 31, 2020; 4 years ago (2020-03-31)
Written inPython
PlatformAmazon Web Services, Microsoft Azure, Google Cloud Platform
LicenseSDK - Apache 2.0, Synthetics - Source-available software
Websitehttps://gretel.ai

Gretel (also known as Gretel Labs or Gretel AI) is a software startup focused around creating high quality and private Synthetic data. Its primary focus is on generating textual, JSON or tabular data. It accomplishes this using a mix of privacy preservation tools (transformations, differential privacy) in concert with data generation tools (Large language models, and custom Fine-tuning (deep learning)).

Gretel's quality enforcement is accomplished by performing quality checks during data generation, thereby reducing the amount of low quality data in the final dataset.

This type of enforcement can also apply to privacy concerns, by using privacy filters or introducing appropriate levels of noise during data generation.

Gretel's Open Source Datasets

[edit]

Gretel has released a set of open source datasets (licensed under Apache 2.0) on Hugging Face.[3]

These datasets reflect what can be created using Gretel itself, as well as to allow for use in training models, creating tools, or building other sorts of tools.

Gretel in Research

[edit]

Gretel's synthetics offering and platform have been referenced in a few research/comparison articles. Examples include:

  • Comparison of Synthetic Data Generation Tools Using Internet of Things Data[4]
  • Gretel.ai: Open-Source Artificial Intelligence Tool To Generate New Synthetic Data[5]
  • Experiments in Reducing NLP Bias and Identifiability for Large LMs[6]

References

[edit]
  1. ^ "Ali Golshan". Open Data Science Conference. 9 December 2024. Retrieved 2024-12-09.{{cite news}}: CS1 maint: url-status (link)
  2. ^ "About Us (Gretel)". Gretel AI. Archived from the original on 25 November 2024. Retrieved 9 December 2024.
  3. ^ "gretelai (Gretel.ai)". Hugging Face. Archived from the original on 26 November 2024. Retrieved 9 December 2024.
  4. ^ M, Gayathri Hegde and Shenoy, P Deepa and R, Venugopal K (2022). "Performance Analysis of Real and Synthetic Data using Supervised ML Algorithms for Prediction of Chronic Kidney Disease}". 2022 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT): 1–6. doi:10.1109/CONECCT55679.2022.9865722.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  5. ^ Noruzman A, Ghani N, Zulkifli N (2021). "Gretel.ai: Open-Source Artificial Intelligence Tool To Generate New Synthetic Data". MALAYSIAN JOURNAL OF INNOVATION IN ENGINEERING AND APPLIED SOCIAL SCIENCES. 1 (1). Retrieved 9 December 2024.
  6. ^ Herrera J, Bernal D. "Experiments in Reducing NLP Bias and Identifiability for Large LMs". TheEyeCorpus.