📄️ Data Component
The Data Component is used to load Hugging Face dataset into the pipeline, serving as the basis for generating synthetic data samples or preparing data for subsequent evaluation models/fine-tuning models.
📄️ LLM Component
Process the data accessed by this component according to the description of the prompt and structure it into the required format for use by subsequent components.
📄️ Filter Component
The Filter Component is designed to filter input data based on the value of a specific field, retaining only the data that meets the defined conditions.
📄️ Dedup Component
The Dedup Component is designed to identify and remove duplicate records in a dataset, ensuring data uniqueness and accuracy.