AI Model Development Phases
Following #82 adding ai:ModelTraining, this issue collects additional concepts for AI development phases such as data collection, data preparation, data labelling, and model fine-tuning.
@DelaramGlp I have the below concepts from generic uses of these terms. I am thinking we can have ai:ModelLifecycle for models which is distinct from ai:LifecycleStage for an AI system.
| Data Collection | Collection of data for training or operating AI models |
|---|---|
| Data Annotation | Annotation of data for training or operating AI models |
| Data Preparation | Preparing data for training or operating AI models |
| Model Development | Processing which contributes to development of AI models |
| Model Training | Processing of data to train an AI model |
| Model Fine-Tuning | Processing of data to fine-tune an AI model |
I think what we have called "lifecycle stages" should instead be called "development phases" to keep it distinct from the ISO and legally defined lifecycle stages. It should also be aligned with the rest of DPV concepts, similar to how we have ai:Training as a type of dpv:Processing.
To support this, we should have ai:ModelDevelopmentPhase as a subclass of dpv:Process, and indicate specific processing activities involved in each stage:
ai:DataCollectionPhase--> involvesdpv:Collectprocessing operationai:DataAnnotationPhase--> involvesdpv:Annotate(new, parent:dpv:Generate) processing operationai:ModelDevelopmentPhase--> involvesai:Techniqueto produce aai:Modelthat will be trained on data in the next phaseai:ModelTrainingPhase--> involvesai:TrainingDataandai:Modelas input and producesai:TrainedModel- Testing and validation follow similarly from respective data categories and input/output models.
In addition to the above, ISO/IEC 5338 AI system life cycle processes also has specific "phases" (though mentioned as stages) in between the lifecycle stages that should be added to the above.