You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Ioannis and Joshi had a discussion with BEIS regarding the dataset based on the National Energy Efficiency Data-Framework (NEED).
A sample of the aforementioned dataset was provided by the NEED team and the Synthetic data team decided to explore the dataset and investigate whether it is suitable as a test dataset for the Synthetic data generation platform developed at the Data Science Campus.
The text was updated successfully, but these errors were encountered:
The dataset contains both categorical and numerical variables. However, the vast majority of variables are numerical and the categorical variables can be potentially encoded with methods such as 'label encoding' or one 'hot encoding'.
Additionally, the dataset is of good size with about 50K samples and 46 variables.
Therefore, the synthetic data team decided to process the dataset with the Synthetic data generation software developed at the Data Science Campus.
Ioannis and Joshi had a discussion with BEIS regarding the dataset based on the National Energy Efficiency Data-Framework (NEED).
A sample of the aforementioned dataset was provided by the NEED team and the Synthetic data team decided to explore the dataset and investigate whether it is suitable as a test dataset for the Synthetic data generation platform developed at the Data Science Campus.
The text was updated successfully, but these errors were encountered: