Added normalisation and unit test cases #118

sritha272 · 2024-11-24T11:37:05Z

Description

Implemented multiple data transformation functions (normalize, scale, clipping, exponential, standardize, z_score_normalize) to enhance the framework's data processing capabilities. These functions are equipped with comprehensive unit tests to ensure correctness and handle edge cases.
Breakdown of Each Implemented Function

Normalize
Scales data to a specified range [min_value, max_value].
Includes edge case handling for zero range and empty data.
Scale
Scales data by a specified multiplier.
Useful for linear scaling transformations.
Clipping
Clips data to fall within a specified range [min_value, max_value].
Prevents extreme outliers in datasets.
Exponential Transformation
Applies an exponential transformation to data with a specified base.
Handles exponential growth scenarios effectively.
Standardize
Standardizes data to have a mean of 0 and a standard deviation of 1.
Includes custom mean and standard deviation parameters.
Z-Score Normalize
Computes z-scores for data points for standardization.
Handles mixed positive and negative datasets effectively.

Related Issue

Motivation and Context

The newly added functions provide robust data normalization and transformation capabilities, which are critical for preparing data for machine learning, statistical analysis, and other computational tasks. These functions solve the problem of inconsistent data scaling and ensure uniform preprocessing pipelines.

How Has This Been Tested?

Unit Tests: Added unit tests for each function:
Verified outputs for standard, edge, and invalid inputs.
Tests include large numbers, small numbers, mixed data types, and edge cases like empty datasets or identical values.
Environment: Testing performed on:
Python 3.12
OS: Windows 11
Commands:
Ran python -m unittest discover tests to ensure all test cases passed successfully.
Validated compatibility with existing project components.

Screenshots (if appropriate):

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

My code follows the code style of this project.
My change requires a change to the documentation.
I have updated the documentation accordingly.
I have read the CONTRIBUTING document.
I have added tests to cover my changes.
All new and existing tests passed.

mikita-sakalouski · 2024-11-24T11:40:13Z

Can you make these functionality based on Transformation step and also put it to the correct module ?

mikita-sakalouski

See comments

dannymeijer · 2024-11-24T12:27:54Z

Thank you for your contribution @sritha272 , really appreciate it.

I do agree with Mikita about the placement of the modules that you chose for this one. Also, can you explain the intended use for this a bit more? Would this be for ML usescases with the input being something like pandas perhaps? Or did you have something else in mind.

I propose that we have a small meetup to discuss, as I would love to add your contribution to our library.

dannymeijer

See my earlier comment

dannymeijer · 2024-11-26T08:48:46Z

Please also see: #129

sritha272 added 2 commits November 24, 2024 03:02

Added normalisations and test cases

851e816

Added normalisations and unit test cases

8b5fbd6

sritha272 requested a review from a team as a code owner November 24, 2024 11:37

mikita-sakalouski changed the base branch from main to release/0.9 November 24, 2024 11:38

mikita-sakalouski added the enhancement New feature or request label Nov 24, 2024

mikita-sakalouski added this to the 0.9.0 milestone Nov 24, 2024

mikita-sakalouski requested changes Nov 24, 2024

View reviewed changes

mikita-sakalouski requested a review from dannymeijer November 24, 2024 11:40

dannymeijer requested changes Nov 24, 2024

View reviewed changes

dannymeijer added the blocked label Nov 25, 2024

dannymeijer removed this from the 0.9.0 milestone Nov 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added normalisation and unit test cases #118

Added normalisation and unit test cases #118

sritha272 commented Nov 24, 2024

mikita-sakalouski commented Nov 24, 2024

mikita-sakalouski left a comment

dannymeijer commented Nov 24, 2024

dannymeijer left a comment

dannymeijer commented Nov 26, 2024

Added normalisation and unit test cases #118

Are you sure you want to change the base?

Added normalisation and unit test cases #118

Conversation

sritha272 commented Nov 24, 2024

Description

Related Issue

Motivation and Context

How Has This Been Tested?

Screenshots (if appropriate):

Types of changes

Checklist:

mikita-sakalouski commented Nov 24, 2024

mikita-sakalouski left a comment

Choose a reason for hiding this comment

dannymeijer commented Nov 24, 2024

dannymeijer left a comment

Choose a reason for hiding this comment

dannymeijer commented Nov 26, 2024