1 minute read

Good evening everyone. A bit tired today. A lot of meeting. Even back to back meeting since yesterday. Also some stuff happening before I leave from work. Another request that apparently that i missed because my mistake.

Back 2 back meeting

Apparently I have another meeting today. But now its for my team. So its quite chill. The target is quite difficult tho. Some of the targets is quite a bit difficult.

That’s all for today, folks! Thanks for reading. I’ll catch you in the next post. Stay tuned for more updates, and don’t forget to follow me for more wholesome content!

DATA CLEANING TECHNIQUES

Data Filtering

Filtering involves removing irrelevant or unnecessary data from a dataset to reduce noise and focus on the most relevant information.

Data Deduplication

Data deduplication involves eliminating duplicate records from a dataset, ensuring that each record is unique.

Data Imputation

Data imputation entails replacing missing or null values with estimated values to maintain data integrity.

Data Standardization

Standardizing data involves putting all data into a common format to facilitate comparison and analysis.

Data Transformation

Data transformation involves modifying existing data to make it more suitable for analysis or modeling.

Outlier Detection

Outlier detection is the process of identifying and managing values that significantly deviate from the rest of the data, often by treating or removing them.

Data Validation

Data validation aims to check if data adheres to defined rules and constraints, identifying and correcting inconsistencies.

Data Encoding

Data encoding involves converting categorical data into a numerical format to make it compatible with machine learning algorithms.

Data Aggregation

Data aggregation entails grouping data by category, time period, or another criterion to obtain summarized statistics.

Data Sampling

Data sampling is the process of selecting a representative subset of data to expedite analysis while preserving data integrity.

Data Cleansing

Data cleansing is process that encompasses the application of multiple techniques to ensure data accuracy, completeness, and compliance with standards.

Data Profiling

Data profiling involves in-depth analysis of data to understand its structure, characteristics, and quality.

“Success doesn’t come from what you do occasionally, it comes from what you do consistently.”

– Marie Forleo

Categories:

Updated:

Leave a comment