All Tags
AWS
algorithm-design
architecture
cloud-principles
cost-reduction
data-centric
data-compression
data-processing
deployment
design
edge-computing
energy-footprint
hardware
libraries
locality
machine-learning
management
measured
migration
model-optimization
model-training
performance
queries
rebuilding
scaling
services
strategies
template
workloads
Tactic: Remove Redundant Data
Tactic sort:
Awesome Tactic
Type: Architectural Tactic
Category: green-ml-enabled-systems
Title
Remove Redundant Data
Description
Identifying and removing redundant data for ML models reduces computing time, number of computations, energy consumption, and memory space. Redundant data refers to data points that do not contribute significantly to improving the accuracy of the model. Thus, removing these unimportant datapoints does not sacrifice much accuracy (Dhabe et al. 2021)
Participant
Data Scientist
Related software artifact
Data
Context
Machine Learning
Software feature
< unknown >
Tactic intent
Enhance energy efficiency by detecting and removing redundant data to reduce the size of input data
Target quality attribute
Energy Efficiency
Other related quality attributes
Accuracy, Data Representativeness
Measured impact
Removing redundant data from the dataset leads to a smaller input data that further decreases computation, computational time, energy consumption, and memory space