All Tags
AWS
ai
algorithm-design
architecture
browser
cloud
cloud-efficiency
cloud-principles
cost-reduction
data-centric
data-compression
data-processing
deployment
design
documentation
edge-computing
email-sharing
energy-efficiency
energy-footprint
enterprise-optimization
green-ai
hardware
libraries
llm
locality
machine-learning
maintainability
management
measured
microservices
migration
mobile
model-optimization
model-training
multi-objective
network-traffic
parameter-tuning
performance
queries
rebuilding
scaling
services
storage-optimization
strategies
tabs
template
testing
workloads
Tactic: RAG Pipeline Parallelism
Tactic sort:
Awesome Tactic
Type: Architectural Tactic
Category: green-ml-enabled-systems
Title
RAG Pipeline Parallelism
Description
Pipeline Parallelism executes different stages of retrieval, encoding, and generation concurrently, rather than sequentially. This reduces idle computation time, leading to higher energy efficiency. PipeRAG has been proposed.
Participant
AI and RAG Practitioners.
Related software artifact
RAG-Based Systems.
Context
RAG. Unsustainable RAG. Green AI.
Software feature
PipeRAG.
Tactic intent
Environmentally Sustainable RAG and through energy efficiency and reduction of computational waste.
Target quality attribute
Energy Efficiency.
Other related quality attributes
< unknown >
Measured impact
< unknown >
