Researchers have found that the carbon footprint of generative AI-based tools that can turn text prompts into images and videos is far worse than we previously thought.
As detailed in a new paper, researchers from the open-source AI platform Hugging Face found that the energy demands of text-to-video generators quadruple when the length of a generated video doubles — indicating that the power required for increasingly sophisticated generations doesn’t scale linearly.
For instance, a six-second AI video clip consumes four times as much energy as a three-second clip.
“These findings highlight both the structural inefficiency of current video diffusion pipelines and the urgent need for efficiency-oriented design,” the researchers concluded in their paper.
Experts are warning that we’re ro