7501 shaares
219 private links
219 private links
Now that all the pieces are in place, here is the economic nexus of semi/genAI that particularly interests me:
If model providers make inference much more efficient, then they will not use enough computing power to consume all that is brought to market by the semiconductor industry. If this happens, it will trigger a downward cycle in this industry, significantly slowing down the production of new hardware and possibly having significant global economic and financial repercussions.
If model providers do not make their inference processes more efficient, they will not be able to structurally reduce their marginal costs and, failing to achieve the desired profitability, will resort to the usual means (advertising, tiered subscriptions), which will slow down adoption.
If adoption slows down, model providers will struggle to achieve profitability (with the exception of those with captive markets), their demand for computing power will weaken, and the semiconductor industry will produce excess capacity and enter a downward cycle, taking part of the AI industry with it.
So, the central issue linking today’s semiconductor industry and genAI model providers is how to define how much efficiency gains are enough. Jokingly, we could call this ‘inference inefficiency optimum’.