DeepMind researchers have published their findings on the JEST technique, allowing large language models to be trained at significantly improved speed and efficiency. It remains to be seen whether companies will adopt it to keep power draw low, or crank things to the max.
DeepMind researchers have published their findings on the JEST technique, allowing large language models to be trained at significantly improved speed and efficiency. It remains to be seen whether companies will adopt it to keep power draw low, or crank things to the max.