A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automat ic Speech Recognition an d Text-to-Speech)