Abstract enterprise AI cost dashboard showing teams, workflows, and token spend

AI Cost Attribution Is Becoming a First-Class AI Architecture Decision

The real lesson in Bedrock’s new cost attribution feature is not the AWS feature itself. It is that AI cost visibility now belongs in the architecture conversation, not as an afterthought for finance.

April 23, 2026 · 6 min · Mohit Joshi
Abstract enterprise AI workflow moving from pilot to production

Why AI Pilots Stall, and What the Teams That Deliver Do Differently

Eleven practical patterns that improve the odds of turning AI experimentation into real business value.

April 22, 2026 · 6 min · Mohit Joshi
Robot hand typing

The Evolution of Natural Language Processing: A Comprehensive Overview of Retrieval Augmented Generation (RAG)

Natural Language Processing (NLP) has witnessed remarkable advancements in recent years, fueled by the convergence of innovative algorithms, abundant data, and computational resources. Among the latest breakthroughs in NLP is the Retrieval Augmented Generation (RAG) model, which represents a significant paradigm shift in how machines comprehend and generate human language. This essay provides a detailed examination of RAG, exploring its architecture, applications, implications, and future directions in the realm of NLP. ...

April 17, 2024 · 6 min · Mohit Joshi
Google Cloud banner

Google Boosts Vertex AI with Powerful LLMs, Agent Builder, and More

Google made a splash at its recent Cloud Next conference with a slew of updates to Vertex AI, its machine learning platform. These updates focus on three key areas: expanding Large Language Model (LLM) capabilities, streamlining Machine Learning Operations (MLOps), and introducing a new agent builder tool. Supercharged LLMs: More Context, Audio Processing, and Live Images Vertex AI now boasts a public preview of the mighty Gemini 1.5 Pro LLM. This powerhouse model tackles massive datasets with its 1-million-token context support. Google claims this eliminates the need for fine-tuning or complex retrieval augmented generation (RAG) techniques for many enterprises. ...

April 16, 2024 · 4 min · Mohit Joshi
Image representing AI

Microsoft Introduces .NET Smart Components: AI-Powered UI Controls

As a software architect, I’m constantly on the lookout for ways to improve development efficiency and user experience. The recent introduction of .NET Smart Components by Microsoft has piqued my interest. These pre-built, AI-powered UI controls promise to streamline development and enhance user interaction within .NET applications. This eliminates the need to invest significant resources in building custom AI features from the ground up, freeing up valuable time and expertise for focusing on core functionalities. ...

March 22, 2024 · 3 min · Mohit Joshi
StarCoder2 banner

StarCoder2: The Next Generation of Open-Source AI-powered Coding Assistant

Are you a programmer looking to boost your productivity and unlock new coding possibilities? Look no further than StarCoder2, the next generation of open-source large language models (LLMs) designed to supercharge your coding workflow. Developed by a powerhouse collaboration between ServiceNow, Hugging Face, and Nvidia, StarCoder2 promises to revolutionize AI-powered coding tools. What is StarCoder2 and Why Should You Care? StarCoder2 isn’t just another AI coding assistant. It’s a family of three open-access and royalty-free LLMs specifically trained to generate code. This means you can leverage the power of AI to streamline your coding tasks, improve efficiency, and potentially unlock new creative avenues. ...

March 13, 2024 · 6 min · Mohit Joshi
Iamge of a robot writing code

Stack Overflow Opens API to Google's Gemini

This partnership between Google and Stack Overflow has major implications for the future of coding. Let’s delve deeper with some key questions: What does this API mean for developers? Traditionally, developers have relied on scouring Stack Overflow forums to find solutions. This new API, called OverflowAPI, grants Google’s AI model, Gemini (used in “Gemini for Google Cloud”), access to Stack Overflow’s vast knowledge base. This translates to faster, more targeted solutions directly within the Google Cloud Console, eliminating the need for context switching. Additionally, solutions will be accompanied by citations to the original Stack Overflow source, ensuring credibility. ...

March 13, 2024 · 2 min · Mohit Joshi

What Is GPT?

Generative Pre-trained Transformer (GPT) represents a groundbreaking advancement in the field of artificial intelligence (AI). Developed by OpenAI, GPT models are part of a family of neural network architectures that leverage the transformer architecture. These models excel at generating human-like text and content, answering questions, and engaging in natural language conversations. Unlike traditional neural networks that provide simple yes/no answers, GPT can produce coherent and contextually relevant responses. Significance of GPT GPT marks a paradigm shift in natural language processing (NLP). Instead of rigid rules, it learns from real-world language, enabling it to understand and generate text with a nuance and versatility unheard of before. This opens doors to a plethora of applications across industries, from streamlining content creation to revolutionizing research and analysis. The rise of GPT models marks a pivotal moment in the widespread adoption of machine learning. Their ability to automate and enhance a wide range of tasks, from language translation and document summarization to creative writing and code generation, has revolutionized various industries. GPT’s speed and scalability enable organizations to achieve higher productivity levels and reimagine customer experiences. ...

February 22, 2024 · 7 min · Mohit Joshi
Portrait of Edmond de Belamy

What is Generative AI and How to Upskill Yourself?

Imagine a world where machines not only understand existing content, but can invent something entirely new. Enter the fascinating realm of Generative AI (Gen AI), where algorithms go beyond analysis and become creators. This blog delves into the essence of Gen AI, exploring its history, inner workings, and groundbreaking applications. We’ll also equip you with insights on how to upskill yourself in this rapidly evolving field. What is Generative AI? Generative AI (AKA Gen AI) refers to a class of machine learning models that create new content rather than making predictions based on existing data. Unlike traditional AI algorithms that identify patterns within a dataset, generative AI generates novel outputs based on the patterns it has learned during training. ...

February 12, 2024 · 9 min · Mohit Joshi
Image of deepfake

Facebook and Instagram to Fight Misinformation with AI Image Labels: Can They Keep Up with the Deepfake Deluge?

In a significant move towards combating the proliferation of manipulated content, Meta, the parent company of Facebook, Instagram, and Threads, has announced plans to label all images created using artificial intelligence (AI). The initiative aims to enhance transparency and empower users to distinguish between authentic and AI-generated visuals. Meta’s Commitment to Detecting AI Fakery Meta’s existing practice involves labeling AI-generated images produced by its own systems with the tag “Imagined with AI.” Now, the company intends to extend this labeling system to include images generated by other companies’ AI tools. The technology, still under development, will be deployed across Facebook, Instagram, and Threads. ...

February 8, 2024 · 4 min · Mohit Joshi