Tag: GenAI
-
Multimodal AI: Combining Text, Images, and Audio in Models (e.g., GPT-4V, LLaVA)
Artificial Intelligence is evolving rapidly—from processing text in chatbots to understanding images and even interpreting audio. At the forefront of this evolution is Multimodal AI: models that can process and reason across multiple data types—text, images, audio, and video—within a unified framework. Multimodal AI is not just a technical leap; it’s reshaping how machines understand…
-
The Misconceptions of LLM: Is a Large Model Really Omnipotent?
In recent years, with the rapid development of large language models (LLMs), many corporate executives have eagerly embraced this technology, believing it to be a cure-all. Since early 2025, the rise of DeepSeek has further fueled market enthusiasm, especially in Hong Kong, where many enterprises have begun massive investments in LLM-related…
-
Understanding the Proper and Best Practices for Using Generative AI
Generative AI, particularly large language models (LLMs) like OpenAI’s ChatGPT, has revolutionized the way we interact with technology, providing powerful tools for generating text, answering questions, and even creating art. However, many people harbor misconceptions about these tools, believing they can answer anything or solve every problem. As a data scientist, it’s crucial to understand…


