GenAI Archives - Samuel Sum

Multimodal AI: Combining Text, Images, and Audio in Models (e.g., GPT-4V, LLaVA)

On

May 20, 2025

By

admin

Artificial Intelligence is evolving rapidly—from processing text in chatbots to understanding images and even interpreting audio. At the forefront of this evolution is Multimodal AI: models that can process and reason across multiple data types—text, images, audio, and video—within a unified framework. Multimodal AI is not just a technical leap; it’s reshaping how machines understand…

Continue reading

The Misconceptions of LLM: Is a Large Model Really Omnipotent?

On

March 30, 2025

By

admin

In recent years, with the rapid development of large language models (LLMs), many corporate executives have been eagerly embracing this technology, believing it to be a panacea for all problems. Since early 2025, the rise of DeepSeek has further fueled market enthusiasm, especially in Hong Kong, where many enterprises have begun massive investments in LLM-related…

Continue reading

Understanding the Proper and Best Practices for Using Generative AI

On

May 22, 2024

By

admin

Generative AI, particularly large language models (LLMs) like OpenAI’s ChatGPT, has revolutionized the way we interact with technology, providing powerful tools for generating text, answering questions, and even creating art. However, many people harbor misconceptions about these tools, believing they can answer anything or solve every problem. As a data scientist, it’s crucial to understand…

Continue reading

Tag: GenAI

Multimodal AI: Combining Text, Images, and Audio in Models (e.g., GPT-4V, LLaVA)

The Misconceptions of LLM: Is a Large Model Really Omnipotent?

Understanding the Proper and Best Practices for Using Generative AI

Categories

Archives

Tags