Logo
Published on

ChatGPT is Evolving: Are You Keeping Up?

Authors

ChatGPT took a massive leap forward by integrating voice, image, and internet browsing capabilities. Dive into these groundbreaking features as I share my own experience. I generated every piece of AI Art in this post, including the thumbnail.

chatgpt-platform-banner

Introduction

The pace at which artificial intelligence (AI) evolves is breathtaking, unfolding exponentially faster than our own. It’s happening right before our eyes, and the momentum only seems to be accelerating. In the last month, OpenAI has unveiled a series of exciting new features for ChatGPT, amplifying its utility for entrepreneurs and the modern workforce manifold. We are treading a constantly changing arena, where the onus falls on innovators to discover new ways of harnessing this technology for good. Let’s delve into the latest updates and releases from OpenAI and ChatGPT.

AI generated Monalisa in the style of Van Gogh

AI Art: Van Gogh's Monalisa

ChatGPT Can See, Listen and Talk

OpenAI recently rolled out voice and image capabilities for ChatGPT, offering a more intuitive interface for users. As of this writing, the voice feature is available on mobile devices, while image functionality is accessible on both mobile and web platforms.

Now, users can send images to ChatGPT and ask about anything related to the visual content. Whether you need help with a math problem or are planning your next meal with the ingredients available in your fridge, a simple snapshot will do the trick. This feature in particular has saved me a lot of time spent troubleshooting. For example, while reading an article or debugging code, I can take a screenshot and ask ChatGPT to help me understand the text or pinpoint the error. With the tedious task of manually typing or copy-pasting text to provide context no longer needed, my productivity has increased significantly. This tool has countless potential applications, such as analyzing complex graphs or work-related data. Moreover, the mobile app includes a drawing tool, allowing users to focus on a specific part of the image for a more precise analysis.

Screenshot of ChatGPT processing image of a coffee cup

Screenshot: ChatGPT Processing Photo of a Coffee Cup

In addition, ChatGPT now supports back-and-forth conversations. Users even have the choice between different voices to suit their preferences. Like Apple’s Siri or Windows' Cortana, the user interface is intuitive, making the interaction seamless and user-friendly. Next time you find yourself in a heated debate at the dinner table, simply open up the ChatGPT app on your phone and settle the matter. The capability of OpenAI to generate voice based on text opens up new opportunities for accessibility. Imagine podcasters expanding their reach by translating their podcasts into various languages using their own voice, courtesy of ChatGPT’s voice synthesis. Listen to famous podcaster, Lex Freidman, testing a new Spotify feature as he translates his podcast from English to Spanish.

AI generated colourful landscape in the style of contemporary art

AI Art: Colourful Landscape in Contemporary Art Style

Image Generation: Welcoming DALL-E 3

For those of you unfamiliar with DALL-E, it's a remarkable AI model created by OpenAI that can generate images from text prompts. With the integration of its latest version, DALL-E 3, into ChatGPT, the platform is now able to craft diverse, aesthetic, and sometimes whimsical visual representations. This means ChatGPT not only understands and responds to images but can also transform textual descriptions into vivid images with impressive accuracy.

This integration unveils boundless opportunities for businesses across various domains. For instance, in product design, a startup could leverage DALL-E 3’s enhanced image generation to visualize product concepts before even reaching the prototyping stage. By inputting textual descriptions of desired product features and aesthetics, teams can obtain visual prototypes generated by ChatGPT, thus accelerating the design process and fostering more collaborative design discussions.

AI generated prototype sketch of luxury sports car

AI Art: Conceptual Sketch of a Luxury Sports Car

Similarly, marketing teams can harness this new feature to craft compelling marketing campaigns. By translating abstract campaign ideas into tangible visual concepts, ChatGPT with DALL-E 3 integration can aid in creating more engaging and visually appealing marketing material. This not only streamlines the creative process but also provides a means to test and iterate marketing ideas swiftly.

The integration of DALL-E 3 with ChatGPT not only exemplifies the advancement in AI-driven image generation but also hints at a future where the boundaries between text and image, reality and imagination, become increasingly blurred, paving the way for applications previously unknown. It sounds like a paradise for artists and designers.

AI generated interior design of home with ocean view

AI Art: Conceptual Rendering of Luxury Home Interior Design

Internet Browsing: Surf the Web with ChatGPT

The internet is an inexhaustible resource of information, but navigating it efficiently to extract the precise information needed can sometimes be a daunting task. The recent enhancement in ChatGPT’s capability, now including an internet browsing feature, is a step towards simplifying this process.

With this new feature, ChatGPT can browse the web to pull in real-time information, making it an even more powerful tool for entrepreneurs and professionals. Imagine needing updated market statistics for an urgent project, or the latest news relevant to your industry. ChatGPT can now fetch this data for you, saving you the time and effort of sifting through numerous web pages.

Screenshot of ChatGPT fetching statatistic from the internet

Screenshot: ChatGPT Fetching Statistics from the Internet

This browsing feature essentially turns ChatGPT into a more interactive and real-time information retrieval system. By combining the power of GPT-4’s language processing with the ability to browse the web, ChatGPT is evolving into a comprehensive tool that can significantly enhance productivity and decision-making across various sectors.

The Caveat & Silver Lining

While the new features in ChatGPT hold immense potential for enhancing productivity and sparking innovative applications, there's a small catch — they come as part of the premium subscription which will set you back $20 USD plus tax (approximately $30 CAD) per month. This might pose a challenge for small business owners or individuals on a budget. However, there's a silver lining. There are free alternatives available that will give you a teaser of similar functionalities.

  1. DALL-E 2: You can experiment with the previous version of DALL-E on the OpenAI website and explore its image-generation capabilities at no cost.
  2. Whisper: Although not a direct alternative to ChatGPT's voice feature, OpenAI's Whisper is an Automatic Speech Recognition (ASR) system that could be explored for voice-related applications.
  3. Phind: This free AI browser is a great alternative, which I personally recommend for those keen on the web browsing feature of ChatGPT. Phind operates as a standalone browser and aims to make web navigation and information retrieval more cost-effective and efficient. Although they advertise themselves as a search engine for programmers, I have used it several times to find information with cited references and also to ask the AI model about recent events when I did not have access to ChatGPT Plus.
Screenshot of Phind AI Browser User Interface

Screenshot: Phind AI Browser User Interface

Conclusion

As OpenAI and ChatGPT break new ground in artificial intelligence, we find ourselves seamlessly incorporating these innovations into our daily lives. This article is a testament to that: I've integrated AI-generated art to enhance visual storytelling. DALL-E 3, in particular, has demonstrated its incredible ability to craft artistic, aesthetically pleasing, and practical images. AI image generation is most certainly causing waves in the fields of art and design, and I cannot wait to dive into these topics in the future. Moreover, we've witnessed the transformative power of voice generation in enhancing language accessibility — and that's just scratching the surface of its potential. As ChatGPT continues to evolve, it becomes an even more potent tool, unlocking new business strategies, creative expressions, and interactive digital experiences.

To all the artists, designers, entrepreneurs and visionaries reading this: I encourage you to immerse yourself in this dynamic landscape of AI, to experiment fearlessly, and to redefine the boundaries of your craft.

Remember, the only real constant in life is change.

Get the latest on Generative AI

Weekly articles
Get the latest industry updates delivered to your inbox every week.
No spam
We respect your inbox. Only valuable content, no spam.