Git Product home page Git Product logo

symposium2023's Introduction

Symposium 2023 Artificial Intelligence and ChatGPT

Source code and Documentation for my ADUG Symposium Talk presented on the 28th of April 2023. I have since added to and enhanced the code to further demonstrate capabilities of AI.

Artificial intelligence (AI) is an interdisciplinary field that combines computer science, mathematics, and cognitive psychology to create intelligent systems capable of performing complex tasks. Its rapid advancements have led to a wide array of applications demonstrating AI's versatility.

Language translation is one such application, where AI-powered tools efficiently translate between languages, simplifying tasks like translating software programs for global audiences. AI also excels in human-like conversations, with interactive applications that understand and respond to human language naturally. Voice recognition and real-time speech-to-text allow conversion and seamless voice-based interactions, making AI-driven applications more accessible and user-friendly.

In creative and artistic domains, AI can generate images based on textual descriptions, showcasing its capacity to understand and produce visual content. AI's computer vision capabilities enable it to accurately recognize faces and other objects in photographs and documents, illustrating its potential in visual recognition tasks and diverse applications like security and automation.

AI's ability to analyze and process data, and generate comprehensive reports highlights its value in various domains. Furthermore, AI-powered tools can transcribe audio files into written text, making transcription tasks more efficient and accurate.

The example programs below is an attempt to demonstrate the capabilities available to Delphi programmers today. I have worked on creating generic API's so that different providers can be swapped in or out to experiment or for any other reason.

ChatGPT Prompts

Some Example GPT Prompts

Presentation Slides

Example programs

  • EmbeddingsDemo
    • Simple demo showing how Embeddings work
  • Translate
    • translates between languages using the various cloud API's.
    • Simplify translating Delphi programs when using Delphi's built-in multi language resource support.
  • DelphiChatGPT
    • write questions to ChatGPT and have it speak the answer. image
  • FaceDetection
    • Detect faces in a photo. image
  • Weather
    • Query the weather forcast for Bendigo from the bureau of meteorology generate a paragraph or two and read it out image
  • TranscribeAudio
    • Upload a audio file and have it translated via a cloud speech to text api.
  • VoiceRecognition
    • convert speech to text in real-time straight from your microphone, feed it to OpenAI's GPT and have the response read back to you. image
  • Image generation
    • generate an image using text that you provide using OpenAI's DALLE-2 API.

Providers Used/Available

  • Google - Text to Speech, LLM, Translate
  • Microsoft Azure - Text to Speech, GPT, Translate
  • Amazon - Text to Speech, Translate
  • Anthropic claude-2 and claude-instant-1 support one of largest context windows currently available
  • Replicate access a wide range of models
  • Huggingface access a wide range of models
  • ElevenLabs Text to Speech and Voice Cloning
  • OpenAI
  • AssemblyAI Voice Recognition
  • DeepGram Voice Recognition
  • Rev.AI Voice Recognition
  • Conqui-ai Run a variaty of text to speech models locally from a docker container

Getting the projects working

  • Each of the cloud API's need to have been setup in their respective developer consoles. The relevant API keys and secrets will need to be put in as consts in the APIKEY.INC file.
  • A file in /libs/APIKEY.INC.EXAMPLE shows all the available keys to enter. If you're not using a particular provider you don't need a key for it.

Questions about code and how to set things up

  • Please feel free to raise issues about any questions you have about the code. I know there is a lot to this project and lots to setup, so I would like to improve the documentation to make it easy for everyone to use all the parts of this project.

Potential future areas of research/study

  • Using Embeddings to search large datasets
  • Using Python4Delphi to be able to call various Python AI libraries from Delphi.

Artificial Intelligence Related links

External Libraries required to build projects

  • ICS 9.0 or from the GetIt package manager built into Delphi.
  • AWS SDK For Delphi (Preview) in GetIt Package Manager in Delphi 11

Tools used to create example projects

symposium2023's People

Contributors

geoffsmith82 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.