Over the last year, a large number of artificial intelligence tools have appeared to make users' lives easier, from image generation and chatbots all the way up to tools that automate large, professional-grade workflows.
I have been researching, learning and testing many of these tools, from ChatGPT and Gemini to DALL-E and Midjourney. They all work very well, but when I want to scale my applications with them I find that they offer no free or open-source alternative.
This pushed my research a step further, and I came across the Stable Diffusion web UI (image generation, https://github.com/AUTOMATIC1111/stable-diffusion-webui) and *Ollama* (chatbot, https://ollama.com/). Both are open-source tools that can run as a service exposing an API, which any of our applications can consume. That gave me my open-source alternatives, but for this to work I must keep these tools running so my applications can call them.
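To make the "consume it as an API" idea concrete: Ollama exposes a local HTTP API (by default on port 11434). Below is a minimal sketch of calling it from Python with only the standard library; the model name and prompt are just examples, and a model must already be running in Ollama for the final call to succeed.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint


def build_payload(model: str, prompt: str) -> dict:
    """Build the JSON body for a non-streaming generation request."""
    return {"model": model, "prompt": prompt, "stream": False}


def generate(model: str, prompt: str) -> str:
    """Send a prompt to a locally running Ollama server and return its response text."""
    data = json.dumps(build_payload(model, prompt)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]


# Example (requires e.g. `ollama run llama3` to be active first):
# print(generate("llama3", "Hey, how are you doing today?"))
```

Because the interface is plain HTTP and JSON, the same call works from any language or framework, which is exactly what makes these tools easy to plug into existing applications.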
To understand how to bring this into our applications, it is important to understand how these tools work. At their core, they load files with the "safetensors" extension, which store the weights of trained models such as LLMs (large language models), each one trained for a different task depending on the needs of whoever trained it (for example: image generation, translation, code development, chatbots, among others).
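To make the format less of a black box: a ".safetensors" file begins with an 8-byte little-endian length, followed by that many bytes of a JSON header describing every tensor (name, dtype, shape, byte offsets), with the raw weight bytes after it. The sketch below reads that header using only the standard library; it is illustrative only, and for real work you should use the official `safetensors` library.

```python
import json
import struct


def read_safetensors_header(path: str) -> dict:
    """Read the JSON header of a .safetensors file.

    Layout: an 8-byte little-endian unsigned header length, then that many
    bytes of JSON mapping tensor names to their dtype, shape and data offsets.
    """
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        return json.loads(f.read(header_len))
```

Running this against any downloaded model file shows you exactly which tensors it contains and their shapes, which is handy for sanity-checking a model before loading it.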
Now that we understand a little about LLMs and "safetensors" files, the next question is: how do we use these files in our applications? This is where Hugging Face comes in, a website and repository of open-source artificial intelligence models. They have also created their own Python library with two components that are extremely useful for what we want: *Transformers* and *Diffusers*.
*Transformers* (https://huggingface.co/docs/transformers/index) is the component that lets us consume any text-specialized model, for example converting audio to text (or vice versa), or chatbots such as Meta's Llama, among others.
import transformers
import torch

# Load a text-generation pipeline with the Llama 3.1 8B model
model_id = "meta-llama/Llama-3.1-8B"
pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
    device_map="auto",
)

pipeline("Hey how are you doing today?")
Diffusers (https://huggingface.co/docs/diffusers/index) is the component that lets us consume any model specialized in image generation, for example Stable Diffusion.
from diffusers import AutoPipelineForText2Image
import torch

# Load the SDXL-Turbo text-to-image pipeline in half precision
pipe = AutoPipelineForText2Image.from_pretrained(
    "stabilityai/sdxl-turbo", torch_dtype=torch.float16, variant="fp16"
)
pipe.to("cuda")

prompt = "A cinematic shot of a baby racoon wearing an intricate italian priest robe."
# SDXL-Turbo is designed to work with a single inference step and no guidance
image = pipe(prompt=prompt, num_inference_steps=1, guidance_scale=0.0).images[0]
This process is known as model inference, and with this information you can begin to apply artificial intelligence in your different Python applications.
It should be noted that I have also tried model inference in another language, Node.js, and honestly it does not work as well as with Python. It is also important to mention that LLM inference requires powerful hardware, so what you save by not using the ChatGPT or Gemini APIs you may end up spending on suitable hardware.
That's it, my first article. I hope my path toward using LLMs in software development helps you skip a few steps along your own.