dmm: (Default)
GPT-4 in ChatGPT+ can now read and analyze images, one can upload an image and discuss it with GPT-4. 

GPT-4 in ChatGPT+ can also generate text prompts and render images via DALL-E 3.

What is missing is the ability to load an image and modify it with DALL-E 3 (or to load the image and use it as an inspiration for image generation by DALL-E 3).

1) So, on one hand, I want to ask a question:

Do people here have experience with loading an image and modifying it using an AI system (or with loading an image and using it as an inspiration for an AI system)? Any recommendations?

2) On the other hand, there are two possible workarounds.

2a) One can use an ASCII art inside GPT-4 to inspire images rendered by DALL-E 3 (it works, but I was not able to obtain the results I would like so far).

2b) One can analyze an image and obtain an image description in one mode of GPT-4 in ChatGPT+, and then open another session and use that description in the prompt to guide DALL-E 3.

Some of the results obtained via method 2b) are quite interesting. I am going to post related images and dialogs as a series of comments.

Update: In November, the ability to read and create images within one session has been added (and, generally speaking, all functionality which has been separated into different type of sessions is now available from a single session, you should be able to ask the system for anything, be it "search the web", or "generate and execute code", or "upload this data, and then generate code to analyze this data, and run it"; these used to require starting different types of sessions, but now all this is available together).
dmm: (Default)
Let's see whether github hosting for *.webp images is OK to render them here, in the post, and in the comments.

test image

dmm: (Default)
CodeGeeX seems to be a reasonably competitive free and open source alternative to GitHub Copilot. It might be a good thing to be aware of (although we do have ChatGPT these days).

Riffusion is a free and open source app which generates spectrograms via stable diffusion and converts them to music.

Links are in the comments.
dmm: (Default)
Sigmoid Social is a Mastodon instance for people researching, working on, or just interested in AI. It seems that a number of AI people are creating accounts there. I followed the example set by Ken Stanley and created one for myself too:

sigmoid.social/@DataflowMatrixMachines

It has a different optional verification setup: one can verify that one controls a Web page referenced from one's Mastodon profile if one feels like it. For example, I used that verification mechanism to put a green checkmark on the link to the GitHub mirror of my resume page.

It's often quite annoying that Twitter interferes with reading it anonymously, without a login. This is one of the better workarounds:

nitter.net/

Lexica.art lets one to search for AI-generated art created by other people (no account required) and to generate one's own AI art (a free account is required):

lexica.art/

Mage.space lets one to generate AI art (no account required, but it is possible to create one):

www.mage.space/

dmm: (Default)
A nice introduction to AI art, together with some history of the scene: deeplearn.art/get-started-with-making-ai-art-in-2022/

Profile

dmm: (Default)
Dataflow matrix machines (by Anhinga anhinga)

January 2025

S M T W T F S
   1234
5 67891011
12131415161718
19202122232425
262728293031 

Syndicate

RSS Atom

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Apr. 23rd, 2025 05:21 pm
Powered by Dreamwidth Studios