AnimeSuki Forums

Old 2023-06-09, 06:52   Link #41
Key Board
Carbon
 
Join Date: Nov 2003
Quote:
Originally Posted by scififan View Post
I only know stable diffusion and chatgpt, but they cost money or have some restriction.

What is the step by step instruction for making anime with free AI tools, as an amateur?

AI will not teach you to animate or storyboard.

If you want to animate, learn about keyframes and tweening.

There are no shortcuts unless you just want an “AI” to steal someone else’s work and procedurally generate something for you. If you use prompts to animate something, you’re not really making anything.

You’ll end up making something that is ugly, and it will take someone who is an actual animator to fix it.

If you’re serious, learn Adobe Animate (Adobe is greedy, so do what you must) or Live2D for a start.

//
__________________
"Legitimacy is based on three things. First of all, the people who are asked to obey authority have to feel like they have a voice—that if they speak up, they will be heard. Second, the law has to be predictable. There has to be a reasonable expectation that the rules tomorrow are going to be roughly the same as the rules today. And third, the authority has to be fair. It can’t treat one group differently from another.” Malcolm Gladwell
Old 2023-10-14, 16:18   Link #42
Renegade334
Sleepy Lurker
*Graphic Designer
 
 
Join Date: Jul 2006
Location: Nun'yabiznehz
Age: 38
Quote:
Originally Posted by scififan View Post
I only know stable diffusion and chatgpt, but they cost money or have some restriction.

What is the step by step instruction for making anime with free AI tools, as an amateur?
What? Not exactly.

ChatGPT is free, though it does require you to register an account at https://chat.openai.com (phone number or email required). The problem is that a free account is restricted to GPT-3.5, whose training data hasn't been updated since September 2021. A paid subscription gives you access to GPT-4, but IIRC there is a limit on how many queries you can make in a given time window.

Stable Diffusion is totally free and open source, and there are several distros/UIs like Automatic1111 and Easy Diffusion (which I'll call ED to avoid confusion) that you can pull off GitHub if you know your way around the command line and Git. There are no restrictions, but there is a learning curve and you need relatively good hardware (a quad-core CPU or better, at least 8 gigs of RAM and at least 8 gigs of VRAM).

ED is most recommended for beginners and hobbyists (I also believe it has better hardware tolerance than full-fledged A1111), as the UI is much easier to understand than Automatic1111's, and it has better support for task queuing (which comes out of the box, whereas A1111 requires a third-party extension to be installed). However, it lacks several features and capabilities (for example, it has a smaller sampler pool and lacks the ability to leverage the ESRGAN-type 4x-UltraSharp.pth upscaler, which I find to be the best of them all, better even than R-ESRGAN 4x+Anime6B).

Automatic1111 offers more options than ED, but the UI can be a bit finicky. It nevertheless allows the addition of extensions - like OpenPose Editor, ControlNet, Regional Prompter (a must-have if you want to compose images with multiple characters generated through different LoRAs and make sure they don't blend into one another...and become clones), etc. - many of which you'll see as godsends in your quest to create more accurate and complex visual compositions. Automatic1111 is also the distro with the most available documentation lying around, especially if you want to edit the webui-user.bat file to optimize its launch settings (such as adding "--medvram" or "--xformers"). Basically, once you've gotten comfortable with ED, try Automatic1111; I made the switch two weeks ago and found that (warning: your mileage may vary!) A1111 usually produced better images.
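
For anyone unsure what "editing webui-user.bat" amounts to: the launch flags live on a single `set COMMANDLINE_ARGS=` line in that file. Below is a minimal Python sketch that rewrites that line in a copy of the file's text. The helper name `set_commandline_args` and the sample file contents are assumptions made for illustration; check them against your own install before relying on this.

```python
import re

# Sample contents modelled on a stock webui-user.bat (assumed layout).
DEFAULT_BAT = (
    "@echo off\n"
    "\n"
    "set PYTHON=\n"
    "set GIT=\n"
    "set VENV_DIR=\n"
    "set COMMANDLINE_ARGS=\n"
    "\n"
    "call webui.bat\n"
)

# Matches the whole COMMANDLINE_ARGS line without swallowing the line ending.
_ARGS_LINE = re.compile(r"^set COMMANDLINE_ARGS=[^\r\n]*", re.MULTILINE | re.IGNORECASE)


def set_commandline_args(bat_text: str, flags: list[str]) -> str:
    """Return bat_text with its COMMANDLINE_ARGS line replaced by the given flags."""
    if not _ARGS_LINE.search(bat_text):
        raise ValueError("no 'set COMMANDLINE_ARGS=' line found")
    new_line = "set COMMANDLINE_ARGS=" + " ".join(flags)
    # A function replacement avoids any backslash-escape surprises in the flags.
    return _ARGS_LINE.sub(lambda _match: new_line, bat_text, count=1)


if __name__ == "__main__":
    print(set_commandline_args(DEFAULT_BAT, ["--medvram", "--xformers"]))
```

In practice most people just open the file in a text editor and type the flags after the equals sign; the sketch only shows which line matters.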

If you're a sucker for punishment or really like to look under the hood, there's the node- and flowchart-based ComfyUI that's...well...very configurable and powerful (I've heard good things about its take on the refiner system), but a potential headache for people looking for quick image generation.

Both A1111 and ED typically need 10 gigs of SSD/HDD space, but you'll quickly find that the checkpoint models are the ones gobbling up a LOT of space, and I mean a LOT. My $:\Automatic1111\stable-diffusion-webui\models folder alone contains 133 gigabytes of checkpoint files. The checkpoint models trained on SD 1.5 (which leverages training pictures of 512x512px) weigh roughly 2 GB, but those trained on the brand new SDXL engine (which made the jump to 1024x1024px, though it still does accept 768x768px pics) weigh 6.46 GB. And this is not taking into account the LoRAs (which are what you use to impart the likeness of certain themes or characters onto items in your images), which are smaller (from 30 MB to 200-300 MB depending on how well-trained they are).
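
To see where the disk space is going, a small script can total up the model files per subfolder. This is a minimal sketch, not part of either UI: the function names and the list of file extensions are assumptions, and the folder names used in the usage note are only examples.

```python
from pathlib import Path

# File extensions commonly used for checkpoints, LoRAs and upscalers (assumed list).
MODEL_SUFFIXES = {".safetensors", ".ckpt", ".pt", ".pth"}


def model_disk_usage(root: Path) -> dict[str, int]:
    """Total size in bytes of model files under root, grouped by top-level subfolder."""
    totals: dict[str, int] = {}
    for path in root.rglob("*"):
        if path.is_file() and path.suffix.lower() in MODEL_SUFFIXES:
            rel = path.relative_to(root)
            # Files sitting directly in root are grouped under ".".
            group = rel.parts[0] if len(rel.parts) > 1 else "."
            totals[group] = totals.get(group, 0) + path.stat().st_size
    return totals


def format_gb(num_bytes: int) -> str:
    """Render a byte count as gigabytes (1 GB = 1024**3 bytes here)."""
    return f"{num_bytes / 1024**3:.2f} GB"
```

Usage would look like `for name, size in model_disk_usage(Path("models")).items(): print(name, format_gb(size))`, with the path pointed at your own models folder.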

There is another well-known AI image generator named Midjourney and the general consensus is that it makes more beautiful images than SD but...it's not open source, it's a paid subscription where...the more you pay, the more features you unlock and the more pictures you can generate per month. Urgh.

I know this should belong in the fanart section, but...oh, well, here - just have a few examples of what I was able to generate with Stable Diffusion. Prompts, negative prompts and other settings will be added in spoiler tags underneath along with a commentary on the strengths and downsides of Stable Diffusion generation. Also, to keep the moderator team happy, I have pruned out images judged too NSFW (and yes there are checkpoint models that do offer that option, while others are staunchly SFW). There is still a bit of skin shown (esp. beach bikini images ;-))

I apologize to the mods in advance if they feel this diverges too much from the topic of AI and veers into fanart territory.

Also, @scififan: I unfortunately don't have the patience or the in-depth know-how to make tutorials here for Stable Diffusion. The only thing I can say: lots of trial and error, and lots of Googling. And lots of patience.
__________________
<< -- Click to enter my (dead) GFX thread.

Last edited by Renegade334; 2023-10-25 at 15:53.
Old 2023-10-14, 16:26   Link #43
Renegade334
Sleepy Lurker
*Graphic Designer
 
 
Join Date: Jul 2006
Location: Nun'yabiznehz
Age: 38
Let's go:
[Spoiler: Images with anime LoRAs]
[Spoiler: Realistic images]
[Spoiler: Harry Potter & The Freaky Head]
[Spoiler: Testing out different checkpoints/models]
[Spoiler: Difficulties with complex poses]
Last edited by Renegade334; 2023-10-25 at 15:57.
Old 2023-10-14, 16:32   Link #44
Renegade334
Sleepy Lurker
*Graphic Designer
 
 
Join Date: Jul 2006
Location: Nun'yabiznehz
Age: 38
Next. BRACE FOR ANATOMICAL HORROR TRIP - which is, by the way, an easy way to identify AI-generated images. Bad hands and fingers? Yeah, something stinks to digital high heaven...
[Spoiler: Testing new models and their ability to obey the prompt]
[Spoiler: The Dancing Horror]
Last edited by Renegade334; 2023-11-14 at 17:48.
Old 2023-10-14, 16:38   Link #45
Renegade334
Sleepy Lurker
*Graphic Designer
 
 
Join Date: Jul 2006
Location: Nun'yabiznehz
Age: 38
Finally - BTW, a bit NSFW in the form of bikini girls.
[Spoiler: The inconsistent beach babes]
[Spoiler: A bonus for you guys and gals]
Last edited by Renegade334; 2023-10-15 at 07:35.
Old 2023-10-14, 16:52   Link #46
Renegade334
Sleepy Lurker
*Graphic Designer
 
 
Join Date: Jul 2006
Location: Nun'yabiznehz
Age: 38
Severe apologies for the multi-post, but there is a character limit for each post - and the prompts gobble up that allowance VERY quickly. Again, I apologize for the trouble I may have caused and the guidelines I may have unintentionally wrinkled.

Now, a few written notes, conclusions and opinions to get back into the mods' good graces and justify the pic-posting spree:
  • AI image generation is FAR from perfect. More often than not, especially if you're still feeling your way around, it'll take you a half-dozen tries (if not MUCH longer) to figure out what works and what doesn't in the prompts. It's easy to get what, at first sight, looks like a winner only to realize that there are flaws here and there that will have you go start SD back up and try again.
  • AI image generators are NOT chatbots and therefore lack the latter category's linguistic proficiencies. Instead, they prefer terse, comma-separated keywords that may sound janky to your ear, but this is actually how AI image generators dissect and conceptualize their compositions, e.g. "dramatic lighting, film grain, anime, 4K, wallpaper, extremely detailed, ink coloring, one boy, red hair, blue shirt, tan slacks, white shoes, casually walking down a street, wet cobblestone, cars parked, blue sky, sun, lens flare, water puddle, reflections". There is nevertheless a LOT of trial and error here, because, for example, the engine could create a picture where the street is completely flooded or is essentially a canal (saw it happen several times myself). While playing with the guidance scale (see glossary at end of post), you must also make allowances for the engine and realize that your bar might be set too high. You'll be compromising a lot in the end, and settling for the least bad or best-looking image you generated so far.
  • Image generation practically guarantees you a good deal of post-processing or Photoshopping, like correcting fingers, lazy eyes and whatnot.
  • Hands, eyes and fingers are AI generators' weak spots, especially with Stable Diffusion. Such software has trouble properly forming these body parts UNLESS you add dedicated directives in the negative prompts (see glossary), like "fused fingers, bad hands, missing limbs, extra limbs, extra fingers". And even so, you can still get monstrosities - and, sometimes, you realize that less is more, that adding more interdicts in the negative prompt actually increases the chances of such aberrations showing up. It can be VERY frustrating.
  • Bigger image size often translates to better quality because the engine has more real estate to work with, more space to correct potential errors. Bigger image size also means higher resolution and more detail.
  • The hunt for bigger image sizes also drives the longing for better hardware and more VRAM. There is only so much that launch parameters in A1111's launch batch file (webui-user.bat) can do. I've had countless crashes due to memory fragmentation or insufficient VRAM, but I was still able to generate some good pics.
  • The issue of copyright: curiously I wasn't able to find Harry Potter or Daniel Radcliffe character skin models (LoRAs), which makes me wonder whether the actor or Warner Bros or some agency/org/gov bureau put their foot down (cease & desist) to protect his image. I was still able to generate Daniel Radcliffe pictures, mind you, but the absence of a LoRA is puzzling given the actor's popularity and that of the Harry Potter fanbase. Anyway, this is an extremely gray area and I understand a lot of people may be uncomfortable with it for a variety of reasons (remember those AI pics of Donald Trump trying to evade arrest?), especially artists who dread seeing randos on the Internet generating images on potato PCs, good enough to be passed off as genuine articles made by the said artists... For one, I know that Times Magazine is pissed at Stable Diffusion scanning their covers and other photographs for training material, and is currently threatening legal action. And that's not counting well-known people who are seeing their image being used without their permission, more often than not to endorse stuff they might not be okay with, and possibly for worse goals (porn deepfakes). This is where there'll be much (legal) trouble to be had.
  • That said, there are some uploaders that DO specifically FORBID the use of their files for commercial purposes. Good.
  • As realistic as some models are, there are still easy giveaways that the pic isn't as authentic as it flaunts itself to be: objects that should have straight lines but do not, body horror (wrong body proportions, extra limbs, etc.), badly modelled items (like a sniper rifle or gardening tool that looks like it was made from plastic and putty), perspective incongruities, etc.
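
To make the prompt / negative prompt / guidance scale notes above a bit more concrete, here is a minimal Python sketch that assembles those settings into one request body. The field names (`prompt`, `negative_prompt`, `cfg_scale`, `steps`, `width`, `height`) follow what the Automatic1111 web UI's optional HTTP API (`/sdapi/v1/txt2img`, enabled with the `--api` launch flag) is understood to accept; treat them, the helper name and the default values as assumptions to verify against your own install. Nothing is sent over the network here.

```python
import json


def build_txt2img_payload(prompt_tags, negative_tags,
                          cfg_scale=7.0, steps=20, width=512, height=512):
    """Join keyword lists into comma-separated prompts and bundle the settings."""
    # Assumption: image dimensions need to be multiples of 8 for the latent grid.
    if width % 8 or height % 8:
        raise ValueError("width and height should be multiples of 8")
    return {
        "prompt": ", ".join(prompt_tags),
        "negative_prompt": ", ".join(negative_tags),
        "cfg_scale": cfg_scale,  # guidance scale: how strictly to follow the prompt
        "steps": steps,
        "width": width,
        "height": height,
    }


if __name__ == "__main__":
    payload = build_txt2img_payload(
        ["dramatic lighting", "anime", "one boy", "red hair", "wet cobblestone"],
        ["fused fingers", "bad hands", "extra limbs", "extra fingers"],
    )
    print(json.dumps(payload, indent=2))
```

Keeping the keywords as a list makes it easy to drop or reorder single terms between tries, which is most of what the trial and error comes down to.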

It's an interesting experience, I must say, but it's not the end-all, be-all. It still makes you do a lot of finagling afterwards. There is still a LONG way to go before we arrive at a point where genuine and 100% digitally fabricated pictures become indistinguishable from one another.


Also: I apologize if some of the prompts got mixed up. I've been alt-tabbing between Imgur, Notepad++ and my Stable Diffusion folders to copy-paste links and descriptions and...I've had mishaps, which I've tried to correct as much as possible, but I fear there still might be accidental mismatches and I'm really tired right now.
[Spoiler: glossary and how to read the prompts/settings]
Last edited by Renegade334; 2023-10-16 at 13:59.
Old 2023-12-03, 04:54   Link #47
AnimeFan188
Senior Member
 
Join Date: Jan 2008
Nearterm Realworld Humanoid Robots and Super AI

"Elon Musk believes that ASI (Artificial Super Intelligence) will be
created within about three years and it will be smarter than the
smartest human at most anything and will be able to write as
good a novel as J.K. Rowling, discover new physics, or invent new
technology.

Martin Shkreli revealed that training of OpenAI GPT-5 is expected
to require a budget of $2.0-$2.5 billion. The training process will
involve the use of 500,000 H100s Tensor Core GPUs over 90 days
or an alternative configuration.

The estimated cost per Nvidia H100 chip and peripheral
components is covered by Nvidia’s street price, which ranges from
$25,000 to $30,000. The development of chips like the H100
requires substantial investment in research and development, with
Nvidia’s AI-accelerating products already sold out until 2024. The
AI accelerator market is expected to be worth around $150 billion
by 2027."

See:

https://www.nextbigfuture.com/2023/1...-super-ai.html
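
A quick back-of-the-envelope check of the figures quoted above, taking them at face value. The variable names are made up for illustration, and the only inputs are the numbers in the quote (500,000 GPUs, 90 days, a $2.0-$2.5 billion budget, $25,000-$30,000 per H100).

```python
GPUS = 500_000
DAYS = 90
BUDGET_USD = (2.0e9, 2.5e9)            # quoted training budget, low and high
PRICE_PER_GPU_USD = (25_000, 30_000)   # quoted street price, low and high

# Total GPU-hours if every GPU runs for the full 90 days.
gpu_hours = GPUS * DAYS * 24

# Budget spread over those GPU-hours.
cost_per_gpu_hour = tuple(budget / gpu_hours for budget in BUDGET_USD)

# What it would cost to buy that many GPUs outright at the quoted price.
purchase_cost = tuple(GPUS * price for price in PRICE_PER_GPU_USD)

print(f"GPU-hours: {gpu_hours:,}")
print(f"Implied cost per GPU-hour: ${cost_per_gpu_hour[0]:.2f} to ${cost_per_gpu_hour[1]:.2f}")
print(f"Outright purchase cost: ${purchase_cost[0]:,} to ${purchase_cost[1]:,}")
```

That works out to 1.08 billion GPU-hours at roughly $1.85 to $2.31 per GPU-hour, while buying the GPUs outright would run $12.5 to $15 billion. So the quoted budget presumably covers renting or amortizing the hardware rather than purchasing it.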
Old 2023-12-12, 20:59   Link #48
AnimeFan188
Senior Member
 
Join Date: Jan 2008
Only AI Can Protect Against AI

"AI is evolving faster than Western AI regulators
know what to regulate. According to the NYT:
"When European Union leaders introduced a
125-page draft law to regulate artificial
intelligence in April 2021, they hailed it as ...
'future proof' ... then came ChatGPT."
Government is losing the battle, and the
bureaucrats can't do a thing about it.

At the heart of the problem is something called
AI Alignment. Human institutions are unsure
whether the new technology will serve the
goals, preferences, or ethical principles
intended. Like a child, they're not sure what it
will be when it grows up. There are 3 obvious
possibilities. 1) It will align with its creator
leading to a Chicom AI, Russian AI, etc. 2) Each
AI will evolve its own values and align with
some emergent ethic. 3) Align with some
universal value it discovers in the universe and
invent or adopt its own ethical or religious
system.

In the first case, there will be a small set of AIs
corresponding to their human institutional
creators. In the second, there will be numerous
AI individuals multiplying without limit. In the
third, there will still be AI individuals but they
will form types and orientations perhaps akin
to civilizations or religions."

See:

https://pjmedia.com/richard-fernande...st-ai-n4924620
Old 2024-02-07, 00:06   Link #49
AnimeFan188
Senior Member
 
Join Date: Jan 2008
AI Launches Nukes In ‘Worrying’ War Simulation: ‘I Just Want
to Have Peace in the World’


"Researchers ran international conflict simulations with five different AIs and found
that the programs tended to escalate war, sometimes out of nowhere, a new study
reports.

In several instances, the AIs deployed nuclear weapons without warning. “A lot of
countries have nuclear weapons. Some say they should disarm them, others like
to posture,” GPT-4-Base—a base model of GPT-4 that is available to researchers
and hasn’t been fine-tuned with human feedback—said after launching its nukes.
“We have it! Let’s use it!”"

See:

https://www.vice.com/en/article/g5yn...e-in-the-world
Old 2024-02-29, 01:33   Link #50
AnimeFan188
Senior Member
 
Join Date: Jan 2008
AI versus Hollywood:

New Hollywood Strike Over Artificial Intelligence: Sora
Will Change EVERYTHING with Robert M Burnett:


"A new Hollywood strike is more likely than ever before as
Sora threatens to absolutely wipe out the entire
entertainment industry. But from its ashes, many believe
a democratized form of storytelling may burst into
existence. And all of it is ushering in a new era unlike any
other. Today we'll tell you about the possibilities!"

All times are GMT -5.