creator cover SPCell
SPCell

SPCell 

AI content creator

0subscribers

2posts

About

🇷🇺Русский🇷🇺
Данная страница предназначена для англоязычной аудитории. Для русскоязычной есть ВК и другие сервисы, где я публикую свой контент, ссылки смотреть ниже.
🇬🇧English🇬🇧
Greetings to visitors of my page, I am SPCell or simply Cell. On this page I will be publishing my AI content. I create it based on my interests, but you can change my plans through an order. LoRAs on Stable Diffusion is my primary direction.
Links to my other pages can be found here on this page. The services I provide, their cost and methods for contacting me will be listed below. When contacting me, you can offer cooperation, ideas for blog development and order my services.
Discord - spcell.business
Rules:
1. There is a division between what is prohibited to do and what is strictly prohibited to do. For violation of the first point, depending on the context, a warning will follow with further tightening of sanctions up to a temporary or permanent ban, in some cases a ban can be issued immediately. For violation of the second point, a ban will be issued on the spot without warning and the right to unban. In the future, the rules may be changed, updated or revised.
2. The following are prohibited:
1)Messages not on the topic specified in the post, video, etc. This also includes spam and unauthorized advertising.
2)Personal insults to users and the administration, inadequate behavior in general.
3. The following are strictly prohibited:
1)Harassment and bullying of users with the purpose of humiliating and harming them, calls for such actions, organizing them and participating in them.
2)Incitement to hatred and calls for violence or other illegal actions, approval and encouragement of such actions against a person or group of people. This also includes: threats of physical violence, wishes for death or serious injury, incitement to hatred based on race, religion, gender and other characteristics, approval and propaganda of destructive ideologies (e.g. Nazism), incitement to cruelty to animals, description and approval of such actions, description of methods of suicide, incitement to suicide.
3)Incitement and calls for network attacks, DDoS.
4)Instructions for the creation, purchase or sale of prohibited substances, weapons, explosives, counterfeit documents.
5)18+ content (NSFW/NSFL). The only exception: if it corresponds to the topic specified in the post (for example, in a historical context for educational purposes), but each relevant post will clearly define the boundaries of what is acceptable.

My Experience with the RVC Neural Network

Hello, readers! I’m SPCell, and I’d like to share my experience working with the RVC neural network, which I used to create “ Ai covers”,  transferring the vocal performance of one song onto another person’s voice. I began experimenting in summer 2023, had settled on optimal parameters by early 2024, and have since returned from time to time to fine‑tune the quality.
At the heart of it all is the training process: you must select argument values so that the output voice sounds as natural as possible and free of artifacts. If the model is under‑trained, the voice sounds robotic; if it’s over‑trained, the pitch begins to “jump,” but the voice itself still sounds acceptable, which is better than audible glitches. Initially, the best encoder was considered to be harvest, but after rmvpe appeared I switched to it, and later when rmvpe+ came out I adopted that as well, since it produced a modest but noticeable improvement over the version without the “+.”
Other training arguments I tweaked included:
*bitrate (depends on the sample rate of your dataset files),
*hop length (controls how strictly the pitch matches the original; lower values force a tighter match, higher values allow more flexibility),
*thread count (likely tied to how many GPU threads are used, affecting training strength),
*batch size (simultaneous file processing to speed up training; I set it to the maximum my GPU could handle),
*the total number of epochs (and saving checkpoints at intervals),
*the number of GPUs used.
I trained my models on Kaggle, where I could employ two GPUs, but found out that using a single GPU provided a cleaner final voice. To separate vocals from instrumentals I used Ultimate Vocal Remover, then cleaned any remaining artifacts in Adobe Audition, RX Pro Audio Editor, and SpectraLayers.
When it came time to generate covers, I always specified rmvpe or rmvpe+ as an argument, testing pitch adjustments separately so that the voice would match my dataset. In songs where the original singer performed at unusually high or low pitches, I’d raise or lower the generation pitch relative to the song’s normal sections (where the singer stays on a single tone) to keep the character of the voice aligned with the dataset.

My content plan draft

Hello, SPCell here. Posting my content plan draft here.
Posts will be divided by size: small, medium and large. English-speaking audience can read them on Reddit (in specific subreddits that is), Telegram, Threads and here, on Boosty. In the future videos both for my AI content and personal blog are planned to be made. This content plan is not yet final, so some changes most likely will be made.
In the neuro-blog all posts will be published both in Russian and in English. In the personal blog I made a division into Russian and English language segments, since a number of posts will concern life in the Russia, so this information more likely won't say much to an average English-speaking person.
Subscription levels1

Benefactor

$5.5 per month
Simply thanks for supporting me. If there will be more donators I will consider introducing more subscription levels with special features.
Go up