-
EchoMimicV2 Testing Dataset (EMTD) for Halfbody Human| Portrait Animation Series #4
EchoMimicV2 Testing Dataset (EMTD) for Halfbody Human| Portrait Animation Series #4 Buiding a dataset is often a tedius but critical task. However, when you can run EchoMimicV2 and understand how the pose inputs work, you may want to have a dataset. Running EchoMimicV2 on Windows 11 | Portrait Animation Series #1EchoMimicV2 is an opensource framework for…
-
Understanding Pose Inputs in the EchoMimicV2 Pipeline | Portrait Animation Series #3
Understanding Pose Inputs in the EchoMimicV2 Pipeline | Portrait Animation Series #3 This article analyses the pose input format and its role in the EchoMimicV2 (EM2) pipeline. We are able to set customised pose inputs in the previous post. But now we want to dive in and have a better understanding of this data and its…
-
Customised Pose Inputs in EchoMimicV2 | Portrait Animation Series 2
Customised Pose Inputs in EchoMimicV2 | Portrait Animation Series 2 This article explores how to set customised pose inputs in EchoMimicV2. When you can run EchoMimicV2 on your own PC, you may also want to explore the pose format and use customised pose inputs instead of the default ones. Explore the default pose inputs In the folder…
-
Running EchoMimicV2 on Windows 11
EchoMimicV2 is an opensource framework for audio driven human portrait animation. Compared to many other solutions (talking head) which only focusing on facial animation and head movement, this framework includes upper-body movement. The official repository tested the system on the Linux environment. However, this article records the process of running this framework on Windows 11…
-
Vision Transformers (ViT) Experiments Using PyTorch and PyTorch Lightning
Also can be seen at: https://medium.com/@heyulong3d/vision-transformers-vit-experiments-using-pytorch-and-pytorch-lightning-61e26738d9dd?sk=8326d1c2706380c7599c67e53d2e2b5c Overview This article will implement Vision Transformer (ViT) from scratch using PyTorch and PyTorch Lightning. It also covers insightful experiments with different patch size, model size, attention heads, and other improvements like overlapping patch embedding on CIFAR-10 dataset. This article will focus more on practice and experiments rather than…
-
[Deep Learning | ChatGPT]Learn Transformers(1) Tensors: It’s All About APIs!
Environment: TensorFlow Code Error happened.
-
[Deep Learning | ChatGPT]Learn Transformers(0) Let’s Do Some Research!
I have not done much deep learning research for about one year, because I have focused on development development and deep learning model deployment in Unreal 5. These days, ChatGPT have gained much popularity in our world, and we do have some technical need to learn text-based interface of human-computer interaction. I have done deep…
-
[Deep Learning]ONNX Runtime C++(2): Inferencing ONNX Model of Dynamic Input/Output Size Using CUDA
Some ONNX models have dynamic input/output size: In the above example, onnxruntime will set dimensions of batch_size, height, width to -1. NOTE: I tested it on onnxruntime v1.13.1 and onnxruntime-win-x64-gpu-1.13.1.zip. Some tips to avoid runtime errors(these errors often happen, and waste me a lot of time😱): We should Specify its shape when creating input tensors,…
-
[Deep Learning]ONNX Runtime C++(1):DL Model Inference Using ONNX Runtime
Example: FnsCandyStyleTransfer This example is based on microsoft/onnxruntime-inference-examples. I use Windows x64 platform, so LibPNG is a dependency: Compile and use libpng and zlib in Visual Studio References Error happened.