-
Running EchoMimicV2 on Windows 11
EchoMimicV2 is an opensource framework for audio driven human portrait animation. Compared to many other solutions (talking head) which only focusing on facial animation and head movement, this framework includes upper-body movement. The official repository tested the system on the Linux environment. However, this article records the process of running this framework on Windows 11…
-
Vision Transformers (ViT) Experiments Using PyTorch and PyTorch Lightning
Also can be seen at: https://medium.com/@heyulong3d/vision-transformers-vit-experiments-using-pytorch-and-pytorch-lightning-61e26738d9dd?sk=8326d1c2706380c7599c67e53d2e2b5c Overview This article will implement Vision Transformer (ViT) from scratch using PyTorch and PyTorch Lightning. It also covers insightful experiments with different patch size, model size, attention heads, and other improvements like overlapping patch embedding on CIFAR-10 dataset. This article will focus more on practice and experiments rather than…
-
OpenCV C++ Crash Course Notes: Samples
Binary Image: 0 or 1 Gray Scale Image: RGB Image: Importing Images Importing Video Webcam Reading Processing: Gray, Blur, Canny Edge Detector Processing: Dilate, Erode Resize Crop Region of Interest(RoI) Draw Shapes & Text Warping Color Detection & Trackbar Detect Shapes & Contours Face Detection References Error happened.