SPADE-based Line Art Colorization

Last updated on Oct 14, 2024

Course project in KAIST Introduction to Artificial Intelligence course (CS470)

Project Summary

SPADE is a semantic image synthesis model designed to generate photorealistic images based on a specified semantic input. It introduces new spatially-adaptive normalization layers, addressing the issue of standard normalization layers potentially erasing semantic information and leading to suboptimal synthesized images. Recognizing the potential applicability of this model to colorization, we considered the line art and color hints as analogous to the semantic map in SPADE. Our hypothesis was that by using these inputs, the model could effectively synthesize images resembling illustrations, allowing users to colorize line art with their preferred colors.

To this end, we will deliver a SPADE-based model that colorizes a given line art image using a hint image. Instead of the semantic map the SPADE originally used, we use the line art image and hint image that consists of color patches as a semantic input. We also implement a web demo application that allows the users to use our resulting model interactively.

Libraries & Frameworks

PyTorch
Flask
HTML/CSS

Deep Learning

SPADE-based Line Art Colorization

Project Summary

Libraries & Frameworks

Graduate Student Researcher @ KAISTResearch Intern @ Microsoft Research

Graduate Student Researcher @ KAIST
Research Intern @ Microsoft Research