Abstract
Portrait sketching is widely used in digital art, forensic security and other fields with its unique value. However, existing portrait sketch style transfer techniques often focus on overall style transformation, neglecting the hierarchical drawing steps and the integration of details in the sketching process. The generated images still fall short in terms of layering effects and detail representation. To address this issue, this paper proposes a generative adversarial network-based portrait sketch temporal generation pipeline (PSTG). The pipeline simulates the artist’s layer-by-layer drawing process, sequentially executing from outline to facial features and then to hair, to generate high-quality sketch images with detailed expressiveness. Additionally, we designed a composite network structure that includes both global and local generators. The global generator is responsible for capturing overall contours and proportions, while the local generators focus on the detailed depiction of facial features and hair. This structure excels in capturing both overall proportions and local details. Experimental results demonstrate that the PSTG method not only restores the detail hierarchy of sketches but also achieves significant success in retaining sketch details and brushstroke effects. It also effectively imitates the brushstroke style of artists, generating portrait sketches that visually resemble the works of professional artists, outperforming existing holistic stylization methods overall.







Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
No datasets were generated or analyzed during the current study.
References
Yi, R., Liu, Y.-J., Lai, Y.-K., Rosin, P.L.: Apdrawinggan: generating artistic portrait drawings from face photos with hierarchical gans. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10735–10744 (2019)
Pinkney, J.N.M., Adler, D.: Resolution dependent gan interpolation for controllable image synthesis between domains. arXiv arXiv:2010.05334 (2020)
Karras, T., Laine, S., Aittala, M., Hellsten, J., Lehtinen, J., Aila, T.: Analyzing and improving the image quality of stylegan. In: Computer Vision and Pattern Recognition (2019)
Shu, Y., Yi, R., Xia, M., Ye, Z., Zhao, W., Chen, Y., Lai, Y.-K., Liu, Y.-J.: Gan-based multi-style photo cartoonization. IEEE Trans. Vis. Comput. Graph. 27(1), 1–1 (2021)
Song, G., Luo, L., Liu, J., Ma, W.-C., Lai, C.-P., Zheng, C., Cham, T.: Agilegan: stylizing portraits by inversion-consistent transfer learning. ACM Trans. Graph. 40(4), 117:1-117:13 (2021)
Cui, A., McKee, D., Lazebnik, S.: Dressing in order: recurrent person image generation for pose transfer, virtual try-on and outfit editing. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 14618–14627 (2021)
Li, Y.J., Fang, C., Hertzmann, A., Shechtman, E., Yang, M.H.: Im2Pencil: controllable pencil illustration from photographs. In: Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 1525–1534. IEEE, Long Beach (2019)
Wang, S.Y., Bau, D., Zhu, J.Y.: Sketch your own GAN. In: Proceedings of IEEE/CVF International Conference on Computer Vision, pp. 14030–14040. IEEE, Montreal (2021)
Yang, S., Wang, Z., Liu, J., Guo, Z.: Deep plastic surgery: robust and controllable image editing with human-drawn sketches. In: Computer Vision—ECCV,: 16th European Conference, Glasgow, UK, August 23–28, 2020. Proceedings, Part XV, pp. 601–617. Springer, Berlin (2020)
Lu, Y.Y., Wu, S.Z., Tai, Y.W., Tang, C.K.: Image generation from sketch constraint using contextual GAN. In: Proceedings of the 15th European Conference on Computer Vision, pp. 213–228. Springer, Munich (2018)
Peng, C., Wang, N., Li, J., Gao, X.: Universal face photo-sketch style transfer via multiview domain translation. IEEE Trans. Image Process. 29, 8519–8534 (2020)
Wan, W., Yang, Y., Lee, H.J.: Generative adversarial learning for detail-preserving face sketch synthesis. Neurocomputing 438, 107–121 (2021)
Zhang, M., Wang, N., Li, Y., Gao, X.: Bionic face sketch generator. IEEE Trans. Cybern. 50(6), 2701–2714 (2020)
Yu, J., Xu, X., Gao, F., Shi, S., Wang, M., Tao, D., Huang, Q.: Toward realistic face photo-sketch synthesis via composition-aided gans. IEEE Trans. Cybern. 51(9), 4350–4362 (2021)
Li, L., Tang, J., Ye, Z., Sheng, B., Mao, L., Ma, L.: Unsupervised face super-resolution via gradient enhancement and semantic guidance. Vis. Comput. 37, 2855–2867 (2021)
Li, Y., Chen, X., Wu, F., Zha, Z.-J.: Linestofacephoto: face photo generation from lines with conditional selfattention generative adversarial networks. In: Proceedings of the 27th ACM International Conference on Multimedia, ser. MM’19, pp. 2323–2331. Association for Computing Machinery, New York (2019)
Li, Y., Chen, X., Yang, B., Chen, Z., Cheng, Z., Zha, Z.-J.: DeepFacePencil: Creating Face Images from Freehand Sketches, pp. 991–999. Association for Computing Machinery, New York (2020)
Zhang, S.C., Ji, R.R., Hu, J., Lu, X.Q., Li, X.L.: Face sketch synthesis by multidomain adversarial learning. IEEE Trans. Neural Netw. Learn. Syst. 30(5), 1419–1428 (2019). https://doi.org/10.1109/TNNLS.2018.2869574
Sheng, B., Li, P., Gao, C., Ma, K.-L.: Deep neural representation guided face sketch synthesis. IEEE Trans. Vis. Comput. Graph. 25(12), 3216–3230 (2019)
Zhang, M., Wang, R., Gao, X., Li, J., Tao, D.: Dualtransfer face sketch-photo synthesis. IEEE Trans. Image Process. 28(2), 642–657 (2019)
Zhang, L.L., Lin, L., Wu, X., Ding, S.Y., Zhang, L.: End-to-end photo-sketch generation via fully convolutional representation learning. In: Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, pp. 627–634. ACM, Shanghai (2015). https://doi.org/10.1145/2671188.2749321
Zhu, J.Y., Krähenbühl, P., Shechtman, E., Efros, A.A.: Generative visual manipulation on the natural image manifold. In: Proceedings of the 14th European Conference on Computer Vision, pp. 597–613. Springer, Amsterdam (2016). https://doi.org/10.1007/978-3-31946454-1-36
Zhang, M.J., Wang, N.N., Li, Y.S., Wang, R.X., Gao, X.B.: Face sketch synthesis from coarse to fine. In: Proceedings of the 32nd AAAI Conference on Artificial Intelligence, California, USA, pp. 7558-7565 (2018). https://doi.org/10.1609/aaai.v32i1.12224
Zhang, M.J., Wang, N., Li, Y., Gao, X.: Markov random neural fields for face sketch synthesis. In: Proceedings of International Joint Conferences on Artificial Intelligence, Stockholm, Sweden, pp. 7558–7565 (2018)
Zhang, S.C., Ji, R.R., Hu, J., Gao, Y., Lin, C.W.: Robust face sketch synthesis via generative adversarial fusion of priors and parametric sigmoid. In: Proceedings of the 27th International Joint Conference on Artificial Intelligence, Stockholm, Sweden, pp. 1163–1169 (2018)
Wang, L.D., Sindagi, V., Patel, V.: High-quality facial photo-sketch synthesis using multi-adversarial networks. In: Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, pp. 83–90. IEEE, Xi’an (2018). https://doi.org/10.1109/FG.2018.00022
Kazemi, H., Iranmanesh, M., Dabouei, A., Soleymani, S., Nasrabadi, N.M.: Facial attributes guided deep sketch-to-photo synthesis. In: Proceedings of IEEE Winter Applications of Computer Vision Workshops. IEEE, Lake Tahoe (2018). https://doi.org/10.1109/WACVW.2018.00006
Kazemi, H., Taherkhani, F.: Unsupervised facial geometry learning for sketch to photo synthesis. In: Proceedings of International Conference of the Biometrics Special Interest Group. IEEE, Darmstadt (2018)
Duan, S.C., Chen, Z.X., Wu, Q.M.J., Cai, L., Lu, D.: Multi-scale gradients self-attention residual learning for face photo-sketch transformation. IEEE Trans. Inf. Forensics Secur. 16, 12181230 (2020). https://doi.org/10.1109/TIFS.2020.3031386
Goodfellow, I.J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, Canada, pp. 2672–2680 (2014)
Peng, C., Zhang, C., Liu, D., Wang, N., Gao, X.: Face photo-sketch synthesis via intra-domain enhancement. Knowl. Based Syst. 259(C), Art. no. 110026 (2023). https://doi.org/10.1016/j.knosys.2022.110026
Yi, R., Xia, M., Liu, Y.-J., Lai, Y.-K., Rosin, P.L.: Line drawings for face portraits from photos using global and local structure based gans. IEEE Trans Pattern Anal Mach Intell 43(10), 3462–3475 (2021)
Yi, R., Liu, Y.-J., Lai, Y.-K., Rosin, P.: Quality metric guided portrait line drawing generation from unpaired training data. IEEE Trans. Pattern Anal. Mach. Intell. 66, 1 (2022)
Zenoozi, A.D., Navi, K., Majidi, B.: Argan: fast converging gan for animation style transfer. In: International Conference on Machine Vision and Image Processing (MVIP), vol. 2022, pp. 1–5 (2022)
Li, B., Zhu, Y., Wang, Y., Lin, C.-W., Ghanem, B., Shen, L.: Anigan: style-guided generative adversarial networks for unsupervised anime face generation. IEEE Trans. Multimed. 23(4), 1–1 (2021)
Chen, Z., Wang, C., Yuan, B., Tao, D.: Puppeteergan: arbitrary portrait animation with semantic-aware appearance transformation. In: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2020, pp. 13515–13524 (2020)
Huang, Z., Zheng, Z., Yan, C., Xie, H., Sun, Y., Wang, J., Zhang, J.: Real-world automatic makeup via identity preservation makeup net. In: International Joint Conference on Artificial Intelligence (2021)
Zhang, X., Zheng, Z., Gao, D., Zhang, B., Yang, Y., Chua, T.S.: Multi-view consistent generative adversarial networks for compositional 3D-aware image synthesis. Int. J. Comput. Vis. 131(8), 2219–2242 (2023)
Suo, Y., Zheng, Z., Wang, X., et al.: Jointly harnessing prior structures and temporal consistency for sign language video generation. ACM Trans. Multimed. Comput. Commun. Appl. 18(2), 1–19 (2022)
Lin, X., Sun, S., Huang, W., Sheng, B., Li, P., Feng, D.D.: EAPT: efficient attention pyramid transformer for image processing. IEEE Trans. Multimed. 25, 50–61 (2021)
Xie, Z., Zhang, W., Sheng, B., Li, P., Chen, C.P.: BaGFN: broad attentive graph fusion network for high-order feature interactions. IEEE Trans. Neural Netw. Learn. Syst. 34(8), 4499–4513 (2021)
Zhou, Y., Chen, Z., Li, P., Song, H., Chen, C.P., Sheng, B.: FSAD-Net: feedback spatial attention dehazing network. IEEE Trans. Neural Netw. Learn. Syst. 34, 66 (2022)
Zhang, S., Ji, R., Hu, J., Lu, X., Li, X.: Face sketch synthesis by multidomain adversarial learning. IEEE Trans. Neural Netw. Learn. Syst. 30(5), 1419–1428 (2019)
Heusel, M., Ramsauer, H., Unterhaltung, T., Nair, V., Urtasun, R.: GANs trained by a two time-scale update rule converge to equilibrium. In: Advances in Neural Information Processing Systems (NeurIPS), pp. 6736–6744
Wang, Z., Bovik, A.C., Sheikh, H.R., Simoncelli, E.P.: Image quality assessment: from error visibility to structural similarity. IEEE Trans. Image Process. 13(4), 600–612 (2004)
Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 2017, pp. 2223–2232 (2017)
Shang, M., Gao, F., Li, X., Zhu, J., Dai, L.: Bridging unpaired facial photos and sketches by line-drawings. In: ICASSP 2021—2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2010–2014. IEEE (2021)
Isola, P., Zhu, J.-Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, pp. 5967–5976 (2017). https://doi.org/10.1109/CVPR.2017.632
Author information
Authors and Affiliations
Contributions
Mingfu Xiong provided critical guidance and fnancial support for this study, Mengsi Guo wrote the main manuscript text and prepared the experimental data, and Jin Huang was responsible for the preliminary algorithm ideas. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised: the assignment of portraits and text in author biographies was not correct.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Guo, M., Xiong, M., Huang, J. et al. Face photo-sketch portraits transformation via generation pipeline. Vis Comput 41, 1183–1196 (2025). https://doi.org/10.1007/s00371-024-03403-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00371-024-03403-5