当前位置:网站首页>CVPR 2022 𞓜 a creative and aesthetic text generation method! Support any input
CVPR 2022 𞓜 a creative and aesthetic text generation method! Support any input
2022-06-28 21:41:00 【Zhiyuan community】
This article briefly introduces CVPR 2022 Papers hired “Aesthetic Text Logo Synthesis via Content-aware Layout Inferring” . This paper aims to explore the process of character logo image design Automatic layout generation in . This paper is based on conditional countermeasure generation network (conditional-GAN), A dual discriminator structure and a differentiable splicing module are proposed , According to the visual and semantic information of the input text , The layout geometric parameters of each glyph are deduced , So as to synthesize the text identification image . This method can assist graphic design and other visual tasks related to text . The data set and code related to this work have been open source ( See the end of the article ).
The paper : https://arxiv.org/abs/2204.02701
Data sets and code : https://github.com/yizhiwang96/TextLogoLayout
One 、 Research background
Text mark (Text Logo) The design of depends very much on the creativity and experience of the designer , among , How to arrange the layout of each text element is a core problem . Layout design needs to consider many factors , Such as glyph 、 Literal semantics 、 Theme, etc . Pictured 1 Shown , There is usually no shape overlap between different words ; Line breaks or column breaks in Chinese signs are usually in the word element (Token) after ; Emphasize the meaning of words , Larger sizes are usually used ; Geometric transformations such as oblique cutting and rotation can respectively embody themes such as a sense of strength and a sense of joy . Most of the existing schemes in the industry are to design a set of rules that are easy to implement , Design the layout according to some preset templates , However, the results are often monotonous and lack creativity and beauty . In response to this question , This paper proposes a content aware text logo image generation model , From a large number of existing words Logo Learn layout design rules implicitly , Thus, a new font can be generated for any input font Logo.

chart 1 Common layout types in text logo images
Two 、 Data sets
Training AI Models usually require a lot of data , However, there is no data set for this task in the industry . To solve this problem , This paper proposes TextLogo3K Data sets , With the help of Tencent video platform , collect 、 Marking the 3,470 Zhang's carefully selected words Logo chart , these Logo From movies 、 Cover of TV series and animation . Pictured 2 Sum graph 3 Shown , The dataset accurately annotates the glyphs at the pixel level , Also marked with a font bounding box 、 Character category .

chart 2 TextLogo3K in Logo Image annotation
meanwhile , Their position and segmentation information in the original poster picture are also provided :

chart 3 TextLogo3K Annotation of poster image in
The data set is free for users to do academic research ( No commercial use ). Except for the words Logo Generate , This data set can also be applied to Text detection and recognition 、 Artistic font generation 、 Texture effect migration 、 Scene text editing Etc .
3、 ... and 、 Model design
The flow diagram of this model is shown in the figure below :

chart 4 The flow chart of this model
This model is based on Conditional GAN To generate text Logo, Innovative use of the dual discriminator structure ( Sequence discriminator and image discriminator ), On the trajectory sequence and the whole of the glyph Logo The images are judged separately ; At the same time, with the help of differentiable splicing (Differentiable Composition), Construction position coordinates to Logo The differential rendering process of an image . Its main processes include :
- First, the bimodal characteristics of the input elements are used ( That is, the visual features of the font and the semantic features of the text ), Encode it as a conditional feature .
- Coordinate generator A conditional feature and a random noise are used as input , Predict position coordinates for each character , That is, the coordinate of the center point of the font circumscribed frame , Width and height .
- The position coordinates of each character form a track sequence , Therefore, a Sequence discriminator To judge whether the sequence and are true or false according to the conditions . Note that the coordinate values in this task are continuous , It ensures that the sequence discriminator can propagate the gradient .
- adopt Differentiable splicing , Merge each glyph to get Logo Images .
- introduce Image discriminator , As Sequence discriminator A supplement to , The purpose is to further capture the details of the logo image , Ensure that there is no large overlap between different glyphs , The space between glyphs is reasonable .
边栏推荐
- LeetCode123. The best time to buy and sell stocks III
- LeetCode213. House raiding II
- LeetCode188. 买卖股票的最佳时机IV
- Bitbucket failed to pull the warehouse Using SSH
- pyechart绘制多条y轴折线图
- LeetCode1114. Print in sequence
- Lua源码剖析:一. lua变量类型可变特性在C代码中实现。
- Study on luminiprobe non fluorescent azide -- 3-azido propanol
- Golang JSON serializing and deserializing strings deserializing to map[string]interface{}
- Which is the most reliable and safe for a securities company to open an account
猜你喜欢

视觉弱监督学习研究进展

The further application of Li Kou tree

17 `bs object Node name h3 Parent ` parents get parent node ancestor node

Smarca2 antibody study: abnova smarca2 monoclonal antibody protocol

Anti rabbit dylight 488 abbkine universal immunofluorescence (if) toolbox

pyechart绘制多条y轴折线图

城市大脑知识图谱构建及应用研究

Study on bifunctional crosslinker lumiprobe sulfoacyanine 7 dicarboxylic acid

Security dilemma of NFT liquidity agreement - Analysis of the hacked event of NFT loan agreement xcarnival

什么是接口?什么是接口测试?
随机推荐
Openfire user and group relationship migration
The blocks problem (uva101) Purple Book p110vector application
LeetCode:合并K个升序链表_23
LeetCode188. The best time to buy and sell stocks IV
Smarca2 antibody study: abnova smarca2 monoclonal antibody protocol
LeetCode986. 区间列表的交集
LeetCode877. Stone game
Anr analysis - question 1
Leetcode: expand a binary tree into a linked list_ one hundred and fourteen
After easycvr creates a new user, the video access page cannot be clicked. Fix the problem
API gateway Apache APIs IX helps the evolution of snowball dual active architecture
接口用例设计
17 `bs object Node name h3 Parent ` parents get parent node ancestor node
Zero foundation self-study SQL course | complete collection of date functions in SQL
16 `bs对象.节点名div.属性contents` children descendants 获取子节点 子孙节点
The further application of Li Kou tree
LeetCode:合并两个有序链表_21
Understanding web automated testing
Leetcode daily question - Sword finger offer II 091 Paint the house
LeetCode:二叉树展开为链表_114