liudongdong1 收录于 Categories Math&Model

2020-07-13 约 994 字预计阅读 2 分钟 - 次阅读

https://lddpicture.oss-cn-beijing.aliyuncs.com/picture/image-20201129135707833.png

Siamese Network 是一种神经网络的框架，用于评估两个输入样本的相似度，而不是具体的某种网络，就像seq2seq一样，具体实现上可以使用RNN也可以使用CNN。

1. Siamese Network

1.Paper

level: author: Sumit Chopra(Courant Institute of Mathematical Sciences), Raia Hadsell(New York University), Yann LeCun date: keyword:

similarity metric

Chopra, Sumit, Raia Hadsell, and Yann LeCun. “Learning a similarity metric discriminatively, with application to face verification.” 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05). Vol. 1. IEEE, 2005. Cited by 2402

Paper: Similarity Metric

Summary

present a method for training a similarity metric from data, which is used for recognition or verification applications where the number of categories is very large and not known during training, and where the number of training samples for a single category is very small.
learn a function that maps input patterns into a target space such that the $L_1$ norm in the target space approximates the semantic distance in the input space;

Proble Statement

traditional approaches to classification using discriminative methods require that all categories be known in advance, also require that all the categories available for all categories.
- computing a similarity metric between the pattern to be classified or verified and a library of stored prototypes;
- use non-discriminative probablilistic methods in a reduce-dimension space,

Methods

system overview:

$$ E_w(X_1,X_2)=||G_w(X_1)-G_w(X_2)|| $$

Condition 1:

$\exists m>0$, such that $E_w(x_1,x_2)+m<E_w(X_1,X_2’)$;

**【Contrastive Loss Function】 **

a contrastive term to ensure not only that the energy for a pair of inputs from the same category is low, but also that the energy for a pair from different categories is large;

$(X_1,X_2)i$Y，X_1,X_2)i$ : the i-th sample with a pair of images and a label;
$L_G$: the partial loss function for a genuine pair;
$L_I$: the partial loss function for an imposter pair;
$P$: the number of trainning samples;

$$ H(E_w^G,H_w^I)=L_G(E_w^G)+L_I(E_w^I) $$

Condition 2: the minima of $H(E_w^G,H_w^I)$ should be inside the half plane $E_w^G+m<E_w^I$;

Condition 3: the nagative of the gradient of $H(E_W^G,E_w^I)$ on the margin line $E_w^G+m=E_w^I$ has a positive dot product with the direction [-1,1];

Notes 去加强了解

condition 证明部分没有看懂；

2. 应用

2.1. Signature Verification

Bromley, Jane, et al. “Signature verification using a” siamese" time delay neural network." Advances in neural information processing systems. 1994. cited by 1942

base on Siamese neural network, design a system for verification of signatures written on a pen-input tablet;
contain two sub-networks to extract features from two signatures, while the joining neuron measures the distance between the two feature vectors;

2.2. Image patches comparation

Zagoruyko, Sergey, and Nikos Komodakis. “Learning to compare image patches via convolutional neural networks.” Proceedings of the IEEE conference on computer vision and pattern recognition. 2015. cited by 978

level: CVPR author: Sergey Zagoruyko date: 2015 keyword:

image patches