Robust Road Sign Segmentation

Robust Road Sign Segmentation
Chung-Hsin Liao 1, Hiromitsu Hama 2
1
Department of Electronic Engineering, ChengShiu University
No.840,ChengCing Rd., Niaosong Township, Kaohsiung County, Taiwan, R.O.C, [email protected]
2
Graduate School of Engineering, Osaka City University, Japan
3-3-138, Sugimoto, Sumiyoshiku, Osaka 558-8585, Japan, [email protected]
information necessary for successful driving-they
describe the current traffic situation, define right-ofThis paper deals with the robust segmentation of road way, prohibit or permit certain directions, warn about
signs under bad illumination conditions including risky factors and also help drivers with navigation.
night-time. The road sign recognition system is very Recently, many techniques have been developed to
important for safe driving. The proposed segmentation segment and recognize road signs. Many of them
system is developed using relative similarity with the adopted the road sign segmentation using color and
two-way connectivity. Here, we focus mainly on how shape information. For example, Piccioli et al. [1] use
to obtain the optimum threshold to each illumination color and a priori information to limit the possible
condition. The proposed system works on various locations of signs in the image. They then extract
types of road signs such as circular, equilateral edges and look for circular or triangular regions before
triangle, pentagon and diamond-shapes. The final goal applying a cross-correlation technique for recognizing
of our proposed system indicates to the driver the the signs. In [2], a redness measure is used to locate
t
op,y
i
e
l
d,a
n
d“
do n
ote
n
t
e
r
”s
i
gn
s
.Th
i
ss
t
e
pi
s
presence of a sign in advance so that some incorrect s
followed
by
edge
detection
and
shape
analysis
to
human decisions may be avoided. The effectiveness of
the proposed method is demonstrated using 100 identify the sign. In their approach to detecting stop
images of road signs under various illumination signs, Yuille and his colleagues [3] correct for the
conditions. The result is encouraging and color of the ambient illumination, locate the
demonstrates the ability and efficiency of our boundaries of the signs and map the sign into a front
algorithm in achieving the segmentation task. to parallel position before reading the sign. Another
According to our experimental results, the average paper that describes actively controlling the camera
for sign detection is [4]. They predict the location of
accuracy rate for segmentation is 98%.
the sign and point the camera for a closer view. Signs
are recognized and their contents are read by template
1. INTRODUCTION
matching.
The road sign recognition system is an essential A decision tree method is used in [5] to detect and
modu
l
e of‘
Dr
i
v
e
rSu
ppor
tSy
s
t
e
m(
DSS)
’
,whi
c
h recognize signs without using color. Detection is
should help human drivers to predict and avoid based on shape using local orientations of image edges
dangerous situation in traffic environment. The and hierarchical templates. The results are sent to a
objective of DSS is to increase the safety, efficiency decision tree which either labels the regions by sign
and comfort of driving. Among them, driving safety is type or rejects them. A method to detect speed limit
the most important and the major concern. So, the signs is given in [6]. It is based on first using color to
primary goal of such a system is to ensure the traffic locate candidate signs, followed by a multi-resolution
safety, which may be also used as a support tool for application of templates that look for circular regions.
vehicle guidance and navigation. The increase in Another approach is taken by Fleischer et al., [7], who
traffic accidents is becoming a serious social problem use a model-based, top down approach. Predictions
with the recent rapid traffic increase. In many cases, are made of locations in which signs may appear and
t
h
edr
i
v
e
r
’
sc
a
r
e
l
e
s
s
n
e
s
si
st
h
epr
i
ma
r
yf
a
c
t
oroft
r
a
f
f
i
c shape, size, and color are used to specify edge models
accident, and the DSS is demanded for supporting for the signs in the 3D world. Signs that are found are
dr
i
v
e
r
’
ss
a
f
e
t
y
.To be a
bl
et
oh
e
l
pt
h
e dr
i
v
e
ri
n tracked through subsequent images using a Kalman
decision-making, DSS must understand the current filter. Shaposhnikov et al., [8], make use of color
traffic situation. Therefore, it should create and segmentation using the CIECAM97 color appearance
maintain the model of its neighbourhood. Because of model. They then use histograms of oriented edge
the dominant role of visual information for the human elements to determine the shape of the sign followed
driver, computer vision methods are often used in by location of the center of the sign. Escalera et al., [9],
semiautonomous and autonomous vehicle prototypes also start with color matching, which they follow with
for the creation of such model. Road signs carry much corner detection in which they look for corners in
Abstract
specific relationships that correspond to triangular,
rectangular, or circular signs.
Extracted sign regions are matched with templates
using a Markov [9-11] model, and the match with
highest rank is regarded as the recognition result.
Until now, many approaches have been proposed to
develop road sign segmentation and recognition
s
y
s
t
e
msun
de
rg
ood c
on
di
t
i
on
sbu
tt
h
e
yc
a
n’
tg
i
v
e
successful results under bad conditions, for example,
at night-time. In this paper, we propose a robust road
sign segmentation system under all illumination
conditions. Although the segmentation is not easy to
be successful at night-time, we focus on such difficult
conditions as shown in figure 1. This paper applies
adaptive thresholds for robust segmentation. Needless
to say, segmentation is not so easy under outdoor
scenes. There are many degrees of freedom that Fig. 1: Difficulties: (a) reflections, (b) not controlled lighting, (c)
shadows, (d) night-time.
caused the segmentation to be difficult.

occlusion and the presence of other objects
with the same plane,

lighting conditions are changeable and not
controllable, producing bright spots and shadows,

color and shape of signs depending their age,
physical conditions and scratches.
To overcome these difficulties, the proposed algorithm
is used both of color and shape information, and
furthermore relative similarity with the two-way
connectivity method. Though the experiments with
100 road signs taken under various illumination
conditions including night-time, it is shown that the
system can give 98% segmentation rate.
The remainder of this system is organized as follows.
In Section 2, we present segmentation method. The
flow diagram for the proposed system is described in
this section. In Section 3, the experimental results and
successful rates are shown. Finally, conclusions and
future works are discussed in Section 4.
2. SEGMENTATION METHOD
Our proposed method is a combination of color feature
based and region based segmentation method.
Segmentation is an important part of any automated
image recognition systems. To achieve region
segmentation, it is needed to set the color distance also
called the threshold first. It depends on decision of
suitable thresholds whether or not to segment the
required region successfully. Color distance is used as
a measurement of color similarity where pixels
satisfying a certain degree of color homogeneity are
grouped to form a cluster.
Here, we propose a robust segmentation method using
relative similarity with the two-way connectivity
method. This paper focuses on how to decide
adaptively the threshold with K-nearest connectivity
for relative similarity using color information. After
noise removal, the interested regions are divided up
from outliers using shape information. The
segmentation is done in the RGB space. The flow
diagram is shown in figure 2.
Fig. 2: Flow diagram.
Most of previous road sign segmentation approaches
used the fixed thresholds or absolute similarity, so
called bounding-box method. For example, the region
segmentation was developed using the thresholds over
gray level images and color images but these methods
were very sensitive to the threshold values. This
method is one of the simplest ways of color image
segmentation. The circular road signs include only
three colors: red, blue and white. For example, in the
case of bounding-box for red region detection, the
thresholds for R, G and B components may be set to
40 R 100 , 10 G 85 ,and 15 B 75 , respectively.
Example is shown in figure 3. The segmentation is
completed for figure 3(a), but not for figure 3(b).
Although we may change the bounding-box so that
can be successful in this failure case, the new box
c
a
n
’
tg
r
a
n
t
e
et
os
ol
v
eot
h
e
ri
npu
ti
ma
g
e
s
.Th
i
sme
a
n
s
,
the bounding-box should be always changed according
to an input image.
Fig. 3: Road sign segmentation using a bounding-box (RGB color
space): (a-1) and (b-1) input image, (a-2) and (b-2) segmented result
using the bounding-box, (a-3) segmented result (successful) and (b3) segmented result (unsuccessful).
Fig. 4: Lattice seed points: (a) lattice seed points in an image, and
(b) complete segmentation by seed region growing (threshold=16).
Fig. 5: Examples of K-nearest connectivity links: (a) one-way and
two-way similarities, (b) K=1, (c) K=2, (d) K=3, (e) K=4, (f) K=5,
(g) K=6, (h) K=7, and (i) K=8 ((b)~(i) threshold=2).
For these reasons, the proposed segmentation
algorithm is developed with a relative similarity
method instead of an absolute one. Relative to filling
algorithm of connected regions, there are many
techniques seed filling, scan line filling algorithms and
so on. They can be applicable to obtaining connected
regions by relative similarity. Among them, the seed
filling algorithm is adopted, and the seed region
growing method here. The main idea is to compare the
relation of the seed pixel (central pixel) and
neighbourhood pixels, that is, they belong to the same
objects. The choice of criterion is very important and
critical. The seed points are assigned at every lattice
point as shown in figure 4. If the lattice is small
enough, then the result would include at least one
region which should be subdivided. The assumption
using suitable intervals can be solved with the option
of splitting such regions. On the other hand, it takes
expensive computational cost. Here we treat three
main types of connectivity: the traditional 4-connected
(N=4), 8-connected (N=8), and K-nearest connectivity
(K=4~8), and two types of similarity: one-way and
two-way. Figure 5 (a) shows a schematic illustration
of the concept of relative similarity where the links are
neighborhoods.
In this figure, one way arrow (A→B) means that A is
similar to B, but B is not similar to A. The two way
arrow (A
C) means that A and C are similar to each
other. In this case, the link is made only between A
and C. According to this way, connected regions are
made, and it is called the two-way method. Figure 5
(e) and (f) show that there is a link between the central
two points ④ and ③ when K=5, and no link when
K=4. In image pattern recognition, thresholding is an
extremely useful technique, because of its powerful
ability to extract characters or objects from their
backgrounds. A real world outdoor scene includes
many objects, such as cars, persons, visual
information signs, buildings and so on, which
composed of a variety of shapes and colors under
various lighting conditions. In analyzing such scene
images, the results after thresholding must be invariant
to these conditions.
The segmentation method used adaptive thresholds
[12-16] for relative similarity [17-19]. More precisely,
the same color region means the uniform color region
under at a certain similarity. The similarity value for
two consecutive pixels is calculated by Eq.(1) that is
defined in RGB space. Here, it is very important how
to decide the optimum threshold. We will find the
stable region for deciding the threshold. It means that
t
h
er
e
g
i
on c
a
n’
tdr
a
ma
t
i
c
a
l
l
yc
h
a
ng
ea
l
t
h
oug
ht
h
e
threshold is gradually changing in a wide range. Here,
this method can be developed to decide the suitable
threshold automatically for various images. These
aspects are shown in figure 6. In this figure, when the
threshold is 14, there are more than three connected
regions. The areas corresponding to L1, L2 and L3 are
100, 125 and 84 pixels, respectively. When the
threshold increases to 16, there is only one region. It
means that the three regions are combined into one
object. In other words, L1, L2 and L3 belong to a
common region. If the threshold increases, the region
area becomes larger. Although the area size can be in
the predetermined range, there may be sometimes a
few error cases. To avoid this kind of problems, this
system uses the margin to the upper and lower
boundaries of the threshold range. Here, the margin 
is set as 5 to obtain the stable area. In figure 6, the
deciding of the optimal thresholds is explained with a
red ring color region of a road sign. According to this
figure, the stable area is in the range of 310 to 400
pixels and the stable region is between 16 and 36.
Then the optimal threshold is decided a value between
21 and 31, for example, Threshold=26. Here the
margin  is 5, then 16+ =21, and 36- =31. Here,
used road signs are taken from the distance between
30m to 40m. Here, the area and ratio parameter for
circular, equilateral-triangle, pentagon and diamondshapes using 20 test images. Using these training data,
the predetermined areas for uniform color regions of
various road sign types are described as shown in table
1. In this paper, relative similarity with 4-connectivity
is adopted to decide the suitable threshold.

distance  R i R j
2
2
2 
G i G j 

B i B j 
,
K-nearest connectivity. The K-nearest (K=8) and 8connected (N=8) are the same. It can be also seen that
K-nearest (K=4) is the best because it gives the widest
range of suitable thresholds that is the mean, minimum
and maximum in the case of K=4 are larger than those
of other K values. The reason to choose (K=4) is that
the wider range of the optimum thresholds gives the
more stable uniform region. Firstly, the optimum
threshold is automatically decided using relative
similarity with the two-way K-nearest connectivity,
here K=4. Using the temporary decided threshold, the
system performs the labelling process. The ROI is
examined using its area and ratio of width and height.
For example, the red ring of road signs is 314~400
pixels at the distance 30m ~ 40m, that already
predefined in table 1. If there is no suitable region at
this stage, the system goes back to the former stage
and checks again and changing the threshold. After
extracting the ROI, the system removes noise and then
completes the segmentation.
(1)
where, Ri, Gi, Bi : the R, G and B components of the
current interest pixel Xi,
Rj, Gj, Bj: the R, G and B components of the
neighborhood pixel Xj.
Fig. 7: The suitable threshold valuation.
3. EXPERIMENT RESULTS
Fig. 6: Decision of the optimum thresholds.
TABLE 1: AREA AND RATION PARAMETER:
Moreover,
the
thresholdings
using
various
connectivity methods are implemented and the results
are compared as shown in figure 7. It includes the
results of 4-connectivity, 8-connectivity and two-way.
To confirm the effectiveness of the proposed method,
the images of road signs were taken under various
illumination conditions by resolution CCD camera.
For the images taken from 30m to 40m distances, the
red ring region area is in the range of 300 to 400 pixels
according to our experimental results. Using the
relationship of the red ring region area and the
distance, the system can be easily extended to images
taken from any distance. In this paper, special
attention is paid to robustness and flexibility under
bad conditions. Some examples of bad conditions are
shown. The access rates for segmentation are
expressed in table 2.
This system was implemented using on Visual Basic
Ver.6 and Halcon Ver.6 and executed on a personal
computer with Pentium 4 processor 2.66 GHz.
Experimental data used here are circular, equilateral
triangle, pentagon and diamond-shapes road signs.
The processing time for deciding the adaptive
threshold is approximately 2.2 seconds. The total time
for segmentation is 2.8 sec. This system decided
automatically adaptive thresholds and then segmented
input images almost perfectly. In this system, there
were a few failure cases in segmentation because of
occlusion and worse illumination conditions that are
shown in figure 8.
REFERENCES
[1] Piccioli, G., De Micheli, E., Parodi, P., and
Ca
mpa
n
i
,M.
,“
A Robu
s
tMe
t
h
odf
orRoa
dSi
g
n
De
t
e
c
t
i
on a
n
d Re
c
ogn
i
t
i
on
,
”I
ma
g
ea
n
d Vi
s
i
on
Computing, vol. 14, no. 3, pp. 209-223, 1996.
[2] L.Es
t
e
v
e
za
n
d N.Ke
ht
a
r
na
v
a
z
,“
Ar
e
a
l
-time
histographic approach to road sign r
e
c
ogn
i
t
i
on
,
”
in Proc. IEEE Southwest Symp. Image Analysis
Interpretation, pp. 94–100, 1996.
[3] Yuille, A. L., Snow, D., and Nitzberg, M.
Si
gnf
i
n
de
r
,“
Us
i
ngCol
ort
oDe
t
e
c
t
,Loc
a
l
i
z
ea
n
d
I
d
e
n
t
i
f
y
I
n
f
o
r
m
a
t
i
o
n
a
l
S
i
g
n
s
,
”
i
n
I
n
t
.
C
o
n
f
.
o
n
4. CONCLUSIONS AND DISCUSSIONS
Computer Vision (ICCV98), Bombay, India, pp.
628-633, 1998.
We have presented an approach for road sign
segmentation under various illumination conditions [4] Miura, Jun, Tsuyoshi, Kanda, and Shirai,
Yos
h
i
a
k
i
,“
AnAc
t
i
v
eVi
s
i
onSy
s
t
e
mf
orRe
a
l
with outdoor scenes. This paper focused differing
T
i
m
e
T
r
a
f
f
i
c
S
i
g
n
R
e
c
o
g
n
i
t
i
o
n
,
”
I
E
E
E
C
o
n
f
.
o
n
from previous approaches, in dealing with many areas
Intelligent Transportation Systems, Dearborn, MI,
of freedoms. The main objective was emphasized to
pp. 52-57, 2000.
the problem of lighting changes and shadows. In this
[5]
Paclik, Pavel and Novov
i
c
ova
,J
a
n
a
,”
Roa
dSi
g
n
system, a robust road sign segmentation algorithm was
Cl
a
s
s
i
f
i
c
a
t
i
on Wi
t
h
ou
tCol
or I
n
f
or
ma
t
i
on
,
”i
n
proposed using adaptive thresholds for relative
Proc. of the 6th Conf. of Advanced School of
similarity with K-nearest connectivity, where K=4.
Imaging and Computing, Lommel, Belgium, 2000.
The system was developed successfully under various
[6]
Se
k
a
n
i
n
a
,Luk
a
sa
n
dTor
r
e
s
e
n,J
i
m,“
De
t
e
c
t
i
onof
illumination conditions such as daytime, evening and
N
o
r
w
e
g
i
a
n
S
p
e
e
d
L
i
m
i
t
S
i
g
n
s
,
”
i
n
P
r
o
c
.
o
ft
h
e
night-time. In this paper, segmentation was completed
16th
European
Simulation
Multiconference
98 road signs among 100 road sign images.
(ESM-2002), Darmstadt, Germany, pp. 337-340,
The future work is to develop a robust segmentation
2002.
system under more occluded conditions, and bad
,a
ndRa
t
h
,T.M.
,“
3Dweather such as snow and rainy days and so on. The [7] Fleischer, K, Nagel, H.-H.
Model-Based-Vision for Innercity Driving
next step of our system is to recognize road signs and
Sc
e
n
e
s
,
” I
EEE I
n
t
e
l
l
i
g
e
n
t Ve
h
i
c
l
e
s Sy
mp.
transmit only the necessary information to the driver
(IV'2002),
Versailles,
France,
pp.
477-482,
2002.
by the way of voice or vibrations that may not disturb
in his driving. This system will be intended to [8] Shaposhnikov, Lubov N., Podladchikova,
Alexander V., Golovan, Natalia, Shevtsova,
c
on
s
i
de
rt
h
edr
i
v
e
r
’
sg
a
z
i
ng u
s
i
ng e
y
ec
a
me
r
aa
n
d
A.
,
Hong
,Kun
bi
n
,a
n
d Ga
o,Xi
a
oh
on
g
,“
Roa
d
when the driver dose not pay attention to the necessary
Sign
Recognition
by
Single
Positioning
of
Spaceroad environments, the system will alert to the driver
Va
r
i
a
n
tSe
n
s
orWi
n
dow,
”i
nPr
oc
.oft
h
e15t
hI
n
t
.
in time.
Conf. on Vision Interface, Calgary, Canada, pp.
213-217, 2002.
[9] A.del
aEs
c
a
l
e
r
aa
n
dL.Mor
e
n
o,“
Roa
dt
r
a
f
f
i
c
s
i
gn de
t
e
c
t
i
on a
n
dc
l
a
s
s
i
f
i
c
a
t
i
on
,
”I
EEE Tr
a
n
s
.
Industrial Electromagnetism, vol. 44, pp. 847–
859,
1997.
[10] M.La
l
on
dea
n
dY.Li
,“
De
t
e
c
t
i
onofr
oa
ds
i
g
n
s
using color i
n
de
x
i
n
g
,
” Ce
n
t
r
e de Re
c
h
e
r
c
h
e
Informatique de Montreal, Montreal, QC, Canada,
CRIM-IT-95, pp. 12–49, 1995.
[11] D.
S.Ka
n
g
,N.
C.Gr
i
s
wol
d,N.Ke
h
t
a
r
n
a
v
a
z
,“
An
invariant traffic sign recognition system based on
sequential color processing and geometrical
transf
or
ma
t
i
on
,
”i
nPr
oc
.oft
h
eI
EEESou
t
hwe
s
t
Symp. on Image Analysis and Interpretation,
Dallas, TX, pp.88-93, April, 1994.
[12] Thi Thi Zin, C. Liao, T. Kaneko, Y. Yanagihara,
Fig. 8: Examples of failure cases in segmentation: (a) occlusion, (b)
H.Ha
ma
,“
Re
c
og
ni
t
i
onofRoa
dEn
v
i
r
onme
n
tby
very dirty, (c) very dark and deteriorate, (d) only a part of region
Using Color Information from Moving Video
was affected by strongly sunlight.
I
ma
g
e
s
,
”I
nPr
oc
.7t
hI
ma
geMe
di
aPr
oc
e
s
s
i
n
g
Symp. (IMPS 2002), pp.109-110, November,
2002.
TABLE 2: SEGMENTATION RESULTS:
[13] Thi Thi Zin, C. Liao, T. Kaneko, H. Hama, Pyke
Ti
n
,“
Robus
tRoa
d Si
g
n Re
c
og
nition byUsing
Multi-f
e
a
t
u
r
eTe
mpl
a
t
eMa
t
c
h
i
ng
,
”I
nPr
oc
.2n
d
Int. Conf. on Computer Applications, Yangon,
Myanmar, pp.474-481, January, 2004.
[14] C.Li
a
o,T.Ka
n
e
k
o,H.Ha
ma
,“
Ex
t
r
a
c
t
i
onof
Road Traffic Signs using Similarity of Adjacent
Col
orI
n
f
or
ma
t
i
on
,
”In Proc. 4th Int. Conf. On
Advanced Mechatronic-Toward Evolutionary
Fusion of It and Mechatronics, Asahikawa Japan,
pp.103-108, 2004.
[15] C.Li
a
o,H.Ha
ma
,“
Ex
t
r
a
c
t
i
onofRoa
dTr
a
f
f
i
c
Signs using Similarity of Adjacent Color
I
n
f
or
ma
t
i
on
,
”I
n Pr
oc
. 36t
hI
n
t
. Conf. On
Stochastic System Theory and ITS Applications,
Tokyo Denki University, Saitama, Japan, pp.8-13,
2004.
[16] C.Li
a
o,H.Ha
ma
,“
Re
c
ogn
i
t
i
onofRoa
dTr
a
f
f
i
c
Si
gn
s
,
”I
n Pr
oc
.9t
hI
ma
g
e Me
di
a Pr
oc
e
s
s
i
ng
Symp. (IMPS 2004), pp.109-110, November,
2004.
[17] C. Liao, Thi Thi Zin,T. Kaneko, H. Hama,
“
Robu
s
tSe
gme
n
t
a
t
i
on of Roa
d Tr
a
f
f
i
c Si
gn
s
Us
i
ng Ada
pt
i
v
e Th
r
e
s
h
ol
ds
”t
h
eI
n
s
t
i
t
ut
e of
Electronics, Information and Communication
Engineers Electronics Express, Vol. 2, No. 14,
pp.423-428, July, 2005.
[18] C.Li
a
o,“
Robus
tSe
gme
ntation of road traffic
s
i
gn
sUs
i
ngRe
l
a
t
i
v
eSi
mi
l
a
r
i
t
y
,
”I
nPr
oc
.10t
h
Image Media Processing Symp. (IMPS 2005),
pp.111-112, November, 2005.
[19] C.Li
a
o,“
Ada
pt
i
v
eRe
g
i
onCol
orSe
gme
n
t
a
t
i
on
f
orRoa
d Si
g
ns
,
”I
n Pr
oc
.37t
hI
n
t
.Con
f
.On
Stochastic System Theory and ITS Applications,
Japan, pp.13-14, October, 2005.
中文摘要
本研究在探討最合適門檻値自動對道路交通標識分割。在數位影像中大多數仍使用絶對門檻値,進行
對影像分割,在各種撮影状態下,很容易受雜音影響。本研究中所使用的 K-nearest connectivity 可
提供非常穩定的影像分割結果。此方法利用相對性性質,並接合了交通標識形顔色形状資訊,以便能正
確對道路交通標識進行分割。在實驗中以晴天,傍晩及夜間,共 100 張照片可自動取得最合適門檻値並
且不需用手動方法自動獲得 98%正確的分割率。