Chapter 1: Computer Systems

HiPIC
e-Business Architecture
with Enterprise Search Engine
KOCSEA 2008
Jongwook Woo, PhD
[email protected]
High-Performance Internet Computing Center
Computer Information Systems Department
California State University, Los Angeles
Jongwook Woo
CSULA
HiPIC
Abstract
 e-Business architecture has been updated dramatically for the
past 10 years. The architecture moved from the traditional clientserver architecture to n-tier architecture since Internet and Web
came out to the world. Internet and web even create new product
and revenue as companies have invested in Information
Technology seriously not as an option. Content is one of the
examples in this new era. Content has become intellectual
property and a company or organization that has content can
make profit by providing the content to the customers and the
users. As the content increases high, there is a need for search
engine in order to provide the proper content to the user quickly.
Thus, search engine has received highlight in the content industry,
mostly portal sites. The paper introduces search engine especially internal search engine not web search engine - and its
fundamentals. Then, e-Business architecture with search engine
is illustrated.
Jongwook Woo
CSULA
HiPIC
Search Engine
 Information retrieval system
Help users find what they need
– In Internet or Intranet
Minimize time required to find information
 search engines
Web search engine
Enterprise search engine
Ex: Apache Lucene, Solr, FAST
Jongwook Woo
CSULA
HiPIC
Search Engine Basics
 Indexing
 Querying
 Ranking
 Lemmatization and Phrase search
 Geo search
 Search Navigation
Jongwook Woo
CSULA
HiPIC
Querying
 Basic of all search engine
 The user uses query to search
 With search terms AND, OR, NOT, NEAR
 The process of specify criteria about an
item on interest and have the engine
find the matching items.
 Syntax of querying
Formalized
Natural Language Search
Jongwook Woo
CSULA
HiPIC
Indexing
 How data is collected and stored
then retrieve once a user do a search.
Optimize the speed and performance of
finding relevant documents.
 Need to determine ranking by
Relevance and similarity
Determines the order in which it is presented
to the user
Jongwook Woo
CSULA
HiPIC
Lemmatized/Phrase Search
Lemmatized
A search which allows the user to enter
a lemma and return the inflected forms
Ex: hamburger vs hamburgers
Phrase
A query that matches documents
containing a particular sequence of
terms.
Ex: “The Great Wall”
Jongwook Woo
CSULA
HiPIC
Others
 Geo-search
 Parse text and identify the coordinates of the
geographic entities mentioned in the query
 Ex: East Palo Alto CA -> Latitude: 37.47 N,
Longitude: 122.14 W
 Facet Browsing
Search result can be categorized
and displays the counts of each search results
categorized
Jongwook Woo
CSULA
HiPIC
Local (Enterprise) Search
 Used by corporation to search in their
own intranets to retrieve information.
 Either used in public or not
 Portal sites provides and sells contents
Contents is digital info:
– text, multimedia data etc
Ex: citysearch.com, insiderpages.com,
superpages.com etc
Jongwook Woo
CSULA
HiPIC
Web Search
Used to search the World Wide
Web
Ex:
www.Google.com
www.msn.com
www.Yahoo.com
Probably no more market to
compete with these dinosaurs
Jongwook Woo
CSULA
HiPIC
Legacy N-Tier Architecture
Presentation Logic
Only for the user
Business Logic
Data Access Logic
Normally DB
Search query communicates with DB
Jongwook Woo
CSULA
HiPIC
N-Tier Web Architecture
n-Tier Architecture
DB Server
The user
searches
the site
DB Access Library
Business Logic
Presentation Logic
Jongwook Woo
CSULA
HiPIC
Latest N-Tier Architecture
 Presentation Logic
Both for the user and the author
 Business Logic
 Data Access Logic
Both for DB
 Search Access Logic
Connects Search Engine that search query
communicates with
Jongwook Woo
CSULA
HiPIC
N-Tier Web Architecture with
Enterprise Search Engine
n-Tier Architecture
Search
Engine
DB Server
Load/index DB to
Search DB
DB Access Library
Search DB
Search Access
Library
The user
searches
the site
Business Logic
Presentation Logic
Jongwook Woo
CSULA
HiPIC
Summary
 Search Engine should not be optional
for Internet Business
 Enterprise Search has been popular by
providing contents to the user and the
third parties
as contents are product
Don’t need to compete with Web Search
Business such as Google, Yahoo, MSN etc
Jongwook Woo
CSULA