Aster Analytics 5.11 Release Notes - Information Products

Aster Analytics 5.11 Release Notes
Product ID: B700-1002-511K
Last updated: October 29, 2013
Aster Analytics version: 5.11
Summary
This document describes the new and revised functions in Aster Analytics version 5.11. It also
lists known and fixed issues for this release.
Contents
•
Who Should Install this Release?
•
New Functions
•
Revised Functions
•
Getting and Installing Aster Analytics Foundation
•
Backward Compatibility Notes
•
Fixed Issues
•
Naming Convention for Teradata Aster Releases
•
Contacting Teradata Global Technical Support (GTS)
•
Third Party Licenses
•
About This Document
Aster Analytics 5.11 Release Notes
1
Aster Analytics 5.11 Release Notes
Who Should Install this Release?
Who Should Install this Release?
All Teradata Aster installations using Aster Database 5.0 or later should consider installing this
release for all of the Aster Analytics functions they have licensed from Teradata.
New Functions
This release introduces the following new functions:
Time Series, Path, and Attribution Analysis
•
Dynamic time warping (DTW): Measures the similarity between two sequences that vary
in time or speed.
Text Analysis Functions
•
TextTagging: Tags input tuples according to user-defined rules. These rules comprise
logical and text processing operators.
•
Latent Dirichlet Allocation (LDA) functions:
•
•
LDATrainer: Builds a topic model based on the supplied training data and parameters.
•
LDAInference: Estimates the topic distribution for each document based on the
generated model.
•
LDATopicPrinter: Displays the readable information of the model.
TF_IDF: Evaluates the importance of a word within a specific document, weighted by the
number of times the word appears in the entire corpus of documents.
Association Analysis Functions
•
WSRecommender: An item-based, collaborative filtering function that uses a weightedsum algorithm to make recommendations (for example, items or products that users
should consider purchasing).
Data Transformation Functions
•
MurmurHash: Computes the hash value of the input columns.
•
IdentityMatch: Tries to match enterprise customers with users records provided by
external data sources.
Aster Analytics 5.11 Release Notes
2
Aster Analytics 5.11 Release Notes
Revised Functions
Revised Functions
SSL JDBC support has been added to all SQL-MR driver functions.
•
minhash
•
KNN
•
kmeans
•
forest_drive
•
forest_predict
•
naiveBayesPredict
•
path_analyzer
•
outlierFilter
•
eigen_centrality
•
local_clustering_coefficient
•
TrainSentimentExtractor
•
TrainNamedEntityFinder
•
TextclassifierTrainer
•
GLMPredict
•
GLM
•
frequentpaths
•
single_tree_drive
•
triangle_finder
•
rectangle_finder
•
pagerank
•
degrees
•
cfilter
•
dwt
•
idwt
•
dwt2D
•
idwt2D
•
ldaInference
•
ldaTrainer
For more information, see “Connecting to Aster Database Using SSL JDBC Connections” in
Chapter 2 of the Aster Analytics Foundation User Guide.
In addition, new versions of the following functions have been released:
Aster Analytics 5.11 Release Notes
3
Aster Analytics 5.11 Release Notes
Revised Functions
Time Series, Path and Attribution Analysis
•
Attribution: Added multiple input FACT support. Also, this function wad enhanced to
provide a time-weighted exponential model in addition to row-number-based
calculations.
•
FrequentPath: Modified to avoid out-of-memory errors when processing certain large
transactions. This modification has impacted the performance of this function.
Statistical Analysis Functions
•
ConfusionMatrix: Added precision and recall support. Also, the function now generates a
table as output.
•
CORR_MAP: Now accepts range of column indices in the COLUMNPAIRS argument.
•
Enhanced Histogram Function (hist_map): Added support for building separate
histograms for distinct group by values.
•
Support Vector Machines (SVM) functions: The SVM function have been replaced by
these functions, which significantly improve performance:
Notice! The old SVM functions are included in the 5.11 Analytics Foundation Bundle, but should not be used.
•
SparseSVMTrainer: Builds a predictive model according to a training set.
•
SparseSVMPredictor: Gives a prediction for each sample in the test set.
•
SVMModelPrinter: Displays the readable information of the model.
Text Analysis Functions
•
TextTokenizer: Added support for user-defined dictionary.
•
ExtractSentiment:
•
•
Added support for Chinese and English phrases.
•
The ONLY_SUBJECTIVE_SENTENCE argument has been deprecated.
•
In the MODEL argument, MAX_ENTROPY is replaced by CLASSIFICATION to
clarify a classification model is used. MAX_ENTROPY limited the model to only one
kind of classification type.
•
The out_content column is deleted if the level is DOCUMENT.
•
The out_feature column is deleted.
TrainSentimentExtractor: In the previous version, a maximum entropy model is trained
by trainSentimentExtractor itself. In this version, the textClassifierTrainer function is
called directly to benefit from the continuous update of textClassifierTrainer.
Data Transformation Functions
•
XMLParser: Added an error handling argument.
•
XMLRelation: Added an error handling argument.
Aster Analytics 5.11 Release Notes
4
Aster Analytics 5.11 Release Notes
Getting and Installing Aster Analytics Foundation
Getting and Installing Aster Analytics
Foundation
To get the release, obtain the Aster Analytics Foundation 5.11 installer for the functions you
have licensed.
Backward Compatibility Notes
•
Internationalization is not completely supported on Aster Database versions 5.0, 5.0.1,
5.0.2, and 5.10.
•
On Aster Database versions 5.0, 5.0.1, 5.0.2, and 5.10, SQL-MR and SQL queries crash
when relations do not contain columns.
Known Issues
When GLM function uses the Poisson family with non-canonical links and the parameters are
not initialized properly, the function throws an error message.
Fixed Issues
Permission denied for schema "public" and for user "loadusr"
Issue ID: ANLY-707
Details: There are permissions issues with various SQL-MR functions when a user who does
not have CREATE privileges on the public schema attempts to use the functions. An example
error message is:
ERROR: SQL-MR function TEXTCLASSIFIERTRAINER failed: Error occurred when
install model file: Fail to install file: /tmp/1379088204401/knn.bin
Message:[AsterData][NClusterJDBCDSII](34) ERROR: permission denied for
schema "public" for user "loadusr" ()
A note has been added to Chapter 2 of the Aster Analytics Foundation User Guide:
“Before you run SQL-MR functions, make sure you have the appropriate CREATE privileges
on the public schema. This helps you avoid permission issues in cases where CREATE
privileges on the public schema are needed.”
Aster Analytics 5.11 Release Notes
5
Aster Analytics 5.11 Release Notes
Fixed Issues
SQL-MR function FOREST_DRIVE fails because permission was
denied
Issue ID: ANLY-708
Details: SQLMR function does not run unless the “public” and “loadusr” user is granted
CREATE on public schema:
"ERROR: SQL-MR function FOREST_DRIVE failed:
[AsterData][NClusterJDBCDSII](34) ERROR: permission denied for schema
"public" for user "loadusr" ()
This issue has been fixed.
Issue with writing to public schema
Issue ID: ANLY-699
Details: Some functions like knn and glm refer to the public schema, which causes issues when
the Aster Foundation library is not installed within that schema. To resolve this issue, if you
install these functions in a schema other than 'public', add that schema into the search path.
The forest_drive function creates an unnecessary table
(default_dt_monitor_table) on each call
Issue ID: ANLY-694
Details: The default_dt_monitor_table table is a useful table that the function uses to store
metadata about the decision tree it creates. Two arguments were added MONITORTABLE and
DROPMONITORTABLE to the function that let you specify the name of this table (the
default is default_dt_monitor_table) and whether to drop an existing table monitor table with
the same name.
The Aster Analytics Foundation User Guide contains descriptions of these two arguments.
The forest_drive function is not taking 'numeric(12,2)' as a
datatype
Issue ID: ANLY-673
Details: This error message appears when using the forest_drive function:
JDBC Out of Memory Error.
This issue has been fixed, but at a cost to performance.
The FrequentPaths function fails due to out-of-memory error
Issue ID: ANLY-673
Details: In cases of large transactions, an out-of-memory error is generated:
Error: Columns in 'numericInputs' clause must be numeric
This issue has been fixed.
Aster Analytics 5.11 Release Notes
6
Aster Analytics 5.11 Release Notes
Naming Convention for Teradata Aster Releases
Naming Convention for Teradata Aster
Releases
Teradata Aster release naming convention has changed beginning in January 2013. The new
release naming convention is as follows:
•
AD: Aster Database
•
AC: Aster Client
•
AA: Aster Analytics Foundation
Release numbering for all of the above will follow this convention:
XX.YY.ZZ.nn
where:
•
XX is a major release.
•
YY is a minor release.
•
ZZ is a maintenance release.
•
nn is a efix or hotpatch.
We use this terminology:
•
major release: Major releases of Aster typically introduce new features.
•
minor release: Point releases of Aster Database typically provide feature enhancements
and bug fixes. A point release may also introduce new features.
•
maintenance release: Maintenance releases of Aster typically provide only bug fixes. They
may also include feature enhancements.
•
efix or hotpatch: Efix and hotpatch releases are introduced to address customer issues of
an urgent nature.
Contacting Teradata Global Technical Support
(GTS)
For assistance and updated documentation, contact Teradata Global Technical Support
(GTS):
•
Support Portal: http://tays.teradata.com/
•
International: 212-444-0443
•
US Customers: 877-698-3282
•
Toll Free Number: 877-MyT-Data
Aster Analytics 5.11 Release Notes
7
Aster Analytics 5.11 Release Notes
Third Party Licenses
Third Party Licenses
Your Aster installation includes a number of open source products. The license text for these
products is available in the Aster Database User Guide, in the chapter “Licenses” and on your
Aster queen, as a set of text files in the /home/beehive/licenses directory.
About This Document
Aster Analytics Release Notes, version 5.11, 1st edition, 2013-10-17
Copyright and Legal Statements
The product or products described in this book are licensed products of Teradata Corporation
or its affiliates.
Teradata, Aster, Aster Data, nCluster, SQL-MapReduce, Aprimo, BYNET, DBC/1012,
DecisionCast, DecisionFlow, DecisionPoint, Eye logo design, InfoWise, Meta Warehouse,
MyCommerce, SeeChain, SeeCommerce, SeeRisk, Teradata Decision Experts, Teradata Source
Experts, WebAnalyst, “More Data. Big Insights,” and “You’ve Never Seen Your Business Like
This Before” are trademarks or registered trademarks of Teradata Corporation or its affiliates.
Adaptec and SCSISelect are trademarks or registered trademarks of Adaptec, Inc. AMD
Opteron and Opteron are trademarks of Advanced Micro Devices, Inc. BakBone and NetVault
are trademarks or registered trademarks of BakBone Software, Inc. EMC, PowerPath, SRDF,
and Symmetrix are registered trademarks of EMC Corporation. GoldenGate is a trademark of
GoldenGate Software, Inc.Hewlett-Packard and HP are registered trademarks of HewlettPackard Company. Intel, Pentium, and XEON are registered trademarks of Intel Corporation.
IBM, CICS, RACF, Tivoli, and z/OS are registered trademarks of International Business
Machines Corporation. Linux is a registered trademark of Linus Torvalds. LSI and Engenio
are registered trademarks of LSI Corporation. Microsoft, Active Directory, Windows,
Windows NT, and Windows Server are registered trademarks of Microsoft Corporation in the
United States and other countries. MicroStrategy is a registered trademark of MicroStrategy
Incorporated. Novell and SUSE are registered trademarks of Novell, Inc., in the United States
and other countries. QLogic and SANbox are trademarks or registered trademarks of QLogic
Corporation. RedHat is a registered trademark of Red Hat, Inc. SAS and SAS/C are
trademarks or registered trademarks of SAS Institute Inc. SPARC is a registered trademark of
SPARC International, Inc. Sun Microsystems, Solaris, Sun, and Sun Java are trademarks or
registered trademarks of Sun Microsystems, Inc., in the United States and other countries.
Symantec, NetBackup, and VERITAS are trademarks or registered trademarks of Symantec
Corporation or its affiliates in the United States and other countries. Ubuntu and Canonical
are registered trademarks of Canonical Ltd. Unicode is a collective membership mark and a
service mark of Unicode, Inc. UNIX is a registered trademark of The Open Group in the
United States and other countries. Other product and company names mentioned herein may
be the trademarks of their respective owners.
Aster Analytics 5.11 Release Notes
8
Aster Analytics 5.11 Release Notes
About This Document
THE INFORMATION CONTAINED IN THIS DOCUMENT IS PROVIDED ON AN “AS-IS”
BASIS, WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESS OR IMPLIED,
INCLUDING THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A
PARTICULAR PURPOSE, OR NON-INFRINGEMENT. SOME JURISDICTIONS DO NOT
ALLOW THE EXCLUSION OF IMPLIED WARRANTIES, SO THE ABOVE EXCLUSION
MAY NOT APPLY TO YOU. IN NO EVENT WILL TERADATA CORPORATION BE LIABLE
FOR ANY INDIRECT, DIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL
DAMAGES, INCLUDING LOST PROFITS OR LOST SAVINGS, EVEN IF EXPRESSLY
ADVISED OF THE POSSIBILITY OF SUCH DAMAGES.
The information contained in this document may contain references or cross-references to
features, functions, products, or services that are not announced or available in your country.
Such references do not imply that Teradata Corporation intends to announce such features,
functions, products, or services in your country. Please consult your local Teradata
Corporation representative for those features, functions, products, or services available in
your country. Information contained in this document may contain technical inaccuracies or
typographical errors. Information may be changed or updated without notice. Teradata
Corporation may also make improvements or changes in the products or services described in
this information at any time without notice.
If you’d like to help maintain the quality of our product documentation, please send us your
comments on the accuracy, clarity, organization, and usefulness of this document. You can
send your comments to [email protected].
Any comments or materials (collectively referred to as “Feedback”) sent to Teradata
Corporation will be deemed non-confidential. Teradata Corporation will have no obligation
of any kind with respect to Feedback and will be free to use, reproduce, disclose, exhibit,
display, transform, create derivative works of, and distribute the Feedback and derivative
works thereof without limitation on a royalty-free basis. Further, Teradata Corporation will be
free to use any ideas, concepts, know-how, or techniques contained in such Feedback for any
purpose whatsoever, including developing, manufacturing, or marketing products or services
incorporating Feedback.
Copyright © 2013 by Teradata Corporation. All Rights Reserved.
Release Chronology
This release chronology covers Aster Analytics release only. Prior to this list, Aster Analytics
Foundation was released with Aster Database.
Aster Analytics 5.0-Release1 (October 2012); 5.0-Release 2 (December 2012)
Aster Analytics 5.11 (October 2013)
Aster Analytics 5.11 Release Notes
9