METHODS FOR EVALUATING OF REGULARITIES SYSTEMS STRUCTURE
By: Kostomarova et al.

Abstract: A new method for analyzing systems of regularities is discussed. The regularities describe the effect of explanatory variables on an outcome, under the assumption that different levels of the outcome correspond to different subregions of the explanatory variable space. Such regularities can be uncovered effectively with the optimal valid partitioning (OVP) technique. The OVP approach searches for partitions of the explanatory variable space that best separate observations with different outcome levels. Partitions of the ranges of single variables, or of two-dimensional admissible areas for pairs of variables, are searched for within corresponding families of partitions. The output system of regularities is formed using statistical validity estimates obtained from two types of permutation tests. One problem with the OVP procedure is the large number of regularities in the output system for high-dimensional tasks. A new approach to evaluating the structure of the output system is suggested. It searches for a small subsystem whose associated predictors give a convex combination with the best possible forecasting ability. The mean error of a convex combination decreases as the average forecasting ability of the ensemble members improves and as the deviations between the forecasts associated with different regularities increase. Minimizing the mean error of the convex combination therefore yields a subsystem of regularities that have strong forecasting ability and differ significantly from each other. Each regularity in the output system can then be characterized by its distances to the regularities in the subsystem.
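
The relationship between the error of a convex combination, the average error of its members, and the spread of their forecasts can be illustrated with a minimal sketch. It assumes squared error as the error measure, which the abstract does not specify; under that assumption the mean error of the combination equals the weighted mean error of the members minus the weighted mean squared deviation of the members from the combined forecast. The function name `convex_combination_errors` and the sample data are hypothetical and are not taken from the paper.

```python
def convex_combination_errors(predictions, weights, targets):
    """Return (ensemble_mse, member_mse, ambiguity) for a convex combination.

    predictions: one list of forecasts per predictor (per regularity)
    weights:     non-negative weights that sum to 1
    targets:     observed outcome values
    Assumption: squared error is the error measure.
    """
    if any(w < 0 for w in weights) or abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    n = len(targets)
    ensemble_mse = member_mse = ambiguity = 0.0
    for j in range(n):
        # Forecast of the convex combination for observation j.
        combined = sum(w * p[j] for w, p in zip(weights, predictions))
        ensemble_mse += (combined - targets[j]) ** 2 / n
        for w, p in zip(weights, predictions):
            # Weighted error of each member against the observed outcome.
            member_mse += w * (p[j] - targets[j]) ** 2 / n
            # Weighted deviation of each member from the combined forecast.
            ambiguity += w * (p[j] - combined) ** 2 / n
    return ensemble_mse, member_mse, ambiguity


# Hypothetical data: two predictors, three observations.
targets = [1.0, 2.0, 3.0]
predictions = [[1.5, 2.5, 2.0], [0.5, 1.5, 3.5]]
weights = [0.5, 0.5]

ens, mem, amb = convex_combination_errors(predictions, weights, targets)
# Under squared error, ens equals mem - amb up to rounding.
print(ens, mem - amb)
```

For fixed member errors, a larger deviation term gives a smaller combined error, which is why minimizing the combined error favors predictors that are both accurate and different from each other.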

Keywords: Optimal partitioning, statistical validity, permutation test, regularities, explanatory variables effect, complexity

ACM Classification Keywords: H.2.8 Database Applications - Data mining, G.3 Probability and Statistics - Nonparametric statistics, Probabilistic algorithms

Authors: Irina Kostomarova, Anna Kuznetsova, Natalia Malygina, Oleg Senko

Link: http://foibg.com/ibs_isc/ibs-16/ibs-16-p05.pdf
