Segmentation Model With Respect to the Difference in Means
Autor: | Marcin Owczarczuk |
---|---|
Rok vydání: | 2006 |
Předmět: |
education.field_of_study
Variables business.industry media_common.quotation_subject Population ANOVA regression trees segmentation Pattern recognition jel:C21 jel:C44 Regression Combinatorics jel:M31 Feature (machine learning) Segmentation Artificial intelligence business education Categorical variable Mathematics media_common |
Zdroj: | SSRN Electronic Journal. |
ISSN: | 1556-5068 |
DOI: | 10.2139/ssrn.955989 |
Popis: | The aim of the paper is to formulate and solve the following segmentation problem. Given is a population described by independent variables: X1, ... , Xn, (both continuous and categorical), the continuous dependent variable Y and the two-level categorical variable $\alpha$ with levels $\alpha = 1$ and $\alpha = 0$. $\bar{Y}_{\alpha=1}$ and $\bar{Y}_{\alpha=0}$ are the means of $Y$ for observations at levels $\alpha=1$ and $\alpha=0$, respectively. The goal is to create the segments of the population, described by the independent variables, that the difference in means $\bar{Y}_{\alpha=1}-\bar{Y}_{\alpha=0}$ is the feature that distinguishes the segments. I. e. the means should be as different as possible between segments and should be similar within the segment. The solution is based on regression trees approach. |
Databáze: | OpenAIRE |
Externí odkaz: |