Tuesday, January 26, 2010

The Scatter Diagram

Firstly for the regression we use the scatter diagram for the studying the relationship between two variables. It study the relationship between the two variables.

Example:-




In the above example, the points are plotted by assigning values of the independent variable X to the horizontal axis and values of the dependent variable Y to the vertical axis.
The pattern made by the points plotted on the scatter diagram usually suggest the basic nature and strength of the relationship between two variables. The scatter diagram also shows that, subjects with large waist circumferences also have larger amounts of deep abdominal AT. These impressions suggest that the relationship between the two variables may be described by a straight line crossing the Y-axis below the origin and making approximately a 45-degree angle with the X-axis.

Monday, January 25, 2010

The Sample Regression Equation

In simple linear regression the object of the researcher's interest is the population regression- the regression that describes the true relationship between the dependent variable Y and the independent variable X.

In an effort to reach a decision regarding the likely form of this relationship, the researcher draws a sample from the population of interest and, using the resulting data, computes a sample regression equation that forms the basis for reaching conclusions regarding the unknown population regression equation.


Steps in Regression Analysis
In the absence of extensive information regarding the nature of the variables of interest, a frequently employed strategy is to assume initially that they are linearly related. Subsequently analysis, then, involves the following steps;
1- Determine whether or not the assumptions underlying a linear relationship are met in the data available for analysis.
2- Obtain the equation for the line that best fits the sample data.
3- Evaluate the equation to obtain some idea of the strength of relationship and the usefulness of the equation for predicting and estimating.
4- If the data appear to conform satisfactorily to the linear model, use the equation obtained from

the sample data to predict and to estimate.



Sunday, January 24, 2010

The Regression Model

In the typical regression problem, as in most problems in applied statistics, researchers have available for analysis a sample of observations from some real or hypothetical population. Based on the results of their analysis of the sample data, they are interested in reaching decisions about the population from which the sample is presumed to have been drawn. It is important, therefore, that the researchers understand the nature of the population in which they are interested. They should know enough about the population to be able either to construct a mathematical model for its representation or to determine if it reasonably fits some established model. A researcher about to analyze a set of data by the methods of simple linear regression, e.g. should be secure in the knowledge that the simple linear regression model is, at least, an approximate representation of the population. It is unlikely that the model will be a perfect portrait of the real situation, since this characteristics is seldom found in models of practical value. A model constructed so that it corresponds precisely with the details of the situation is usually too complicated to yield any information of value. On the other hand, the results obtained from the analysis of data that have been forced into a model that does not fit are also worthless. Fortunately. however, a perfectly fitting model is not a requirement for obtaining useful results. researchers, then, should be able to distinguish between the occasion when their chosen models and the data are sufficiently compatible for them to proceed and the case where their chosen model must be abandoned.
Assumptions of  Regression Model

 

Friday, January 22, 2010

Simple Linerar Regeression and Correlation

In analyzing data for the health sciences disciplines, we find that it is frequently desireable to learn something about the relationship between two variables. We may, for example, be intrested in studying the relationship between blood pressure and age, height and weight, the concentration of an injected drug and heart rate, the consumption level of some nutrient and weight gain, the intensity of a stimulus and reaction time, or total family income and medical care expenditures. the nature and strength of the relationship between variables such as these may be examined by Regression and Correlation analysis, two statistical techniques that, although related, serve different purposes.

Regression
regression analysis is helpful in ascertaining the probable form of relationship between variables, and the ultimate objective when this method of analysis is employed usually is to predict or estimate the value of one varianle corresponding to a given value of another variable. The ideas of regression were first elucidated by the English Scientist Sir Francis Galton in reports of his research on heridity ( firstly in the sweet peas and lator in human stature. He described a tndency of adult offspring, having either shortb or tall parens, to revert back toward the average height of general population.He first used the word reversion, and later regression, to refer to this phenomenon.

Correlation
on the other hand, correlation is concerd with measuring the strength of the ralationship between variables. When we compute measures of correlation from a set of data, we are interested in the degree of the correlation between variables. The concept and the terminology of correlation analysis originated with Galton, who first used the word correlation.


Thursday, January 21, 2010

Factorial Experiment

In the experimental designs that we have considered up to this point we have been interested in the effects of only one variable, the treatments. Frequently, however, we may be interested in studying, simultaneously, the effects of two or more variables. we refer to the variables in which we are interested as factors. The experiment in which two or more factors are investigated simultaneously is called a Factorial Experiment.
the different designated categories of the factors are called levels. Suppose, for example, that we are studying the effect on reaction time of three dosages of second factor of interest in the study is age, and it is thought that two age groups, under 65 years and 65 years and over, should be included. We then have two levels of the age factor. In general, we say that factor A occurs at a levels and factor B occurs at b levels. In factorial experiment we may study not only the effects of individual factors but also, if the experiment is properly conducted, the interaction between factors.
Advantages
The following are the advantages of the Factorial experiment;
1- The interaction of the factors may be studied.
2- There is a saving of time and effort.
3- Since the various factors are combined in one experiment, the results have a wider range of application.

Wednesday, January 20, 2010

Repeated Measures Design

One of the most frequently used experimental designs in the health sciences field is the repeated measures design.
Definition
                 ''A repeated meausres design is one in wich measurements of the same variable are made on each subject on two or more different occasions.''
The usual motivation for using a repeated measures design is a desire to control for variability among subjects. In such a design each subject serves as its own control. When measurements are taken on only two occasions we have the paired comparisions design that we discussed. One of the most frequently encountered situations in which the repeated measures design is used is the situation in which the investigator is concerned with reponses over time.
Advantage
The major advantage of the repeated measures design is, as previously mentiond, its ability to cintrol for extraneous variation among subjects. An additional advantage is the fact that fewer subjects are needed for the repeated measures design than for a design in which different subjects are used for each occasion on which measurements are made. Suppose, for example , that we have four treatments ( in the usual sense) or four points in time on each of which we would like to have 10 measurements. If a different sample of subjects is used for each of the four treatments or points in time, 40 subjects would be required . If we are able to take measurements on the same subject for each treatment or point in time, i.e if we can use a repeated measures design, only 10 subjects would be required. This can be a very attractive advantage if subjects are scarce or expensive to recruit.
Disadvantage
A major potential problem to be on the alert for is what is known as the carry-over effect. When two or more treatments are being evaluated, the investigator should make sure that a subject's response to one treatment does not reflect a residual effect from previous treatments. This problem can frequently be solved by allowing a sufficient length of time between treatments. Another possible problem is the position effect. A subject's response to a treatment experienced last in a sequence may be different from the response that would have occured if the treatment had been first in the sequence. In certain studies, such as those involving physical participation on the part of the subjects, enthusiasm that is high at the beginning of the study may give way to boredom toward the end. A way around this problem is to randomized the sequence of treatments independently for each subject. 

Monday, January 18, 2010

Tukey's Test for Unequall Sample Sizes

When the samples are not equal or not having same size, Then the Tuky's Test is not applicable. Spotvoll and Stoline have extended Tukey Procedure to the case where the sample sizes are different.Their Procedure , which is applicable for experiments involving three or more treatments and significance levels of 0.05 or less, consists of replacing n in the following equation;


the smallest of the two sample sizes associated with the two sample means that are to be compared. If we designate the new quantity by HSD, we have as the new test criterion