Ace Your Reading and interpreting data Tests with Plainmath's Expert Help and Detailed Resources

Recent questions in Reading and interpreting data
Pre-AlgebraAnswered question
Yahir Tucker Yahir Tucker 2022-06-26

I was reading a tutorial written on Linear Regression by Avi Kak. There is a part about geometric interpretation of linear regression on pg.19.
The optimum solution for β~ that minimizes the cost function C( β ) in Eq. (14) possesses the following geometrical interpretation: Focusing on the equation y = X β , the measured vector y on the left resides in a large N dimensional space. On the other hand, as we vary β in our search for the best possible solution, the space spanned by the product X β will be a (p+1)-dimensional subspace (a hyperplane, really) in the N dimensional space in which y resides. The question now is: which point in the hyperplane spanned by X β is the best approximation to the point y which is outside the hyperplane. For any selected value for β , the “error” vector y − X β will go from the tip of the vector X β to the tip of the y vector. Minimization of the cost function C in Eq. (14) amounts to minimizing the norm of this difference vector.
I could not understand how to relate N-dimensional space and (p+1)-dimensional subspace. B vector defines a (p+1) dimensional subspace but I could not understand why N dimensional space contains the (p+1) subspace. As I understand in (p+1) each dimension means features but in N dimensional space each dimension means a data point. I'm a lot confused about the idea. Are there any other resource that explains the idea in a much more detail? or Could anyone explains the idea how these spaces relate?

Pre-AlgebraAnswered question
rigliztetbf rigliztetbf 2022-06-26

I'm reading about Bayesian data analysis by Gelman et al. and I'm having big trouble interpreting the following part in the book (note, the rat tumor rate θ in the following text has: θ B e t a ( α , β )
Choosing a standard parameterization and setting up a ‘noninformative’ hyperprior dis- tribution.
Because we have no immediately available information about the distribution of tumor rates in populations of rats, we seek a relatively diffuse hyperprior distribution for ( α , β ). Before assigning a hyperprior distribution, we reparameterize in terms of logit ( α α + β ) = log ( α β ) and log ( α + β ), which are the logit of the mean and the logarithm of the ‘sample size’ in the beta population distribution for θ . It would seem reasonable to assign independent hyperprior distributions to the prior mean and ‘sample size,’ and we use the logistic and logarithmic transformations to put each on ( , ) scale. Unfortunately, a uniform prior density on these newly transformed parameters yields an improper posterior density, with an infinite integral in the limit ( α + β ) , and so this particular prior density cannot be used here.
In a problem such as this with a reasonably large amount of data, it is possible to set up a ‘noninformative’ hyperprior density that is dominated by the likelihood and yields a proper posterior distribution. One reasonable choice of diffuse hyperprior density is uniform on ( α α + β , ( α + β ) 1 / 2 ), which when multiplied by the appropriate Jacobian yields the following densities on the original scale,
p ( α , β ) ( α + β ) 5 / 2 ,
and on the natural transformed scale:
p ( log ( α β ) , log ( α + β ) ) α β ( α + β ) 5 / 2 .
My problem is especially the bolded parts in the text.
Question (1): What does the author explicitly mean by: "is uniform on ( α α + β , ( α + β ) 1 / 2 )
Question (2): What is the appropriate Jacobian?
Question (3): How does the author arrive into the original and transformed scale priors?
To me the book hides many details under the hood and makes understanding difficult for a beginner on the subject due to seemingly ambiguous text.
P.S. if you need more information, or me to clarify my questions please let me know.

Pre-AlgebraAnswered question
Semaj Christian Semaj Christian 2022-06-26

Survivor function of a variable that has discrete and continuous components
I'm currently reading The Statistical Analysis of Failure Time Data by Kalbfleisch and Prentice and had trouble at arriving at the expression for the survivor function of a random variable T having both discrete and continuous components. The setup is the following:
Let T be a random variable on [ 0 , ) with survivor function F(t)=P(T>t). Then
if T is absolutely continuous with density f, then the hazard function λ can be defined as
λ ( t ) := lim h 0 + P ( t T < t + h T t ) h = f ( t ) F ( t )
for t 0, and hence we have
F ( t ) = exp ( 0 t λ ( s ) d s ) , t 0.
if T is discrete taking on the values 0 a 1 < a 2 < , then we define the hazard at a i as
λ i = P ( T = a i T a i ) , i = 1 , 2 , .
Then we can show that
F ( t ) = j a j t ( 1 λ j ) , t 0.
These expressions for the survivor functions I am ok with. Now they write the following:
More generally, the distribution of T may have both discrete and continuous components. In this case, the hazard function can be defined to have the continuous component λ c ( t ) and discrete components λ 1 , λ 2 , at the discrete times a 1 < a 2 <
The overall survivor function can then be written
(1) F ( t ) = exp ( 0 t λ c ( s ) d s ) j a j t ( 1 λ j ) .
That T has both discrete and continuous components means that the distribution of T is of the form
P T ( d x ) = f c ( x ) λ ( d x ) + j = 1 b j δ a j ( d x )
or equivalently
P ( T A ) = A f c ( x ) d x + j a j A b j
for some sequence a 1 < a 2 < and b i ( 0 , 1 ) and some non-negative measurable function f c with 0 f c d λ + i = 1 b j = 1. If we define
λ c ( t ) = f c ( t ) P ( T t ) = f c ( t ) F ( t ) , t a i ,
and
λ i = P ( T = a i T a i ) ,
then how do I show (and is it even true) that the survivor function of T is given by (1)?

Interpreting data questions are mostly approached with the help of complex word problems where logic always comes first. Take a look at the list of questions posted below and see how certain equations have been used. The majority of answers presented will have a free take on things as there are no definite rules in certain scenarios. Reading through these solutions will help you learn how to interpret and read various scientific data. It is an essential aspect for engineers and architects, as well as students majoring in sociology or related sciences.