Under what conditions does $H(Xmid f(Y))=H(Xmid Y)$?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP











up vote
2
down vote

favorite
1












I have the problem that I cannot solve:
Under what conditions does $H(X∣f(Y))=H(X∣Y)$?
I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$. Are they equal?



This is an exercise in the textbook. There is a solution, but I don't think it's correct (more precisely, it is not satisfactory enough). The provided solution is as below.



Suggested Solution (not satisfactory). If $H(X|g(Y )) = H(X|Y )$, then $H(X)−H(X|g(Y )) = H(X) − H(X|Y )$, i.e., $I(X; g(Y )) = I(X; Y )$. This is the condition for equality in the data processing inequality. From the derivation of the inequality, we have equality iff $X → g(Y ) → Y$ forms a Markov chain. Hence $H(X|g(Y )) = H(X|Y )$ iff $X → g(Y ) → Y$ . This condition includes many special cases, such as $g$ being one-to-one, and $X$ and $Y$ being independent. However, these two special cases do not exhaust all the possibilities.







share|cite|improve this question





















  • "it is not satisfactory enough" Why?
    – Clement C.
    Jul 30 at 0:35










  • I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$.
    – khahuras
    Jul 30 at 2:11










  • I would add when $ mathbbE left[ X mid Y right] = mathbbE left[ X mid f left( Y right) right] $ and $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $. Clearly when $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $ all are equal.
    – Royi
    Jul 30 at 4:51










  • @Royi, I don't understand your comment.
    – khahuras
    Jul 30 at 5:32










  • I said you should extend the question to the cases I wrote above in addition to what you wrote.
    – Royi
    Jul 30 at 6:04














up vote
2
down vote

favorite
1












I have the problem that I cannot solve:
Under what conditions does $H(X∣f(Y))=H(X∣Y)$?
I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$. Are they equal?



This is an exercise in the textbook. There is a solution, but I don't think it's correct (more precisely, it is not satisfactory enough). The provided solution is as below.



Suggested Solution (not satisfactory). If $H(X|g(Y )) = H(X|Y )$, then $H(X)−H(X|g(Y )) = H(X) − H(X|Y )$, i.e., $I(X; g(Y )) = I(X; Y )$. This is the condition for equality in the data processing inequality. From the derivation of the inequality, we have equality iff $X → g(Y ) → Y$ forms a Markov chain. Hence $H(X|g(Y )) = H(X|Y )$ iff $X → g(Y ) → Y$ . This condition includes many special cases, such as $g$ being one-to-one, and $X$ and $Y$ being independent. However, these two special cases do not exhaust all the possibilities.







share|cite|improve this question





















  • "it is not satisfactory enough" Why?
    – Clement C.
    Jul 30 at 0:35










  • I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$.
    – khahuras
    Jul 30 at 2:11










  • I would add when $ mathbbE left[ X mid Y right] = mathbbE left[ X mid f left( Y right) right] $ and $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $. Clearly when $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $ all are equal.
    – Royi
    Jul 30 at 4:51










  • @Royi, I don't understand your comment.
    – khahuras
    Jul 30 at 5:32










  • I said you should extend the question to the cases I wrote above in addition to what you wrote.
    – Royi
    Jul 30 at 6:04












up vote
2
down vote

favorite
1









up vote
2
down vote

favorite
1






1





I have the problem that I cannot solve:
Under what conditions does $H(X∣f(Y))=H(X∣Y)$?
I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$. Are they equal?



This is an exercise in the textbook. There is a solution, but I don't think it's correct (more precisely, it is not satisfactory enough). The provided solution is as below.



Suggested Solution (not satisfactory). If $H(X|g(Y )) = H(X|Y )$, then $H(X)−H(X|g(Y )) = H(X) − H(X|Y )$, i.e., $I(X; g(Y )) = I(X; Y )$. This is the condition for equality in the data processing inequality. From the derivation of the inequality, we have equality iff $X → g(Y ) → Y$ forms a Markov chain. Hence $H(X|g(Y )) = H(X|Y )$ iff $X → g(Y ) → Y$ . This condition includes many special cases, such as $g$ being one-to-one, and $X$ and $Y$ being independent. However, these two special cases do not exhaust all the possibilities.







share|cite|improve this question













I have the problem that I cannot solve:
Under what conditions does $H(X∣f(Y))=H(X∣Y)$?
I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$. Are they equal?



This is an exercise in the textbook. There is a solution, but I don't think it's correct (more precisely, it is not satisfactory enough). The provided solution is as below.



Suggested Solution (not satisfactory). If $H(X|g(Y )) = H(X|Y )$, then $H(X)−H(X|g(Y )) = H(X) − H(X|Y )$, i.e., $I(X; g(Y )) = I(X; Y )$. This is the condition for equality in the data processing inequality. From the derivation of the inequality, we have equality iff $X → g(Y ) → Y$ forms a Markov chain. Hence $H(X|g(Y )) = H(X|Y )$ iff $X → g(Y ) → Y$ . This condition includes many special cases, such as $g$ being one-to-one, and $X$ and $Y$ being independent. However, these two special cases do not exhaust all the possibilities.









share|cite|improve this question












share|cite|improve this question




share|cite|improve this question








edited Jul 30 at 2:11
























asked Jul 30 at 0:14









khahuras

184




184











  • "it is not satisfactory enough" Why?
    – Clement C.
    Jul 30 at 0:35










  • I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$.
    – khahuras
    Jul 30 at 2:11










  • I would add when $ mathbbE left[ X mid Y right] = mathbbE left[ X mid f left( Y right) right] $ and $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $. Clearly when $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $ all are equal.
    – Royi
    Jul 30 at 4:51










  • @Royi, I don't understand your comment.
    – khahuras
    Jul 30 at 5:32










  • I said you should extend the question to the cases I wrote above in addition to what you wrote.
    – Royi
    Jul 30 at 6:04
















  • "it is not satisfactory enough" Why?
    – Clement C.
    Jul 30 at 0:35










  • I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$.
    – khahuras
    Jul 30 at 2:11










  • I would add when $ mathbbE left[ X mid Y right] = mathbbE left[ X mid f left( Y right) right] $ and $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $. Clearly when $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $ all are equal.
    – Royi
    Jul 30 at 4:51










  • @Royi, I don't understand your comment.
    – khahuras
    Jul 30 at 5:32










  • I said you should extend the question to the cases I wrote above in addition to what you wrote.
    – Royi
    Jul 30 at 6:04















"it is not satisfactory enough" Why?
– Clement C.
Jul 30 at 0:35




"it is not satisfactory enough" Why?
– Clement C.
Jul 30 at 0:35












I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$.
– khahuras
Jul 30 at 2:11




I would like to draw a result about the relation between $p_X(cdot | g(Y))$ and $p_X(cdot | Y)$.
– khahuras
Jul 30 at 2:11












I would add when $ mathbbE left[ X mid Y right] = mathbbE left[ X mid f left( Y right) right] $ and $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $. Clearly when $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $ all are equal.
– Royi
Jul 30 at 4:51




I would add when $ mathbbE left[ X mid Y right] = mathbbE left[ X mid f left( Y right) right] $ and $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $. Clearly when $ p_X left( cdot mid Y right) = p_X left( cdot mid f left( Y right) right) $ all are equal.
– Royi
Jul 30 at 4:51












@Royi, I don't understand your comment.
– khahuras
Jul 30 at 5:32




@Royi, I don't understand your comment.
– khahuras
Jul 30 at 5:32












I said you should extend the question to the cases I wrote above in addition to what you wrote.
– Royi
Jul 30 at 6:04




I said you should extend the question to the cases I wrote above in addition to what you wrote.
– Royi
Jul 30 at 6:04










1 Answer
1






active

oldest

votes

















up vote
3
down vote



accepted










The data processing inequality lemma already provides the complete answer. $I(X;Y)=I(X;g(Y))$ holds if and only if $$tag1Xrightarrow Y rightarrow g(Y)$$ and $$tag2 Xrightarrow g(Y) rightarrow Y.$$



Now note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid yright)tag3,
endalign
$$
where the last equality holds since (1) implies that $g(Y)rightarrow Y rightarrow X$, i.e., $X$ is independent of $g(Y)$, when conditioned on $Y$. Also note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid g(y)right) tag4,
endalign
$$
where the last equality holds since (2) implies $Yrightarrow g(Y) rightarrow X$.



It follows from (3) and (4) that it must hold
$$
pleft(xmid g(y)right) = p(x mid y),
$$
for all valid values of $x$ and $y$. This is the most general condition you seek, which captures the cases of $g$ being one to one and $Y$ independent of $X$.






share|cite|improve this answer























  • What if it was extended to under which conditions $ mathbbE left[ X mid Y right] = mathbbE left[ X mid g left( Y right) right] $?
    – Royi
    Jul 30 at 17:05










  • Thanks for your solution @Stelios.
    – khahuras
    Jul 31 at 4:28










Your Answer




StackExchange.ifUsing("editor", function ()
return StackExchange.using("mathjaxEditing", function ()
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix)
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
);
);
, "mathjax-editing");

StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "69"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
convertImagesToLinks: true,
noModals: false,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
noCode: true, onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);








 

draft saved


draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2866558%2funder-what-conditions-does-hx-mid-fy-hx-mid-y%23new-answer', 'question_page');

);

Post as a guest






























1 Answer
1






active

oldest

votes








1 Answer
1






active

oldest

votes









active

oldest

votes






active

oldest

votes








up vote
3
down vote



accepted










The data processing inequality lemma already provides the complete answer. $I(X;Y)=I(X;g(Y))$ holds if and only if $$tag1Xrightarrow Y rightarrow g(Y)$$ and $$tag2 Xrightarrow g(Y) rightarrow Y.$$



Now note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid yright)tag3,
endalign
$$
where the last equality holds since (1) implies that $g(Y)rightarrow Y rightarrow X$, i.e., $X$ is independent of $g(Y)$, when conditioned on $Y$. Also note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid g(y)right) tag4,
endalign
$$
where the last equality holds since (2) implies $Yrightarrow g(Y) rightarrow X$.



It follows from (3) and (4) that it must hold
$$
pleft(xmid g(y)right) = p(x mid y),
$$
for all valid values of $x$ and $y$. This is the most general condition you seek, which captures the cases of $g$ being one to one and $Y$ independent of $X$.






share|cite|improve this answer























  • What if it was extended to under which conditions $ mathbbE left[ X mid Y right] = mathbbE left[ X mid g left( Y right) right] $?
    – Royi
    Jul 30 at 17:05










  • Thanks for your solution @Stelios.
    – khahuras
    Jul 31 at 4:28














up vote
3
down vote



accepted










The data processing inequality lemma already provides the complete answer. $I(X;Y)=I(X;g(Y))$ holds if and only if $$tag1Xrightarrow Y rightarrow g(Y)$$ and $$tag2 Xrightarrow g(Y) rightarrow Y.$$



Now note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid yright)tag3,
endalign
$$
where the last equality holds since (1) implies that $g(Y)rightarrow Y rightarrow X$, i.e., $X$ is independent of $g(Y)$, when conditioned on $Y$. Also note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid g(y)right) tag4,
endalign
$$
where the last equality holds since (2) implies $Yrightarrow g(Y) rightarrow X$.



It follows from (3) and (4) that it must hold
$$
pleft(xmid g(y)right) = p(x mid y),
$$
for all valid values of $x$ and $y$. This is the most general condition you seek, which captures the cases of $g$ being one to one and $Y$ independent of $X$.






share|cite|improve this answer























  • What if it was extended to under which conditions $ mathbbE left[ X mid Y right] = mathbbE left[ X mid g left( Y right) right] $?
    – Royi
    Jul 30 at 17:05










  • Thanks for your solution @Stelios.
    – khahuras
    Jul 31 at 4:28












up vote
3
down vote



accepted







up vote
3
down vote



accepted






The data processing inequality lemma already provides the complete answer. $I(X;Y)=I(X;g(Y))$ holds if and only if $$tag1Xrightarrow Y rightarrow g(Y)$$ and $$tag2 Xrightarrow g(Y) rightarrow Y.$$



Now note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid yright)tag3,
endalign
$$
where the last equality holds since (1) implies that $g(Y)rightarrow Y rightarrow X$, i.e., $X$ is independent of $g(Y)$, when conditioned on $Y$. Also note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid g(y)right) tag4,
endalign
$$
where the last equality holds since (2) implies $Yrightarrow g(Y) rightarrow X$.



It follows from (3) and (4) that it must hold
$$
pleft(xmid g(y)right) = p(x mid y),
$$
for all valid values of $x$ and $y$. This is the most general condition you seek, which captures the cases of $g$ being one to one and $Y$ independent of $X$.






share|cite|improve this answer















The data processing inequality lemma already provides the complete answer. $I(X;Y)=I(X;g(Y))$ holds if and only if $$tag1Xrightarrow Y rightarrow g(Y)$$ and $$tag2 Xrightarrow g(Y) rightarrow Y.$$



Now note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid yright)tag3,
endalign
$$
where the last equality holds since (1) implies that $g(Y)rightarrow Y rightarrow X$, i.e., $X$ is independent of $g(Y)$, when conditioned on $Y$. Also note that
$$
beginalign
p(x,g(y),y) &= p(y, g(y))pleft(xmid y, g(y)right)\
&=p(y, g(y))pleft(xmid g(y)right) tag4,
endalign
$$
where the last equality holds since (2) implies $Yrightarrow g(Y) rightarrow X$.



It follows from (3) and (4) that it must hold
$$
pleft(xmid g(y)right) = p(x mid y),
$$
for all valid values of $x$ and $y$. This is the most general condition you seek, which captures the cases of $g$ being one to one and $Y$ independent of $X$.







share|cite|improve this answer















share|cite|improve this answer



share|cite|improve this answer








edited Jul 30 at 15:40


























answered Jul 30 at 15:30









Stelios

2,051278




2,051278











  • What if it was extended to under which conditions $ mathbbE left[ X mid Y right] = mathbbE left[ X mid g left( Y right) right] $?
    – Royi
    Jul 30 at 17:05










  • Thanks for your solution @Stelios.
    – khahuras
    Jul 31 at 4:28
















  • What if it was extended to under which conditions $ mathbbE left[ X mid Y right] = mathbbE left[ X mid g left( Y right) right] $?
    – Royi
    Jul 30 at 17:05










  • Thanks for your solution @Stelios.
    – khahuras
    Jul 31 at 4:28















What if it was extended to under which conditions $ mathbbE left[ X mid Y right] = mathbbE left[ X mid g left( Y right) right] $?
– Royi
Jul 30 at 17:05




What if it was extended to under which conditions $ mathbbE left[ X mid Y right] = mathbbE left[ X mid g left( Y right) right] $?
– Royi
Jul 30 at 17:05












Thanks for your solution @Stelios.
– khahuras
Jul 31 at 4:28




Thanks for your solution @Stelios.
– khahuras
Jul 31 at 4:28












 

draft saved


draft discarded


























 


draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fmath.stackexchange.com%2fquestions%2f2866558%2funder-what-conditions-does-hx-mid-fy-hx-mid-y%23new-answer', 'question_page');

);

Post as a guest













































































Comments

Popular posts from this blog

What is the equation of a 3D cone with generalised tilt?

Color the edges and diagonals of a regular polygon

Relationship between determinant of matrix and determinant of adjoint?