<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments for Radford Neal&#039;s blog</title>
	<atom:link href="http://radfordneal.wordpress.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://radfordneal.wordpress.com</link>
	<description></description>
	<lastBuildDate>Thu, 02 May 2013 20:17:06 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>Comment on Design Flaws in R #2 — Dropped Dimensions by Anonymous</title>
		<link>http://radfordneal.wordpress.com/2008/08/20/design-flaws-in-r-2-%e2%80%94-dropped-dimensions/#comment-940</link>
		<dc:creator><![CDATA[Anonymous]]></dc:creator>
		<pubDate>Thu, 02 May 2013 20:17:06 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=302#comment-940</guid>
		<description><![CDATA[Thank you, this is a very useful tip! I have an automated porgram that has to deal with this situation often. The &quot;drop&quot; statement shortened my program by at least 20 lines!]]></description>
		<content:encoded><![CDATA[<p>Thank you, this is a very useful tip! I have an automated porgram that has to deal with this situation often. The &#8220;drop&#8221; statement shortened my program by at least 20 lines!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on The Harmonic Mean of the Likelihood:  Worst Monte Carlo Method Ever by Greg</title>
		<link>http://radfordneal.wordpress.com/2008/08/17/the-harmonic-mean-of-the-likelihood-worst-monte-carlo-method-ever/#comment-936</link>
		<dc:creator><![CDATA[Greg]]></dc:creator>
		<pubDate>Thu, 18 Apr 2013 23:35:17 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=192#comment-936</guid>
		<description><![CDATA[Dear Prof. Neal,

Thank you so much for this post, I&#039;ve found it really useful. I&#039;m about to implement some method to calculate the marginal likelihood of an econometric model and now I see that the harmonic mean estimator is not an attractive solution. What do you think about Geweke&#039;s (1999) modified harmonic mean estimator (p.46 here: http://www.tandfonline.com/doi/pdf/10.1080/07474939908800428)?

Thank you for your help in advance!

Best,
Greg]]></description>
		<content:encoded><![CDATA[<p>Dear Prof. Neal,</p>
<p>Thank you so much for this post, I&#8217;ve found it really useful. I&#8217;m about to implement some method to calculate the marginal likelihood of an econometric model and now I see that the harmonic mean estimator is not an attractive solution. What do you think about Geweke&#8217;s (1999) modified harmonic mean estimator (p.46 here: <a href="http://www.tandfonline.com/doi/pdf/10.1080/07474939908800428" rel="nofollow">http://www.tandfonline.com/doi/pdf/10.1080/07474939908800428</a>)?</p>
<p>Thank you for your help in advance!</p>
<p>Best,<br />
Greg</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Design Flaws in R #2 — Dropped Dimensions by Clement Kent</title>
		<link>http://radfordneal.wordpress.com/2008/08/20/design-flaws-in-r-2-%e2%80%94-dropped-dimensions/#comment-932</link>
		<dc:creator><![CDATA[Clement Kent]]></dc:creator>
		<pubDate>Wed, 06 Mar 2013 23:06:17 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=302#comment-932</guid>
		<description><![CDATA[My previous comment on APL, the predecessor of R and S, applies well to Eli&#039;s complaint. APL, and a number of non-S languages that descend from it, do not drop dimensions, have rational designs for permutation, transposition, and subsetting, and allow a larger number of operations along dimensions of arrays than R. I&#039;m not going backward to use APL, but really wish R could be upgraded to be as good as its grandparent is for arrays.]]></description>
		<content:encoded><![CDATA[<p>My previous comment on APL, the predecessor of R and S, applies well to Eli&#8217;s complaint. APL, and a number of non-S languages that descend from it, do not drop dimensions, have rational designs for permutation, transposition, and subsetting, and allow a larger number of operations along dimensions of arrays than R. I&#8217;m not going backward to use APL, but really wish R could be upgraded to be as good as its grandparent is for arrays.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Design Flaws in R #2 — Dropped Dimensions by Radford Neal</title>
		<link>http://radfordneal.wordpress.com/2008/08/20/design-flaws-in-r-2-%e2%80%94-dropped-dimensions/#comment-931</link>
		<dc:creator><![CDATA[Radford Neal]]></dc:creator>
		<pubDate>Wed, 06 Mar 2013 20:29:41 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=302#comment-931</guid>
		<description><![CDATA[For the three-dimensional case, you maybe should use the &quot;aperm&quot; function.]]></description>
		<content:encoded><![CDATA[<p>For the three-dimensional case, you maybe should use the &#8220;aperm&#8221; function.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Design Flaws in R #2 — Dropped Dimensions by Eli</title>
		<link>http://radfordneal.wordpress.com/2008/08/20/design-flaws-in-r-2-%e2%80%94-dropped-dimensions/#comment-930</link>
		<dc:creator><![CDATA[Eli]]></dc:creator>
		<pubDate>Wed, 06 Mar 2013 20:13:34 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=302#comment-930</guid>
		<description><![CDATA[I too write a lot of programs in R that use matrix multiplication.

#this is an example that kills me
a=matrix(1,4,4)
# transpose of the first row is a 4x1 COLUMN matrix
t(a[1,])

sadly, first R removes the dim attr, then does a transpose on the vector (which by some bizarre logic is possible with no dim attr) and gives you a 1x4 ROW matrix. 

Horrible.  However this is easily circumvented by employing drop=FALSE defensively throughout all your R code.  Until... you need to work with arrays with more than 2 dimensions. 

a=array(1,dim=c(1,2,10))
#once again R drops the dim attr and we get a row
t(a[,,1])
#that&#039;s ok, let&#039;s use drop=FALSE
t(a[,,1,drop=FALSE])
#oh no, a is not a matrix so we get an error
#--which actually makes sense unlike the behavior for t() on a vector with no dim attr

But what do we do to get a[,,1] as a matrix so we can take the transpose?

#this works but is computationally 3x as expensive as needed
b=a[,,1]
dim(b)=c(dim(a)[1],dim(a)[2])
t(b)

#this is less expensive but ugly
matrix(a[,,1],dim(a)[2],dim(a)[1],byrow=TRUE)

#by the way, this does not work for getting the correct transpose of a matrix
b=a[,,1]
dim(b)=c(dim(a)[2],dim(a)[1])

My solution was to write my own utility function for subsetting 3D matrices.]]></description>
		<content:encoded><![CDATA[<p>I too write a lot of programs in R that use matrix multiplication.</p>
<p>#this is an example that kills me<br />
a=matrix(1,4,4)<br />
# transpose of the first row is a 4&#215;1 COLUMN matrix<br />
t(a[1,])</p>
<p>sadly, first R removes the dim attr, then does a transpose on the vector (which by some bizarre logic is possible with no dim attr) and gives you a 1&#215;4 ROW matrix. </p>
<p>Horrible.  However this is easily circumvented by employing drop=FALSE defensively throughout all your R code.  Until&#8230; you need to work with arrays with more than 2 dimensions. </p>
<p>a=array(1,dim=c(1,2,10))<br />
#once again R drops the dim attr and we get a row<br />
t(a[,,1])<br />
#that&#8217;s ok, let&#8217;s use drop=FALSE<br />
t(a[,,1,drop=FALSE])<br />
#oh no, a is not a matrix so we get an error<br />
#&#8211;which actually makes sense unlike the behavior for t() on a vector with no dim attr</p>
<p>But what do we do to get a[,,1] as a matrix so we can take the transpose?</p>
<p>#this works but is computationally 3x as expensive as needed<br />
b=a[,,1]<br />
dim(b)=c(dim(a)[1],dim(a)[2])<br />
t(b)</p>
<p>#this is less expensive but ugly<br />
matrix(a[,,1],dim(a)[2],dim(a)[1],byrow=TRUE)</p>
<p>#by the way, this does not work for getting the correct transpose of a matrix<br />
b=a[,,1]<br />
dim(b)=c(dim(a)[2],dim(a)[1])</p>
<p>My solution was to write my own utility function for subsetting 3D matrices.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Evaluation of NUTS — more comments on the paper by Hoffman and Gelman by Nigel Goodwin</title>
		<link>http://radfordneal.wordpress.com/2012/01/27/evaluation-of-nuts-more-comments-on-the-paper-by-hoffman-and-gelman/#comment-926</link>
		<dc:creator><![CDATA[Nigel Goodwin]]></dc:creator>
		<pubDate>Sun, 06 Jan 2013 22:07:45 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=1001#comment-926</guid>
		<description><![CDATA[I was kind of aware of the Lagrangian approach, it looks promising, but I won&#039;t be implementing and testing it for some time - I have other priorities. It is difficult enough for me to calculate derivatives of my function, let alone second derivatives. My function is based on linear regression, radial basis functions, and Matern functions, and is used to calculate an estimate and a variance, and the variance contributes to the objective. Not so simple.

As I come from an optimisation background, BFGS is very appealing, but not so simple to see how it can be used in an MCMC scheme.

I do all this for optimisation of complex engineering designs.

The Burda and Maheu paper looks interesting, but I have been advised by others (who are of good repute) that it may not be reversible, so I would treat it with caution.

DREAM and its variants can be very useful. It is a nice idea.]]></description>
		<content:encoded><![CDATA[<p>I was kind of aware of the Lagrangian approach, it looks promising, but I won&#8217;t be implementing and testing it for some time &#8211; I have other priorities. It is difficult enough for me to calculate derivatives of my function, let alone second derivatives. My function is based on linear regression, radial basis functions, and Matern functions, and is used to calculate an estimate and a variance, and the variance contributes to the objective. Not so simple.</p>
<p>As I come from an optimisation background, BFGS is very appealing, but not so simple to see how it can be used in an MCMC scheme.</p>
<p>I do all this for optimisation of complex engineering designs.</p>
<p>The Burda and Maheu paper looks interesting, but I have been advised by others (who are of good repute) that it may not be reversible, so I would treat it with caution.</p>
<p>DREAM and its variants can be very useful. It is a nice idea.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Evaluation of NUTS — more comments on the paper by Hoffman and Gelman by jsalvatier</title>
		<link>http://radfordneal.wordpress.com/2012/01/27/evaluation-of-nuts-more-comments-on-the-paper-by-hoffman-and-gelman/#comment-925</link>
		<dc:creator><![CDATA[jsalvatier]]></dc:creator>
		<pubDate>Sun, 06 Jan 2013 19:48:08 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=1001#comment-925</guid>
		<description><![CDATA[Thanks for the update, very interesting. 

Do you have any thoughts about &quot;BAYESIAN ADAPTIVELY UPDATED HAMILTONIAN MONTE CARLO WITH AN APPLICATION TO HIGH-DIMENSIONAL BEKK GARCH MODELS&quot;?

http://www.rcfea.org/RePEc/pdf/wp46_12.pdf

or &quot;Lagrangian Dynamical Monte Carlo&quot; ?
http://arxiv.org/abs/1211.3759]]></description>
		<content:encoded><![CDATA[<p>Thanks for the update, very interesting. </p>
<p>Do you have any thoughts about &#8220;BAYESIAN ADAPTIVELY UPDATED HAMILTONIAN MONTE CARLO WITH AN APPLICATION TO HIGH-DIMENSIONAL BEKK GARCH MODELS&#8221;?</p>
<p><a href="http://www.rcfea.org/RePEc/pdf/wp46_12.pdf" rel="nofollow">http://www.rcfea.org/RePEc/pdf/wp46_12.pdf</a></p>
<p>or &#8220;Lagrangian Dynamical Monte Carlo&#8221; ?<br />
<a href="http://arxiv.org/abs/1211.3759" rel="nofollow">http://arxiv.org/abs/1211.3759</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Evaluation of NUTS — more comments on the paper by Hoffman and Gelman by Nigel Goodwin</title>
		<link>http://radfordneal.wordpress.com/2012/01/27/evaluation-of-nuts-more-comments-on-the-paper-by-hoffman-and-gelman/#comment-924</link>
		<dc:creator><![CDATA[Nigel Goodwin]]></dc:creator>
		<pubDate>Sun, 06 Jan 2013 15:58:53 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=1001#comment-924</guid>
		<description><![CDATA[I have done a lot of work on this BFGS method and NUTS. Long story short, if the hessian is reasonably constant over the space, it can give excellent benefits. E.g. tests on a highly correlated Gausian in high dimensions was very impressive. But in more realistic functions, such as Harrio banana functions, the Hessian approximation is not good, and behaviour can be worse than no Hessian.

I&#039;m afraid the holy grail has not been found. We need a hessian which is local along leapfrog steps, rather than global, we need a hessian which can be efficiently calculated and/or adapted with efficient leapfrogs, and we need to be able jump modes. No published method I have seen satisfies these criteria.

Also of course it has to satisfy detailed balance etc. I tried various methods which did not satisfy detailed balance but gave good local hessians, but the results were (not surprisingly) unreliable.

I have had lengthy discussions with Hoffman and Zhang, and I think we all came to similar conclusions.

My current approach is to use a mix of different methods - NUTS and DREAM and RWM, each has strengths and weaknesses.]]></description>
		<content:encoded><![CDATA[<p>I have done a lot of work on this BFGS method and NUTS. Long story short, if the hessian is reasonably constant over the space, it can give excellent benefits. E.g. tests on a highly correlated Gausian in high dimensions was very impressive. But in more realistic functions, such as Harrio banana functions, the Hessian approximation is not good, and behaviour can be worse than no Hessian.</p>
<p>I&#8217;m afraid the holy grail has not been found. We need a hessian which is local along leapfrog steps, rather than global, we need a hessian which can be efficiently calculated and/or adapted with efficient leapfrogs, and we need to be able jump modes. No published method I have seen satisfies these criteria.</p>
<p>Also of course it has to satisfy detailed balance etc. I tried various methods which did not satisfy detailed balance but gave good local hessians, but the results were (not surprisingly) unreliable.</p>
<p>I have had lengthy discussions with Hoffman and Zhang, and I think we all came to similar conclusions.</p>
<p>My current approach is to use a mix of different methods &#8211; NUTS and DREAM and RWM, each has strengths and weaknesses.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Evaluation of NUTS — more comments on the paper by Hoffman and Gelman by jsalvati</title>
		<link>http://radfordneal.wordpress.com/2012/01/27/evaluation-of-nuts-more-comments-on-the-paper-by-hoffman-and-gelman/#comment-923</link>
		<dc:creator><![CDATA[jsalvati]]></dc:creator>
		<pubDate>Sun, 06 Jan 2013 08:55:46 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=1001#comment-923</guid>
		<description><![CDATA[I was initially excited by Zhang&#039;s paper, but experience with it was that if the number of stored samples was significantly less than the number of dimensions of the distribution, the sampler got &#039;stuck&#039; in a subspace of the true distribution. It would only slowly move orthogonal to that distribution.

I suppose it&#039;s possible I misimplemented the algorithm, though. If you&#039;ve had more success, I&#039;d love to hear about it.]]></description>
		<content:encoded><![CDATA[<p>I was initially excited by Zhang&#8217;s paper, but experience with it was that if the number of stored samples was significantly less than the number of dimensions of the distribution, the sampler got &#8216;stuck&#8217; in a subspace of the true distribution. It would only slowly move orthogonal to that distribution.</p>
<p>I suppose it&#8217;s possible I misimplemented the algorithm, though. If you&#8217;ve had more success, I&#8217;d love to hear about it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Two textbooks on probability using R by Washington S. Silva (@twssecn)</title>
		<link>http://radfordneal.wordpress.com/2011/06/18/two-textbooks-on-probability-using-r/#comment-916</link>
		<dc:creator><![CDATA[Washington S. Silva (@twssecn)]]></dc:creator>
		<pubDate>Thu, 20 Dec 2012 01:13:20 +0000</pubDate>
		<guid isPermaLink="false">http://radfordneal.wordpress.com/?p=828#comment-916</guid>
		<description><![CDATA[The book Think Stats Probability and Statistics for Programmers,
By Allen B. Downey, deserves analysis. (In english now)]]></description>
		<content:encoded><![CDATA[<p>The book Think Stats Probability and Statistics for Programmers,<br />
By Allen B. Downey, deserves analysis. (In english now)</p>
]]></content:encoded>
	</item>
</channel>
</rss>
