<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-8369377483851107448</id><updated>2012-02-16T05:42:18.910-07:00</updated><category term='linux'/><category term='clustering'/><category term='hierarchical clusering'/><category term='Research'/><category term='emacs'/><category term='sparse matrix'/><category term='Intern Interview'/><category term='text categorization'/><category term='Data Mining'/><category term='social computing'/><category term='multi-dimensional networks'/><category term='Progress'/><category term='misc'/><category term='matlab'/><category term='personal life'/><category term='community detection'/><category term='information retrieval'/><category term='AI'/><category term='writing skills'/><category term='Feature Selection'/><category term='collective behavior'/><category term='statistics'/><category term='machine learning'/><category term='Transfer Learning'/><category term='Application'/><category term='papers'/><title type='text'>Swimming in Social Media</title><subtitle type='html'>Random thoughts about data mining, social computing, especially in social media.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>68</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-60643480940405163</id><published>2010-05-15T17:29:00.001-06:00</published><updated>2010-05-15T17:29:57.800-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='linux'/><category scheme='http://www.blogger.com/atom/ns#' term='emacs'/><title type='text'>remove accidentally inserted words from your spelling dictionary for emacs</title><content type='html'>somehow, I accidentally inserted an incorrect word to my spelling dictionary in emacs.&amp;nbsp; Wow, it took me a while to figure out where my private dictionary is.&lt;br /&gt;&lt;br /&gt;Below are my only configuration script for ispell in emacs:&lt;br /&gt;(global-set-key (kbd "&lt;f2&gt;") 'ispell-word)&lt;br /&gt;(global-set-key (kbd "&lt;f3&gt;") 'ispell)&lt;/f3&gt;&lt;/f2&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;So i thought I am using ispell and noticed&amp;nbsp; the following documenation for ispell:&lt;br /&gt;&lt;br /&gt;&lt;pre&gt;Whenever ispell is started the file `.ispell_words' is read from&lt;br /&gt;your home directory (if it exists). This file contains a list of&lt;br /&gt;words, one per line. The order of the words is not important, but&lt;br /&gt;the case is. Ispell will consider all of the words good, and will&lt;br /&gt;use them as possible near misses.&lt;/pre&gt;&lt;pre&gt;&lt;/pre&gt;&lt;br /&gt;Unfortunately, I didn't find such a file in my home directory. It turned out that I was using aspell in ispell mode. What the hell!&lt;br /&gt;&lt;br /&gt;The private dictionary for aspell is located at&lt;br /&gt;~/.aspell.en.pws&lt;br /&gt;&lt;br /&gt;Finally solved the problem.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;pre&gt;&lt;/pre&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-60643480940405163?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/60643480940405163/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=60643480940405163' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/60643480940405163'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/60643480940405163'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2010/05/change-your-spelling-dictionary-for.html' title='remove accidentally inserted words from your spelling dictionary for emacs'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-7043551549181338870</id><published>2010-04-07T22:31:00.002-06:00</published><updated>2010-04-07T22:31:52.155-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='linux'/><title type='text'>how to make backspace working for firefox in Ubuntu?</title><content type='html'>&lt;div class="post-225 post hentry category-bugsquashing category-snippets category-ubuntu"&gt;  &lt;h2&gt;http://embraceubuntu.com/2006/12/21/fix-firefox-backspace-to-take-you-to-the-previous-page/ &lt;/h2&gt;&lt;h2&gt;&amp;nbsp;&lt;/h2&gt;&lt;h2&gt;&lt;a href="http://embraceubuntu.com/2006/12/21/fix-firefox-backspace-to-take-you-to-the-previous-page/" title="Permalink for : Fix Firefox Backspace to Take You to the Previous Page"&gt;Fix Firefox Backspace to Take You to the Previous&amp;nbsp;Page&lt;/a&gt;  &lt;em&gt;December 21, 2006&lt;/em&gt;&lt;/h2&gt;&lt;em class="info"&gt;Posted by Carthik in &lt;a href="http://en.wordpress.com/tag/bugsquashing/" rel="category tag" title="View all posts in bugsquashing"&gt;bugsquashing&lt;/a&gt;,  &lt;a href="http://en.wordpress.com/tag/snippets/" rel="category tag" title="View all posts in snippets"&gt;snippets&lt;/a&gt;,  &lt;a href="http://en.wordpress.com/tag/ubuntu/" rel="category tag" title="View all posts in ubuntu"&gt;ubuntu&lt;/a&gt;. &lt;br /&gt;&lt;a href="http://embraceubuntu.com/2006/12/21/fix-firefox-backspace-to-take-you-to-the-previous-page/trackback/" title="trackback url"&gt;trackback&lt;/a&gt;     &lt;/em&gt;        &lt;div class="snap_preview"&gt;In a surprising development that seems really strange and unnecessary, Firefox 2.0 won’t go to the previous page when I press the “backspace” button on the keyboard. I have grown used to this over the period I have used Firefox. The fact that I can’t use backspace the way I am used to has been annoying me no end. So I decided to dig a little deeper.&lt;br /&gt;The feature was removed to fix &lt;a href="https://bugzilla.mozilla.org/show_bug.cgi?id=325541"&gt;a bug&lt;/a&gt;. &lt;a href="https://bugzilla.mozilla.org/show_bug.cgi?id=358764"&gt;The bug that was caused by fixing the previous bug, which is that the backspace does not behave like it should &lt;/a&gt;has been fixed too (Thank heavens!)&lt;br /&gt;But then, until the bug fix propagates to a firefox build available on Ubuntu, one has to resort to a little scratching to fix the matter. Here’s how you resurrect the backspace button in Firefox 2.0 (current as of this date):&lt;br /&gt;Type “about:config” in the address bar of Firefox and press Enter.&lt;br /&gt;`Filter` for ‘browser.backspace_action’ and change its value to 0 (zero).&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-7043551549181338870?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/7043551549181338870/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=7043551549181338870' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7043551549181338870'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7043551549181338870'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2010/04/how-to-make-backspace-working-for.html' title='how to make backspace working for firefox in Ubuntu?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6862143416951913728</id><published>2009-12-01T12:32:00.003-07:00</published><updated>2009-12-01T12:51:23.065-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sparse matrix'/><category scheme='http://www.blogger.com/atom/ns#' term='matlab'/><title type='text'>matlab sparse matrix format</title><content type='html'>I have encountered several cases to handle matlab sparse matrix in mex c or c++.&lt;br /&gt;&lt;br /&gt;Here, I use a simple example to explain how the sparse matrix is stored in matlab (Very creative, but not necessarily for my usage. Hence, it is always convenient to understand the structure)&lt;br /&gt;&lt;br /&gt;The sparse matrix format in matlab involve several components:&lt;br /&gt;nz: the number of non-zeros&lt;br /&gt;m: the number of rows&lt;br /&gt;n: the number of cols&lt;br /&gt;&lt;br /&gt;jc: the index of columns.&lt;br /&gt;ir: the index of rows;&lt;br /&gt;pr: the nonzero entries stored in a double vector.&lt;br /&gt;&lt;br /&gt;The tricky part is the relation between jc, ir and pr.&lt;br /&gt;&lt;p&gt;For example, consider a &lt;tt&gt;7&lt;/tt&gt;-by-&lt;tt&gt;3&lt;/tt&gt; sparse &lt;tt&gt;mxArray&lt;/tt&gt; named &lt;tt&gt;Sparrow&lt;/tt&gt; containing six nonzero elements, created by typing:&lt;/p&gt;&lt;pre class="programlisting"&gt;Sparrow = zeros(7,3);&lt;br /&gt;Sparrow(2,1) = 1;&lt;br /&gt;Sparrow(5,1) = 1;&lt;br /&gt;Sparrow(3,2) = 1;&lt;br /&gt;Sparrow(2,3) = 2;&lt;br /&gt;Sparrow(5,3) = 1;&lt;br /&gt;Sparrow(6,3) = 1;&lt;br /&gt;Sparrow = sparse(Sparrow);&lt;/pre&gt;Then, the matrix looks like below:&lt;br /&gt;&lt;br /&gt;&gt;&gt; full(Sparrow)&lt;br /&gt;&lt;br /&gt;ans =&lt;br /&gt;&lt;br /&gt;0     0     0&lt;br /&gt;1     0     2&lt;br /&gt;0     1     0&lt;br /&gt;0     0     0&lt;br /&gt;1     0     1&lt;br /&gt;0     0     1&lt;br /&gt;0     0     0&lt;br /&gt;&lt;br /&gt;Then&lt;br /&gt;&lt;p&gt;The contents of the &lt;tt&gt;ir&lt;/tt&gt;, &lt;tt&gt;jc&lt;/tt&gt;, and &lt;tt&gt;pr&lt;/tt&gt; arrays are listed in this table.&lt;/p&gt;&lt;table class="body" border="2" cellpadding="4" cellspacing="0"&gt;&lt;colgroup&gt;&lt;col width="20%"&gt;&lt;col width="11%"&gt;&lt;col width="12%"&gt;&lt;col width="11%"&gt;&lt;col width="46%"&gt;&lt;/colgroup&gt;&lt;thead&gt;&lt;tr valign="top"&gt;&lt;th bgcolor="#b2b2b2"&gt;&lt;p&gt;Subscript&lt;/p&gt;&lt;/th&gt;&lt;th bgcolor="#b2b2b2"&gt;&lt;p&gt;ir&lt;/p&gt;&lt;/th&gt;&lt;th bgcolor="#b2b2b2"&gt;&lt;p&gt;pr&lt;/p&gt;&lt;/th&gt;&lt;th bgcolor="#b2b2b2"&gt;&lt;p&gt;jc&lt;/p&gt;&lt;/th&gt;&lt;th bgcolor="#b2b2b2"&gt;&lt;p&gt;Comment&lt;/p&gt;&lt;/th&gt;&lt;/tr&gt; &lt;/thead&gt;&lt;tbody&gt;&lt;tr valign="top"&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;(2,1)&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;1&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-5"&gt;&lt;/a&gt;&lt;tt&gt;1&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-6"&gt;&lt;/a&gt;&lt;tt&gt;0&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;Column 1 contains two nonzero elements, with rows designated by &lt;tt&gt;ir[0]&lt;/tt&gt; and &lt;tt&gt;ir[1]&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt; &lt;tr valign="top"&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;(5,1)&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;4&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-7"&gt;&lt;/a&gt;&lt;tt&gt;1&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-8"&gt;&lt;/a&gt;&lt;tt&gt;2&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;Column 2 contains one nonzero element, with row designated by &lt;tt&gt;ir[2]&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt; &lt;tr valign="top"&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;(3,2)&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;2&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-9"&gt;&lt;/a&gt;&lt;tt&gt;1&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-10"&gt;&lt;/a&gt;&lt;tt&gt;3&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;Column 3 contains three nonzero elements, with rows designated by &lt;tt&gt;ir[3]&lt;/tt&gt;,&lt;tt&gt;ir[4]&lt;/tt&gt;, and &lt;tt&gt;ir[5]&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt; &lt;tr valign="top"&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;(2,3)&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;1&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-11"&gt;&lt;/a&gt;&lt;tt&gt;2&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-12"&gt;&lt;/a&gt;&lt;tt&gt;6&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;There are six nonzero elements in all.&lt;/p&gt;&lt;/td&gt;&lt;/tr&gt; &lt;tr valign="top"&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;(5,3)&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;4&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-13"&gt;&lt;/a&gt;&lt;tt&gt;1&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;br /&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;br /&gt;&lt;/td&gt;&lt;/tr&gt; &lt;tr valign="top"&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;(6,3)&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;tt&gt;5&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;p&gt;&lt;a name="bqvow6l-14"&gt;&lt;/a&gt;&lt;tt&gt;1&lt;/tt&gt;&lt;/p&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;br /&gt;&lt;/td&gt;&lt;td bgcolor="#f2f2f2"&gt;&lt;br /&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;&lt;p&gt; If the &lt;tt&gt;j&lt;/tt&gt;th column of the sparse &lt;tt&gt;mxArray&lt;/tt&gt; has any nonzero elements:&lt;/p&gt;&lt;ul type="disc"&gt;&lt;li&gt;&lt;p&gt;&lt;tt&gt;jc[j]&lt;/tt&gt; is the index in &lt;tt&gt;ir&lt;/tt&gt;, &lt;tt&gt;pr&lt;/tt&gt;, and &lt;tt&gt;pi&lt;/tt&gt; (if it exists) of the first nonzero element in the &lt;tt&gt;j&lt;/tt&gt;th column.&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;&lt;tt&gt;jc[j+1]-1&lt;/tt&gt; is the index of the last nonzero element in the &lt;tt&gt;j&lt;/tt&gt;th column.&lt;/p&gt;&lt;/li&gt;&lt;li&gt;&lt;p&gt;For the &lt;tt&gt;j&lt;/tt&gt;th column of the sparse matrix, &lt;tt&gt;jc[j]&lt;/tt&gt; is the total number of nonzero elements in all preceding columns.&lt;/p&gt;&lt;/li&gt;&lt;/ul&gt;&lt;p&gt;The number of nonzero elements in the &lt;tt&gt;j&lt;/tt&gt;th column of the sparse &lt;tt&gt;mxArray&lt;/tt&gt; is:&lt;/p&gt;&lt;pre class="programlisting"&gt;jc[j+1] - jc[j];&lt;br /&gt;&lt;br /&gt;Note that the size of jc is n+1.&lt;br /&gt;the size of ir is nz, the same as pr.&lt;br /&gt;&lt;br /&gt;Hence, to iterate over the spare matrix in c, you can use the following code:&lt;br /&gt;&lt;br /&gt; for (col=0; col &lt; n; col++){  &lt;br /&gt;   startIndex = jc[col];  &lt;br /&gt;   endIndex = jc[col+1];  &lt;br /&gt;   for (i=startIndex; i &lt; endIndex; i++){&lt;br /&gt;     row = ir[i];&lt;br /&gt;     val = pr[i];&lt;br /&gt;     ......&lt;br /&gt;&lt;br /&gt;   }&lt;br /&gt;&lt;br /&gt;}&lt;br /&gt;&lt;br /&gt;&lt;/pre&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6862143416951913728?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6862143416951913728/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6862143416951913728' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6862143416951913728'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6862143416951913728'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/12/matlab-sparse-matrix-format.html' title='matlab sparse matrix format'/><author><name>Lei</name><uri>http://www.blogger.com/profile/12910515244390416980</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-550719166136376721</id><published>2009-11-30T18:47:00.003-07:00</published><updated>2009-11-30T18:53:14.923-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='community detection'/><category scheme='http://www.blogger.com/atom/ns#' term='multi-dimensional networks'/><title type='text'>ICDM Trip Soon</title><content type='html'>This Sunday, I will be out of town for &lt;a href="http://www.cs.umbc.edu/ICDM09/program.html"&gt;ICDM conference&lt;/a&gt; held in Miami, FL.&lt;br /&gt;&lt;br /&gt;I will present my paper:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.public.asu.edu/%7Eltang9/papers/multi-dimensional_network.pdf"&gt;Uncovering Groups via Heterogeneous Interaction Analysis&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;This work addresses the community detection problem when multiple different types of interactions are presented between the same set of users.&lt;br /&gt;&lt;br /&gt;A more general case is that users registered at different social media sites. Can we somehow uncover the hidden community structure?&lt;br /&gt;&lt;br /&gt;We show that using an integration based on structural features is more robust.  For evaluation, I proposed a simple cross-dimension network validation scheme. Similar to cross validation. This could be used as a simple rule for evaluation in the future.&lt;br /&gt;&lt;br /&gt;Of course, there are many interesting directions to pursue in the future. One important aspect is that some of the dimensions are noisy. Is it possible to identify them? Is this the same as tensor decomposition?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-550719166136376721?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/550719166136376721/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=550719166136376721' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/550719166136376721'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/550719166136376721'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/11/icdm-trip-soon.html' title='ICDM Trip Soon'/><author><name>Lei</name><uri>http://www.blogger.com/profile/12910515244390416980</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-2043718269322812670</id><published>2009-11-18T11:29:00.008-07:00</published><updated>2009-12-03T16:58:45.675-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='collective behavior'/><category scheme='http://www.blogger.com/atom/ns#' term='social computing'/><title type='text'>about collective behavior</title><content type='html'>Recently, I just submitted a magzine article talking about collective behavior to IEEE Intelligent systems based on my recent work published this year:&lt;br /&gt;&lt;a href="http://www.public.asu.edu/%7Eltang9/papers/tang-kdd09.pdf" target="_blank"&gt;&lt;i&gt;Relational Learning via Latent Social Dimensions&lt;/i&gt;&lt;/a&gt;&amp;nbsp; KDD-2009&lt;br /&gt;&lt;a href="http://www.public.asu.edu/%7Eltang9/papers/sparse_social_dimension.pdf" target="_blank"&gt;&lt;i&gt;Scalable Learning of Collective Behavior based on Sparse Social Dimensions&lt;/i&gt;&lt;/a&gt; CIKM-2009.  &lt;br /&gt;&lt;br /&gt;So what is collective behavior? I define collective behavior as behaviors when individuals are exposed in a social network environment.&amp;nbsp;  &lt;br /&gt;&lt;br /&gt;I found that different areas have quite different definitions.&lt;span style="color: red;"&gt;&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span style="color: red;"&gt;(1) . According to &lt;a href="http://en.wikipedia.org/wiki/Collective_behavior" target="_blank"&gt;wik&lt;/a&gt;i&lt;/span&gt;  (mostly written by sociologist),"The term "&lt;b&gt;collective  behavior&lt;/b&gt;" was first used by &lt;a href="http://en.wikipedia.org/wiki/Robert_E._Park" target="_blank"&gt;Robert  E. Park&lt;/a&gt;, and employed definitively by &lt;a href="http://en.wikipedia.org/wiki/Herbert_Blumer" target="_blank"&gt;Herbert  Blumer&lt;/a&gt;, to refer to social processes and events which do not  reflect existing &lt;a href="http://en.wikipedia.org/wiki/Social_structure" target="_blank"&gt;social  structure&lt;/a&gt; (&lt;a href="http://en.wikipedia.org/wiki/Laws" target="_blank"&gt;laws&lt;/a&gt;,  conventions, and &lt;a href="http://en.wikipedia.org/wiki/Institutions" target="_blank"&gt;institutions&lt;/a&gt;),  but which emerge in a "spontaneous" way. "Collective  behavior can be divided into four categories:   &lt;br /&gt;&lt;blockquote style="margin-left: 0in;"&gt;1. The Crowd&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;2. The public&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;3. The mass&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;4. The social movement&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;It is subtle to differentiate these four terms based on the name. &lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;&lt;span style="color: red;"&gt;(2) &lt;/span&gt;On the other hand, in other field, such as &lt;span style="color: red;"&gt;artificial  life&lt;/span&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;h3 style="margin-bottom: 0.2in; margin-right: 0.39in; margin-top: 0in;"&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://www.softcomputing.net/sidm-chapter.pdf" target="_blank"&gt;Swarm Intelligence in &lt;i&gt;Data Mining &lt;/i&gt;&lt;/a&gt;&lt;/span&gt; &lt;/h3&gt;&lt;blockquote style="margin-left: 0in;"&gt;collective behavior refers to a group of agents which can be treated as an entity, which is commonly observed in bird flocks, ants and other animals.&amp;nbsp; There are several principles:&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;homogeneity (all the agents follow the same behavior model), locality (influenced only by neighbors), collision avoidance (avoid with nearby flock mates), velocity matching and flock centering.  &lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;Two popular methods are introduced in the chapter: particle swarm intelligence and ant colony optimization.&amp;nbsp; quite interesting ideas :)&lt;br /&gt;&lt;/blockquote&gt;&lt;span style="color: red;"&gt;(3)&lt;/span&gt; In &lt;span style="color: red;"&gt;data mining&lt;/span&gt; field, there is  one paper talking about &lt;a href="http://people.seas.harvard.edu/%7Ejenn/" target="_blank"&gt;learning  from collective behavior. &lt;/a&gt;&lt;br /&gt;&lt;blockquote style="margin-left: 0in;"&gt;The setup is like oracle: suppose you have the luxury to observe the interaction and corresponding actions of a group of people and each person in the group follow a fixed policy,&amp;nbsp; then&amp;nbsp; how can we learn the policy and strategies so that when new situations arrive, we can predict the collective behavior?&amp;nbsp; The authors provide some theoretical bounds about the learnability. Unfortunatelly, evertying is synthetic and it is really difficult for me to figure out a proper scenario such that their setup might be true.&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;The nice part of this work is that at least two kinds of policies are studied:&lt;br /&gt;&lt;/blockquote&gt;&lt;ul&gt;&lt;li&gt;one is mimic your friend&lt;/li&gt;&lt;li&gt; the other is try to differentiate from your  neighbors   &lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;blockquote style="margin-left: 0in;"&gt;while the first one is well known, the 2nd strategy, as I believe existing in the real world, do not studied well in data mining or SNA. One way of achieving this is connecting those nodes that are two hops away and seperate them from 1-hop away neighbors as suggested in &lt;a href="http://portal.acm.org/citation.cfm?id=1401925" target="_blank"&gt;using ghost edges for sparsely labeled networks&lt;/a&gt; But when everyting is mixed, more work needs to be done.  &lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;Another key difference of this work from my article is that my article talks about "spatial" prediction of collective behavior (given some observations, predict the others within the same network), while this one talks about "temporal" prediction.  &lt;br /&gt;&lt;/blockquote&gt;I think it is a great idea to combine these two aspects together. &lt;br /&gt;&lt;br /&gt;&lt;span style="color: red;"&gt;(4) Adaptive networks and behavior.&lt;/span&gt;  Concerning social networks and collective behavior, two directions  are converging. One study the dynamics of networks, the other study  the dynamics on networks. In reality, there two factors are evolving  simultaneously. Social networks can evolve, so are collective  behavior. How to capture these two factors in the modeling?&amp;nbsp;   &lt;br /&gt;&lt;br /&gt;&lt;blockquote style="margin-left: 0in;"&gt;This also relates to social influence and social selection and several papers are talking about this issue:&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;&lt;a href="http://stat.gamma.rug.nl/chapter_coevol.pdf" target="_blank"&gt;Modeling the co-evolution of network and behavior&lt;/a&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;&lt;a href="http://arxiv.org/abs/physics/0603023" target="_blank"&gt;Nonequilibrium phase transition in the coevolution of networks and opinions&lt;/a&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;blockquote style="margin-left: 0in;"&gt;&lt;a href="http://www.cs.cornell.edu/%7Edph/papers/kdd08-sim.pdf" target="_blank"&gt;Feedback effects between similarity and social influence in online communities&lt;/a&gt;&lt;a href="http://www.physorg.com/news114424579.html" target="_blank"&gt;&lt;span style="color: red;"&gt;&lt;/span&gt;&lt;/a&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;&lt;blockquote style="margin-left: 0in;"&gt;&lt;a href="http://www.physorg.com/news114424579.html" target="_blank"&gt;&lt;span style="color: red;"&gt;(5) Collective  Attention&lt;/span&gt;&lt;/a&gt; investigate how a news or a resource attracts  users' attentions. Does networks come into play? This can also be a  further direction for investigation.   &lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Anyway, I feel that this direction has many more issues to address. Also many challenges such as problem formulation, data collection, and evaluation. More in the near future :)  &lt;br /&gt;Just check &lt;a href="http://www.public.asu.edu/%7Eltang9/" target="_blank"&gt;my homepage&lt;/a&gt;&lt;br /&gt;&lt;div style="margin-bottom: 0in;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-2043718269322812670?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/2043718269322812670/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=2043718269322812670' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2043718269322812670'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2043718269322812670'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/11/about-collective-behavior.html' title='about collective behavior'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-3189903827718430637</id><published>2009-11-16T22:38:00.001-07:00</published><updated>2009-11-16T22:38:17.168-07:00</updated><title type='text'>Change default paper size in tetex in Linux</title><content type='html'>Run command in sudo mode:&lt;br&gt; &lt;br&gt; &lt;b&gt;texconfig&lt;/b&gt;&lt;br&gt; &lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-3189903827718430637?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/3189903827718430637/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=3189903827718430637' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3189903827718430637'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3189903827718430637'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/11/change-default-paper-size-in-tetex-in.html' title='Change default paper size in tetex in Linux'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8812463547500071282</id><published>2009-11-16T17:16:00.001-07:00</published><updated>2009-11-16T17:17:30.437-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='writing skills'/><category scheme='http://www.blogger.com/atom/ns#' term='Research'/><title type='text'>Tips in research</title><content type='html'>&lt;div class="gmail_quote"&gt;writing is like a hour glass.&lt;br&gt; &lt;br&gt; Starts big, cover every aspects of the neck, and present a big picture.&lt;br&gt; &lt;br&gt; Use conference deadlines to make yourself efficient.&lt;br&gt; &lt;br&gt; Start from bad writing then improve!&lt;br&gt; &lt;/div&gt;&lt;br&gt;&lt;br clear="all"&gt; &lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8812463547500071282?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8812463547500071282/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8812463547500071282' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8812463547500071282'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8812463547500071282'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/11/tips-in-research.html' title='Tips in research'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-3035515986083512428</id><published>2009-10-26T17:31:00.000-06:00</published><updated>2009-10-26T17:31:08.000-06:00</updated><title type='text'>www abstract due today</title><content type='html'>Today is the abstract due. I have uploaded the abstract.&lt;br /&gt;&lt;br /&gt;As I'm leaving on 29th, I need to finish the paper in 3 days!!!&lt;br /&gt;&lt;br /&gt;This time, we did some interesting work rather than conventional data mining (mostly focus on cross-validation result, not interesting enough). &lt;br /&gt;&lt;br /&gt;Hopefully we can make it!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-3035515986083512428?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/3035515986083512428/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=3035515986083512428' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3035515986083512428'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3035515986083512428'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/10/www-abstract-due-today.html' title='www abstract due today'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-1185869390414055331</id><published>2009-10-25T17:08:00.000-06:00</published><updated>2009-10-25T17:08:14.170-06:00</updated><title type='text'>Travel Plans this year</title><content type='html'>10/30/ - 11/07, Hongkong, for CIKM.&lt;br /&gt;Presenting "&lt;a href="http://www.public.asu.edu/%7Eltang9/papers/sparse_social_dimension.pdf"&gt;scalable learning of collective behavior based on sparse social dimensions&lt;/a&gt;" &lt;br /&gt;&lt;br /&gt;12/06 - 12/09, Miami, FL for ICDM.&lt;br /&gt;Presenting "&lt;a href="http://www.public.asu.edu/%7Eltang9/papers/multi-dimensional_network.pdf"&gt;uncovering groups via heterogeneous interaction analysis&lt;/a&gt;"&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-1185869390414055331?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/1185869390414055331/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=1185869390414055331' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1185869390414055331'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1185869390414055331'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/10/travel-plans-this-year.html' title='Travel Plans this year'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6105803445784323859</id><published>2009-10-22T18:46:00.000-06:00</published><updated>2009-10-22T18:46:04.981-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><category scheme='http://www.blogger.com/atom/ns#' term='machine learning'/><title type='text'>An interesting reading</title><content type='html'>I came accross this &lt;a href="http://projecteuclid.org/DPubS?verb=Display&amp;amp;version=1.0&amp;amp;service=UI&amp;amp;handle=euclid.ss/1009213726&amp;amp;page=record"&gt;interesing paper&lt;/a&gt; by a statistician &lt;span&gt;Leo Breiman talking about the difference of statistics and machine learning. Basically, there are two school: one takes the data model, trying to understand the nature process (which is widely studied by statistician), the other one&amp;nbsp; takes an algorithmic model. They do not care the process in the black box, but just try to approximate the process using whatever effective methods such as neural network and decisions trees (this is the philosophy of machine learning guys). &lt;/span&gt;&lt;br /&gt;&lt;span&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span&gt;Some of the discusssions are very incisive. Leo argued that the over emphasize of data model might lead to wrong conclusions. As there could be many different but comparable models to achieve the same performance. Taking a machine learning approach seems more practical. &lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span&gt;I am more interested in reading a paper by a machine learning guy talking about statistics.&amp;nbsp;&lt;/span&gt;&lt;br /&gt;&lt;span&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;span&gt;Especially, for a newbie to work on machine learning, should he go to the statistics department or computer science department? &lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6105803445784323859?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6105803445784323859/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6105803445784323859' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6105803445784323859'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6105803445784323859'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/10/interesting-reading.html' title='An interesting reading'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-4761622103733762258</id><published>2009-10-14T15:42:00.000-06:00</published><updated>2009-10-14T15:42:02.609-06:00</updated><title type='text'>refeshing by writing papers</title><content type='html'>I'm not sure whether this is a good symptom for reseachers. I feel excited when I write papers. This has been confirmed many times.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-4761622103733762258?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/4761622103733762258/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=4761622103733762258' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4761622103733762258'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4761622103733762258'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/10/refeshing-by-writing-papers.html' title='refeshing by writing papers'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-7036508653135197390</id><published>2009-10-07T15:58:00.000-06:00</published><updated>2009-10-07T15:58:06.364-06:00</updated><title type='text'>starting job search</title><content type='html'>After submitting all the journal papers relevant to my past work, I'm now ready to look for jobs.&lt;br /&gt;&lt;br /&gt;Go Go Go!&lt;br /&gt;&lt;br /&gt;Hope I can find an ideal job :)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-7036508653135197390?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/7036508653135197390/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=7036508653135197390' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7036508653135197390'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7036508653135197390'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/10/starting-job-search.html' title='starting job search'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-751681593505202327</id><published>2009-09-27T12:53:00.000-06:00</published><updated>2009-09-27T12:53:55.425-06:00</updated><title type='text'>US visa still pending...</title><content type='html'>As I am planning to go to HK to attend CIKM'09, I went to Mexico to renew my US visa two weeks ago.&amp;nbsp; As expected, I got checked. But fortunatelly, I "slipped" back into US.&lt;br /&gt;&lt;br /&gt;Until today, I still didn't get any update from the consulate about the visa.&lt;br /&gt;&lt;br /&gt;Hope I will receive the good news next week!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-751681593505202327?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/751681593505202327/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=751681593505202327' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/751681593505202327'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/751681593505202327'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/09/us-visa-still-pending.html' title='US visa still pending...'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8583024317999678436</id><published>2009-09-25T15:01:00.001-06:00</published><updated>2009-09-25T15:26:41.398-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='hierarchical clusering'/><category scheme='http://www.blogger.com/atom/ns#' term='clustering'/><title type='text'>determine number of clusters?</title><content type='html'>At present, many community detection methods have been proposed. "Unfortunately", most of these methods require users to specify the number of clusters, which seems insane at first glimpse. &lt;br /&gt;&lt;br /&gt;Imaging a conversation like this:&lt;br /&gt;&lt;br /&gt;Person: powerful machine, can you tell me something about the data set I have?&lt;br /&gt;Machine: You honored! Sure, but please tell me how many clusters are there. &lt;br /&gt;Person: I do not know the data set. That's why I am asking you! &lt;br /&gt;Machine: Sorry,  I cannot tell you anything unless you tell me something.&lt;br /&gt;Person: ....... (stupid machine!)&lt;br /&gt;Machine: ...... (this user is an idiot about data mining!)&lt;br /&gt;&lt;br /&gt;So when an algorithm claims that it automatically determines the number of clusters, everyone just embraces it. For instance, modularity is claimed to find the optimal number of clusters. So are Bayesian guys by enforcing a Chinese restaurant process as a prior. These methods, as I understand, is like setting a threshold for the clustering process or setting an objective to optimize. It basically tells a user that under this objective function, this number of clusters seems reasonable. Instead of setting the number of clusters, these algorithm implicitly set a parameter or an biased objective function to optimize. &lt;br /&gt;&lt;br /&gt;But do we really need to determine the number of clusters? &lt;br /&gt;&lt;br /&gt;If just for data exploration, I think a hierarchical organization of data objects are more reasonable. But it seems that current state of hierarchical clustering is too far away from satisfactory. Most methods just give you a binary tree, which does not reflect any interesting structure. Another disadvantage is high sensitivity of the hierarchical structure to the algorithm implementation and data processing order. &lt;br /&gt;&lt;br /&gt;Instead of begging for machines to determine the number of communities, organization clusters of multi-resolutions is more interesting and more consistent with users' request.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8583024317999678436?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8583024317999678436/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8583024317999678436' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8583024317999678436'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8583024317999678436'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/09/determine-number-of-clusters.html' title='determine number of clusters?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6441388156899821373</id><published>2009-08-19T17:58:00.000-06:00</published><updated>2009-08-19T17:59:00.683-06:00</updated><title type='text'>IJCAI post-conference report</title><content type='html'>This year, thanks to the support from IJCAI conference, I can travel to Pasadena to attend the IJCAI conference.  This is the first time I attended IJCAI (I attended AAAI in 2005).  AI motivated me to pursue my Ph.D study in the first place. I was quite excited to attend this top conference on AI.&lt;br /&gt;&lt;br /&gt;Overall, I like the invited talks and demos most. AI is now so separated into different subfields and each field currently (e.g.,  machine learning and data mining) rarely talks much about intelligence. So these high-quality invited talks in IJCAI really gave me a fresh view of AI from different perspectives. However, the papers in the conference, I have to say, are not quite interesting.  What is even worse, it is difficult to grasp any idea if you are not working directly in the field the paper is talking about.  While it seems IJCAI motivates to encourage collaboration and cross-discipline talks between AI researchers,  the barrier is heavier than ever before.  This makes me worry about the quality of the IJCAI conference.  How to attract high-quality work to publish in IJCAI and break the boundary between different areas require a lot of efforts, or even a revolutionary reform. Maybe, it is more reasonable to host IJCAI as a symposium consists of high-quality talks and pioneer work, rather than yet-another conference to publish delta papers. &lt;br /&gt;&lt;br /&gt;This conference provides two tours for APL and USC. I think these tours were great and such kind of activities should be kept in the following conferences as they provide a great opportunity to explore classical AI projects.   It was a little sad that some attendants reserved the seats but did not show up. On the contrary, some other people who wish to join the tour could not go because they had no ticket.  I hope the ticket-exchange at the registration desk would be allowed to accommodate more people in the next IJCAI.&lt;br /&gt;&lt;br /&gt;Another is concerning the organization of IJCAI. The sessions were well organized.  But the catering was really poor. I attended the student reception. The food there was not attractive at all.  Some other attendants were also complaining about the food. Maybe, the conference organizer spent most of the fundings to provide travel scholarship for students. Then, I'll take my words back given such a bad economic situation.&lt;br /&gt;&lt;br /&gt;I would like to thank IJCAI to provide me this great opportunity to make friends with other AI researchers, to meet those AI “stars”, and to re-cherish those AI dreams I used to have.  Hope this conference would become better and inspire more AI communications and explorations.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6441388156899821373?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6441388156899821373/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6441388156899821373' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6441388156899821373'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6441388156899821373'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/08/ijcai-post-conference-report.html' title='IJCAI post-conference report'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-2764844069372952778</id><published>2009-08-05T14:09:00.003-06:00</published><updated>2009-08-05T14:50:20.019-06:00</updated><title type='text'>what's your chance of acceptance if you submit late?</title><content type='html'>Just came across this question:&lt;br /&gt;&lt;br /&gt;The same paper, which one would have a higher chance of being accepted in a conference? &lt;br /&gt;&lt;br /&gt;Submit early? normal? or submit late?&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Basically, would a larger submission number gives you a higher chance of being accepted?&lt;br /&gt;&lt;br /&gt;I believe so.&lt;br /&gt;&lt;br /&gt;Actually, as a reviewer and author, I have seen many people rushing their paper for a conference especially in CS. So a large submission number automatically hint that "the paper is rushed". So somehow the reviewer would have a low expectation. Then a good quality paper would stands out in the local region and is likely to get accepted. &lt;br /&gt;&lt;br /&gt;Of course, this is just a small trick and no evidence can be found. &lt;br /&gt;&lt;br /&gt;A simple way to avoid this bias is to reshuffle the paper when assigning them to the reviwers. Such that the paper's reviewer number has no correlation with the quality. Then, that seems a more fair process. &lt;br /&gt;&lt;br /&gt;Just my 2 cents.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-2764844069372952778?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/2764844069372952778/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=2764844069372952778' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2764844069372952778'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2764844069372952778'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/08/whats-your-chance-of-acceptance-if-you.html' title='what&apos;s your chance of acceptance if you submit late?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-4701979237981441316</id><published>2009-07-27T14:53:00.001-06:00</published><updated>2009-07-27T14:54:55.533-06:00</updated><title type='text'>A special issue of network analysis on Science</title><content type='html'>http://www.sciencemag.org/cgi/content/short/325/5939/405&lt;br /&gt;&lt;br /&gt;Quite interesting special issue. &lt;br /&gt;&lt;br /&gt;"It is not enough to look at patterns; we need to study how they evolve and change."&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-4701979237981441316?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/4701979237981441316/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=4701979237981441316' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4701979237981441316'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4701979237981441316'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/07/special-issue-of-network-analysis-on.html' title='A special issue of network analysis on Science'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-3393675821643334660</id><published>2009-06-27T23:47:00.000-06:00</published><updated>2009-06-27T23:49:07.797-06:00</updated><title type='text'>Recent Update</title><content type='html'>Finished conference papers for ICDM and CIKM.  (Still a paper of understanding groups in progress, probably submitting to WSDM)&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Need to come back to the journal papers and thesis proposal now.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-3393675821643334660?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/3393675821643334660/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=3393675821643334660' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3393675821643334660'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3393675821643334660'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/06/recent-update.html' title='Recent Update'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8049491590295630216</id><published>2009-05-28T16:03:00.001-06:00</published><updated>2009-05-28T16:05:16.980-06:00</updated><title type='text'>recent conferences to go</title><content type='html'>I'm working on papers to submit to CIKM or ICDE. &lt;br /&gt;&lt;br /&gt;I just checked both conferences, one requires 10 pages, and the other requires 12 pages.&lt;br /&gt;&lt;br /&gt;The ridiculous thing is that once you are accepted as short paper, you have to reduce to 2 pages for CIKM, and 4 pages for ICDE.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8049491590295630216?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8049491590295630216/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8049491590295630216' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8049491590295630216'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8049491590295630216'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/05/recent-conferences-to-go.html' title='recent conferences to go'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-4051137228092507100</id><published>2009-05-16T15:50:00.000-06:00</published><updated>2009-05-16T15:51:20.402-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='writing skills'/><title type='text'>some notes on professional writing</title><content type='html'>1. should use punctuations in equations.&lt;br /&gt;2. Avoid starting a sentence with a mathematical expression, seperate symbols by puncutation marks or words.&lt;br /&gt;3. "the" is inapproprieate when the object referred is not unique or does not exist&lt;br /&gt;4. write "the kth term", not "the k-th term"&lt;br /&gt;5. two types of ellipsis, vertically centered and ground level. The latter is used between a list of symbols or  to indicate a product. A_1...A_n. i = 1, 2, ..., n&lt;br /&gt;6. Several commonly used abbreviations, e.g., i.e., et al., &lt;br /&gt;&lt;br /&gt;7. avoid using the adj or advs. very, rather, quite, nice, interesting (all these are imprecise).  So is "essentially"&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-4051137228092507100?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/4051137228092507100/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=4051137228092507100' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4051137228092507100'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4051137228092507100'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/05/some-notes-on-professional-writing.html' title='some notes on professional writing'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-7433119810306904587</id><published>2009-04-22T11:17:00.003-06:00</published><updated>2009-04-22T11:20:22.218-06:00</updated><title type='text'>what's the point of 30% new material?</title><content type='html'>I decide to convert my KDD08 paper into a journal version. Unfortunately, it is said that I have to add at least 30% new materials. &lt;br /&gt;&lt;br /&gt;The sad thing is that I have wrote too much for the conference paper, since I want to make the paper more solid and comprehensive. &lt;br /&gt;&lt;br /&gt;But it turns out I have nothing to add for the journal paper(Of course, you can always add something new but the cost is way too high, almost another journal paper).&lt;br /&gt;&lt;br /&gt;So that's what I learned from this class: &lt;br /&gt;Don't write too much in one conference paper.  &lt;br /&gt;&lt;br /&gt;It's really funny and ridiculous.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-7433119810306904587?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/7433119810306904587/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=7433119810306904587' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7433119810306904587'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7433119810306904587'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2009/04/whats-point-of-30-new-material.html' title='what&apos;s the point of 30% new material?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8139611838727996542</id><published>2008-11-30T19:38:00.003-07:00</published><updated>2008-11-30T19:41:49.719-07:00</updated><title type='text'>Avoid being yesterday's sucker</title><content type='html'>I have been a sucker for a long while. Thanks to this miserable time, I, right now, start to read more than ever before. Though my reading speed is still too low considering the English context. However, this do give me insights and a hilarous time. &lt;br /&gt;&lt;br /&gt;My goal, avoid being yesterday's sucker and do something solid!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8139611838727996542?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8139611838727996542/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8139611838727996542' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8139611838727996542'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8139611838727996542'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2008/11/avoid-being-yesterdays-sucker.html' title='Avoid being yesterday&apos;s sucker'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-4388967597440013025</id><published>2008-06-18T11:31:00.001-06:00</published><updated>2008-06-18T11:34:15.720-06:00</updated><title type='text'>some interesting ICML papers</title><content type='html'>Maybe too much.&lt;br /&gt;&lt;meta name="Originator" content="Microsoft Word 11"&gt;&lt;link rel="File-List" href="file:///C:%5CDOCUME%7E1%5Cltang08%5CLOCALS%7E1%5CTemp%5Cmsohtml1%5C01%5Cclip_filelist.xml"&gt;&lt;link rel="Edit-Time-Data" href="file:///C:%5CDOCUME%7E1%5Cltang08%5CLOCALS%7E1%5CTemp%5Cmsohtml1%5C01%5Cclip_editdata.mso"&gt;&lt;!--[if !mso]&gt; &lt;style&gt; v\:* {behavior:url(#default#VML);} o\:* {behavior:url(#default#VML);} w\:* {behavior:url(#default#VML);} .shape {behavior:url(#default#VML);} &lt;/style&gt; &lt;![endif]--&gt;&lt;!--[if gte mso 9]&gt;&lt;xml&gt;  &lt;w:worddocument&gt;   &lt;w:view&gt;Normal&lt;/w:View&gt;   &lt;w:zoom&gt;0&lt;/w:Zoom&gt;   &lt;w:punctuationkerning/&gt;   &lt;w:validateagainstschemas/&gt;   &lt;w:saveifxmlinvalid&gt;false&lt;/w:SaveIfXMLInvalid&gt;   &lt;w:ignoremixedcontent&gt;false&lt;/w:IgnoreMixedContent&gt;   &lt;w:alwaysshowplaceholdertext&gt;false&lt;/w:AlwaysShowPlaceholderText&gt;   &lt;w:compatibility&gt;    &lt;w:breakwrappedtables/&gt;    &lt;w:snaptogridincell/&gt;    &lt;w:wraptextwithpunct/&gt;    &lt;w:useasianbreakrules/&gt;    &lt;w:dontgrowautofit/&gt;   &lt;/w:Compatibility&gt;   &lt;w:browserlevel&gt;MicrosoftInternetExplorer4&lt;/w:BrowserLevel&gt;  &lt;/w:WordDocument&gt; &lt;/xml&gt;&lt;![endif]--&gt;&lt;!--[if gte mso 9]&gt;&lt;xml&gt;  &lt;w:latentstyles deflockedstate="false" latentstylecount="156"&gt;  &lt;/w:LatentStyles&gt; &lt;/xml&gt;&lt;![endif]--&gt;&lt;style&gt; &lt;!--  /* Style Definitions */  p.MsoNormal, li.MsoNormal, div.MsoNormal 	{mso-style-parent:""; 	margin:0in; 	margin-bottom:.0001pt; 	mso-pagination:widow-orphan; 	font-size:12.0pt; 	font-family:"Times New Roman"; 	mso-fareast-font-family:"Times New Roman";} h3 	{mso-margin-top-alt:auto; 	margin-right:0in; 	mso-margin-bottom-alt:auto; 	margin-left:0in; 	mso-pagination:widow-orphan; 	mso-outline-level:3; 	font-size:13.5pt; 	font-family:"Times New Roman"; 	font-weight:bold;} a:link, span.MsoHyperlink 	{color:blue; 	text-decoration:underline; 	text-underline:single;} a:visited, span.MsoHyperlinkFollowed 	{color:purple; 	text-decoration:underline; 	text-underline:single;} p 	{mso-margin-top-alt:auto; 	margin-right:0in; 	mso-margin-bottom-alt:auto; 	margin-left:0in; 	mso-pagination:widow-orphan; 	font-size:12.0pt; 	font-family:"Times New Roman"; 	mso-fareast-font-family:"Times New Roman";} @page Section1 	{size:8.5in 11.0in; 	margin:1.0in 1.25in 1.0in 1.25in; 	mso-header-margin:.5in; 	mso-footer-margin:.5in; 	mso-paper-source:0;} div.Section1 	{page:Section1;} --&gt; &lt;/style&gt;&lt;!--[if gte mso 10]&gt; &lt;style&gt;  /* Style Definitions */  table.MsoNormalTable 	{mso-style-name:"Table Normal"; 	mso-tstyle-rowband-size:0; 	mso-tstyle-colband-size:0; 	mso-style-noshow:yes; 	mso-style-parent:""; 	mso-padding-alt:0in 5.4pt 0in 5.4pt; 	mso-para-margin:0in; 	mso-para-margin-bottom:.0001pt; 	mso-pagination:widow-orphan; 	font-size:10.0pt; 	font-family:"Times New Roman"; 	mso-ansi-language:#0400; 	mso-fareast-language:#0400; 	mso-bidi-language:#0400;} &lt;/style&gt; &lt;![endif]--&gt;  &lt;h3&gt;The GroupLASSO for Generalized Linear Models: Uniqueness of Solutions and Efficient Algorithms&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Volker Roth and Bernd Fischer&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;The GroupLASSO method for finding important explanatory factors suffers from the potential non-uniqueness of solutions and also from high computational costs. We formulate conditions for the uniqueness of GroupLASSO solutions which lead to an easily implementable test procedure. In addition to merely detecting ambiguities in solutions, this testing procedure identifies all potentially active groups. These results are used to derive an efficient algorithm that can deal with input dimensions in the millions and can approximate the solution path efficiently. The derived methods are applied to large-scale learning problems where they exhibit excellent performance. We show that the proposed testing procedure helps to avoid misinterpretations of GroupLASSO solutions.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/113.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Dirichlet Component Analysis: Feature Extraction for Compositional Data&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Hua-Yan Wang, Qiang Yang, Hong Qin, and Hongbin Zha&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We consider feature extraction (dimensionality reduction) for compositional data, where the data vectors are constrained to be positive and constant-sum. In real-world problems, the data components (variables) usually have complicated "correlations" while their total number is huge. Such scenario demands feature extraction. That is, we shall de-correlate the components and reduce their dimensionality. Traditional techniques such as the Principle Component Analysis (PCA) are not suitable for these problems due to unique statistical properties and the need to satisfy the constraints in compositional data. This paper presents a novel approach to feature extraction for compositional data. Our method first identifies a family of dimensionality reduction projections that preserve all relevant constraints, and then finds the optimal projection that maximizes the estimated Dirichlet precision on projected data. It reduces the compositional data to a given lower dimensionality while the components in the lower-dimensional space are de-correlated as much as possible. We develop theoretical foundation of our approach, and validate its effectiveness on some synthetic and real-world datasets.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/129.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;paper ID: 145&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Pairwise Constraint Propagation by Semidefinite Programming for Semi-Supervised Classification&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Zhenguo Li, Jianzhuang Liu, and Xiaoou Tang&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We consider the general problem of learning from pairwise constraints and unlabeled data. The pairwise constraints specify whether two objects belong to the same class or not, known as the must-link constraints and the cannot-link constraints. We propose to learn a mapping that is smooth over the data graph and maps the data onto a unit hypersphere, where two must-link objects are mapped to the same point while two cannot-link objects are mapped to be orthogonal. We show that such a mapping can be achieved by formulating a semidefinite programming problem, which is convex and can be solved globally. Our approach can effectively propagate pairwise constraints to the whole data set. It can be directly applied to multi-class classification and can handle data labels, pairwise constraints, or a mixture of them in a unified framework. Promising experimental results are presented for classification tasks on a variety of synthetic and real data sets.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/145.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Localized Multiple Kernel Learning&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Mehmet Gonen and Ethem Alpaydin&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Recently, instead of selecting a single kernel, multiple kernel learning (MKL) has been proposed which uses a convex combination of kernels, where the weight of each kernel is optimized during training. However, MKL assigns the same weight to a kernel over the whole input space. In this paper, we develop a localized multiple kernel learning (LMKL) algorithm using a gating model for selecting the appropriate kernel function locally. The localizing gating model and the kernel-based classifier are coupled and their optimization is done in a joint manner. Empirical results on ten benchmark and two bioinformatics data sets validate the applicability of our approach. LMKL achieves statistically similar accuracy results compared with MKL by storing fewer support vectors. LMKL can also combine multiple copies of the same kernel function localized in different parts. For example, LMKL with multiple linear kernels gives better accuracy results than using a single linear kernel on bioinformatics data sets.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/158.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Uncorrelated Multilinear Principal Component Analysis through Successive Variance Maximization&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Haiping Lu, Konstantinos Plataniotis, and Anastasios Venetsanopoulos&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Tensorial data are frequently encountered in various machine learning tasks today and dimensionality reduction is one of their most important applications. This paper extends the classical principal component analysis (PCA) to its multilinear version by proposing a novel dimensionality reduction algorithm for tensorial data, named as uncorrelated multilinear PCA (UMPCA). UMPCA seeks a tensor-to-vector projection that captures most of the variation in the original tensorial input while producing uncorrelated features through successive variance maximization. We evaluate the proposed algorithm on a second-order tensorial problem, face recognition, and the experimental results show its superiority, especially in low-dimensional spaces, through the comparison with three other PCA-based algorithms.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/163.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;A Dual Coordinate Descent Method for Large-scale Linear SVM&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Cho-Jui Hsieh, Kai-Wei Chang, Chih-Jen Lin, S. Sathiya Keerthi, and S. Sundararajan&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;In many applications, data appear with a huge number of instances as well as features. Linear Support Vector Machines (SVM) is one of the most popular tools to deal with such large-scale sparse data. This paper presents a novel dual coordinate descent method for linear SVM with L1- and L2-loss functions. The proposed method is simple and reaches an epsilon-accurate solution in O(log (1/epsilon)) iterations. Experiments indicate that our method is much faster than state of the art solvers such as Pegasos, Tron, svmperf, and a recent primal coordinate descent implementation.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/166.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Efficient MultiClass Maximum Margin Clustering&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Bin Zhao, Fei Wang, and Changshui Zhang&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;This paper presents a cutting plane algorithm for multiclass maximum margin clustering (MMC). The proposed algorithm constructs a nested sequence of successively tighter relaxations of the original MMC problem, and each optimization problem in this sequence could be efficiently solved using the constrained concave-convex procedure (CCCP). Experimental evaluations on several real world datasets show that our algorithm converges much faster than existing MMC methods with guaranteed accuracy, and can thus handle much larger datasets efficiently.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/168.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Nearest Hyperdisk Methods for High-Dimensional Classification&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Hakan Cevikalp, Bill Triggs, and Robi Polikar&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;In high-dimensional classification problems it is infeasible to include enough training samples to cover the class regions densely. Irregularities in the resulting sparse sample distributions cause local classifiers such as Nearest Neighbors (NN) and kernel methods to have irregular decision boundaries. One solution is to "fill in the holes" by building a convex model of the region spanned by the training samples of each class and classifying examples based on their distances to these approximate models. Methods of this kind based on affine and convex hulls and bounding hyperspheres have already been studied. Here we propose a method based on the bounding hyperdisk of each class -- the intersection of the affine hull and the smallest bounding hypersphere of its training samples. We argue that in many cases hyperdisks are preferable to affine and convex hulls and hyperspheres: they bound the classes more tightly than affine hulls or hyperspheres while avoiding much of the sample overfitting and computational complexity that is inherent in high-dimensional convex hulls. We show that the hyperdisk method can be kernelized to provide nonlinear classifiers based on non-Euclidean distance metrics. Experiments on several classification problems show promising results.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/178.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Local Likelihood Modeling of Temporal Text Streams&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Guy Lebanon and Yang Zhao&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Temporal text data is often generated by a time-changing process or distribution. Such a drift in the underlying distribution cannot be captured by stationary likelihood techniques. We consider the application of local likelihood methods to generative and conditional modeling of temporal document sequences. We examine the asymptotic bias and variance and present an experimental study using the RCV1 dataset containing a temporal sequence of Reuters news stories.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/180.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Learning to Classify with Missing and Corrupted Features&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Ofer Dekel and Ohad Shamir&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;After a classifier is trained using a machine learning algorithm and put to use in a real world system, it often faces noise which did not appear in the training data. Particularly, some subset of features may be missing or may become corrupted. We present two novel machine learning techniques that are robust to this type of classification-time noise. First, we solve an approximation to the learning problem using linear programming. We analyze the tightness of our approximation and prove statistical risk bounds for this approach. Second, we define the online-learning variant of our problem, address this variant using a modified Perceptron, and obtain a statistical learning algorithm using an online-to-batch technique. We conclude with a set of experiments that demonstrate the effectiveness of our algorithms.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/202.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Fast Solvers and Efficient Implementations for Distance Metric Learning&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Kilian Weinberger and Lawrence Saul&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;In this paper we study how to improve nearest neighbor classification by learning a Mahalanobis distance metric. We build on a recently proposed framework for distance metric learning known as large margin nearest neighbor (LMNN) classification. Within this framework, we focus specifically on the challenges in scalability and adaptability posed by large data sets. Our paper makes three contributions. First, we describe a highly efficient solver for the particular instance of semidefinite programming that arises in LMNN classification; our solver can handle problems with billions of large margin constraints in a few hours. Second, we show how to reduce both training and testing times using metric ball trees; the speedups from ball trees are further magnified by learning low dimensional representations of the input space. Third, we show how to learn different Mahalanobis distance metrics in different parts of the input space. For large data sets, these mixtures of locally adaptive metrics lead to even lower error rates.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/215.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Random Classification Noise Defeats All Convex Potential Boosters&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Philip M. Long and Rocco A. Servedio&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;A broad class of boosting algorithms can be interpreted as performing coordinate-wise gradient descent to minimize some potential function of the margins of a data set. This class includes AdaBoost, LogitBoost, and other widely used and well-studied boosters. In this paper we show that for a broad class of convex potential functions, any such boosting algorithm is highly susceptible to random classification noise. We do this by showing that for any such booster and any nonzero random classification noise rate R, there is a simple data set of examples which is efficiently learnable by such a booster if there is no noise, but which cannot be learned to accuracy better than 1/2 if there is random classification noise at rate R. This negative result is in contrast with known branching program based boosters which do not fall into the convex potential function framework and which can provably learn to high accuracy in the presence of random classification noise.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/258.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Learning Diverse Rankings with Multi-Armed Bandits&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Filip Radlinski, Robert Kleinberg, and Thorsten Joachims&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Algorithms for learning to rank Web documents usually assume a document's relevance is independent of other documents. This leads to learned ranking functions that produce rankings with redundant results. In contrast, user studies have shown that diversity at high ranks is often preferred. We present two new learning algorithms that directly learn a diverse ranking of documents based on users' clicking behavior. We show that these algorithms minimize abandonment, or alternatively, maximize the probability that a relevant document is found in the top k positions of a ranking. We show that one of our algorithms asymptotically achieves the best possible payoff obtainable in polynomial time even as user's interests change. The other performs better empirically when user interests are static, and is still theoretically near-optimal in that case.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/264.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;SVM Optimization: Inverse Dependence on Training Set Size&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Shai Shalev-Shwartz and Nathan Srebro&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We discuss how the runtime of SVM optimization should decrease as the size of the training data increases. We present theoretical and empirical results demonstrating how a simple subgradient descent approach indeed displays such behavior, at least for linear kernels.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/266.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Training Structural SVMs when Exact Inference is Intractable&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Thomas Finley and Thorsten Joachims&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;While discriminative training (e.g., CRF, structural SVM) holds much promise for machine translation, image segmentation, and clustering, the complex inference these applications require make exact training intractable. This leads to a need for approximate training methods. Unfortunately, knowledge about how to perform efficient and effective approximate training is limited. Focusing on structural SVMs, we provide and explore algorithms for two different classes of approximate training algorithms, which we call undergenerating (e.g., greedy) and overgenerating (e.g., relaxations) algorithms. We provide a theoretical and empirical analysis of both types of approximate trained structural SVMs, focusing on fully connected pairwise Markov random fields. We find that models trained with overgenerating methods have theoretic advantages over undergenerating methods, are empirically robust relative to their undergenerating brethren, and relaxed trained models favor non-fractional predictions from relaxed predictors.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/279.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Graph Transduction via Alternating Minimization&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Jun Wang, Tony Jebara, and Shih-Fu Chang&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Graph transduction methods label input data by learning a classification function that is regularized to exhibit smoothness along a graph over labeled and unlabeled samples. In practice, these algorithms are sensitive to the initial set of labels provided by the user. For instance, classification accuracy drops if the training set contains weak labels, if imbalances exist across label classes or if the labeled portion of the data is not chosen at random. This paper introduces a propagation algorithm that more reliably minimizes a cost function over both a function on the graph and a binary label matrix. The cost function generalizes prior work in graph transduction and also introduces node normalization terms for resilience to label imbalances. We demonstrate that global minimization of the function is intractable but instead provide an alternating minimization scheme that incrementally adjusts the function and the labels towards a reliable local minimum. Unlike prior methods, the resulting propagation of labels does not prematurely commit to an erroneous labeling and obtains more consistent labels. Experiments are shown for synthetic and real classification tasks including digit and text recognition. A substantial improvement in accuracy compared to state of the art semi-supervised methods is achieved. The advantage are even more dramatic when labeled instances are limited.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/296.pdf"&gt;Full paper&lt;/a&gt;]&lt;!-- [&lt;a href="papers/DokuWikiLink#296"&gt;Discussion&lt;/a&gt;]--&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Grassmann Discriminant Analysis: a Unifying View on Subspace-Based Learning&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Jihun Hamm and Daniel Lee&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;In this paper we propose a discriminant learning framework for problems in which data consist of linear subspaces instead of vectors. By treating subspaces as basic elements, we can make learning algorithms adapt naturally to the problems with linear invariant structures. We propose a unifying view on the subspace-based learning method by formulating the problems on the Grassmann manifold, which is the set of fixed-dimensional subspaces of a Euclidean space. Previous methods on the problem typically adopt an inconsistent strategy: feature extraction is performed in the Euclidean space while non-Euclidean dissimilarity measures are used. In our approach, we treat each subspace as a point in the Grassmann space, and perform feature extraction and classification in the same space. We show feasibility of the approach by using the Grassmann kernel functions such as the Projection kernel and the Binet-Cauchy kernel. Experiments with real image databases show that the proposed method performs well compared with state-of-the-art algorithms.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/312.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;On-line Discovery of Temporal-Difference Networks&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Takaki Makino and Toshihisa Takagi&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We present an algorithm for on-line, incremental discovery of temporal-difference (TD) networks. The key contribution is the establishment of three criteria to expand a node in TD network: a node is expanded when the node is well-known, independent, and has a prediction error that requires further explanation. Since none of these criteria requires centralized calculation operations, they are easily computed in a parallel and distributed manner, and scalable for bigger problems compared to other discovery methods of predictive state representations. Through computer experiments, we demonstrate the empirical effectiveness of our algorithm.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/317.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Confidence-Weighted Linear Classification&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Mark Dredze, Koby Crammer, and Fernando Pereira&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We introduce confidence-weighted linear classifiers, a new class of algorithms that maintain confidence information about classifier parameters. Learning in this framework updates parameters by estimating weights and increasing model confidence. We investigate a new online algorithm that maintains a Gaussian distribution over weight vectors, updating the mean and variance of the model with each instance. Empirical evaluation on a range of NLP tasks show that our algorithm improves over other state of the art online and batch methods, learns faster in the online setting, and lends itself to better classifier combination after parallel training.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/322.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;On the Chance Accuracies of Large Collections of Classifiers&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Mark Palatucci and Andrew Carlson&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We provide a theoretical analysis of the chance accuracies of large collections of classifiers. We show that on problems with small numbers of examples, some classifier can perform well by random chance, and we derive a theorem to explicitly calculate this accuracy. We use this theorem to provide a principled feature selection criteria for sparse, high-dimensional problems. We evaluate this method on both microarray and fMRI datasets and show that it performs very close to the optimal accuracy obtained from an oracle. We also show that on the fMRI dataset this technique chooses relevant features successfully while another state-of-the-art method, the False Discovery Rate (FDR), completely fails at standard significance levels.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/323.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Efficient Projections onto the L1-Ball for Learning in High Dimensions&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;John Duchi, Shai Shalev-Shwartz, Yoram Singer, and Tushar Chandra&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We describe efficient algorithms for projecting a vector onto the L1-ball. We present two methods for projection. The first performs exact projection in O(n) time, where n is the dimension of the space. The second works on vectors k of whose elements are perturbed outside the L1-ball, projecting in O(k log(n)) time. This setting is especially useful for online learning in sparse feature spaces such as text categorization applications. We demonstrate the merits and effectiveness of our algorithms in numerous batch and online learning tasks. We show that variants of stochastic gradient projection methods augmented with our efficient projection procedures outperform state-of-the-art optimization techniques such as interior point methods. We also show that in online settings gradient updates with L1 projections outperform the EG algorithm while obtaining models with high degrees of sparsity.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/361.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Tailoring Density Estimation via Reproducing Kernel Moment Matching&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Le Song, Xinhua Zhang, Alex Smola, Arthur Gretton, and Bernhard Schoelkopf&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Moment matching is a popular means of parametric density estimation. We extend this technique to nonparametric estimation of mixture models. Our approach works by embedding distributions into a reproducing kernel Hilbert space, and performing moment matching in that space. This allows us to tailor density estimators to a function class of interest (i.e., for which we would like to compute expectations). We show our density estimation approach is useful in applications such as message compression in graphical models, and image classification and retrieval.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/377.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Graph Kernels Between Point Clouds&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Francis Bach&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Point clouds are sets of points in two or three dimensions. Most kernel methods for learning on sets of points have not yet dealt with the specific geometrical invariances and practical constraints associated with point clouds in computer vision and graphics. In this paper, we present extensions of graph kernels for point clouds, which allow to use kernel methods for such objects as shapes, line drawings, or any three-dimensional point clouds. In order to design rich and numerically efficient kernels with as few free parameters as possible, we use kernels between covariance matrices and their factorizations on graphical models. We derive polynomial time dynamic programming recursions and present applications to recognition of handwritten digits and Chinese characters from few training examples.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/379.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Learning Dissimilarities by Ranking: From SDP to QP&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Hua Ouyang and Alexander Gray&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We consider the problem of learning dissimilarities between points via formulations which preserve a specified ordering between points rather than the numerical values of the dissimilarities. Dissimilarity ranking (d-ranking) learns from instances like "A is more similar to B than C is to D" or "The distance between E and F is larger than that between G and H". Three formulations of d-ranking problems are presented and new algorithms are presented for two of them, one by semidefinite programming (SDP) and one by quadratic programming (QP). Among the novel capabilities of these approaches are out-of-sample prediction and scalability to large problems.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/392.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;The Skew Spectrum of Graphs&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Risi Kondor and Karsten Borgwardt&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;The central issue in representing graph-structured data instances in learning algorithms is designing features which are invariant to permuting the numbering of the vertices. We present a new system of invariant graph features which we call the skew spectrum of graphs. The skew spectrum is based on mapping the adjacency matrix to a function on the symmetric group and computing bispectral invariants. The reduced form of the skew spectrum is computable in O(n&lt;sup&gt;3&lt;/sup&gt;) time, and experiments show that on several benchmark datasets it can outperform state of the art graph kernels.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/396.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Accurate Max-margin Training for Structured Output Spaces&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Sunita Sarawagi and Rahul Gupta&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Tsochantaridis et al 2005 proposed two formulations for maximum margin training of structured spaces: margin scaling and slack scaling. While margin scaling has been extensively used since it requires the same kind of MAP inference as normal structured prediction, slack scaling is believed to be more accurate and better-behaved. We present an efficient variational approximation to the slack scaling method that solves its inference bottleneck while retaining its accuracy advantage over margin scaling. We further argue that existing scaling approaches do not separate the true labeling comprehensively while generating violating constraints. We propose a new max-margin trainer PosLearn that generates violators to ensure separation at each position of a decomposable loss function. Empirical results on real datasets illustrate that PosLearn can reduce test error by up to 25%. Further, PosLearn violators can be generated more efficiently than slack violators; for many structured tasks the time required is just twice that of MAP inference.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/402.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Optimized Cutting Plane Algorithm for Support Vector Machines&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Vojtech Franc and Soeren Sonnenburg&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We have developed a new Linear Support Vector Machine (SVM) training algorithm called OCAS. Its computational effort scales linearly with the sample size. In an extensive empirical evaluation OCAS significantly outperforms current state of the art SVM solvers, like SVMLight, SVMPerf and BMRM, achieving speedups of over 1,000 on some datasets over SVMLight and 20 over SVMPerf, while obtaining the same precise Support Vector solution. OCAS even in the early optimization steps shows often faster convergence than the so far in this domain prevailing approximative methods SGD and Pegasos. Effectively parallelizing OCAS we were able to train on a dataset of size 15 million examples (itself about 32GB in size) in just 671 seconds --- a competing string kernel SVM required 97,484 seconds to train on 10 million examples sub-sampled from this dataset.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/411.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Self-taught Clustering&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Wenyuan Dai, Qiang Yang, Gui-Rong Xue, and Yong Yu&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;This paper focuses on a new clustering task, called &lt;i&gt;self-taught clustering&lt;/i&gt;. Self-taught clustering is an instance of &lt;i&gt;unsupervised transfer learning&lt;/i&gt;, which aims at clustering a small collection of target unlabeled data with the help of a large amount of &lt;i&gt;auxiliary&lt;/i&gt; unlabeled data. The target and auxiliary data can be different in topic distribution. We show that even when the target data are not sufficient to allow effective learning of a high quality feature representation, it is possible to learn the useful features with the help of the auxiliary data on which the target data can be clustered effectively. We propose a co-clustering based self-taught clustering algorithm to tackle this problem, by clustering the target and auxiliary data simultaneously to allow the feature representation from the auxiliary data to influence the target data through a common set of features. Under the new data representation, clustering on the target data can be improved. Our experiments on image clustering show that our algorithm can greatly outperform several state-of-the-art clustering methods when utilizing irrelevant unlabeled auxiliary data.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/432.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;A Quasi-Newton Approach to Nonsmooth Convex Optimization&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Jin Yu, S.V.N. Vishwanathan, Simon Guenter, and Nicol Schraudolph&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We extend the well-known BFGS quasi-Newton method and its limited-memory variant (LBFGS) to the optimization of nonsmooth convex objectives. This is done in a rigorous fashion by generalizing three components of BFGS to subdifferentials: The local quadratic model, the identification of a descent direction, and the Wolfe line search conditions. We apply the resulting sub(L)BFGS algorithm to &lt;i&gt;L&lt;sub&gt;2&lt;/sub&gt;&lt;/i&gt;-regularized risk minimization with binary hinge loss, and its direction-finding component to &lt;i&gt;L&lt;sub&gt;1&lt;/sub&gt;&lt;/i&gt;-regularized risk minimization with logistic loss. In both settings our generic algorithms perform comparable to or better than their counterparts in specialized state-of-the-art solvers.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/461.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Predicting Diverse Subsets Using Structural SVMs&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Yisong Yue and Thorsten Joachims&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;In many retrieval tasks, one important goal involves retrieving a diverse set of results (e.g., documents covering a wide range of topics for a search query). First of all, this reduces redundancy, effectively presenting more information with the presented results. Secondly, search queries are often ambiguous at some level. For example, the query “Jaguar” can refer to many different topics (such as the car or the feline). A set of documents with high topic diversity ensures that fewer users abandon the query because none of the results are relevant to them. Unlike existing approaches to learning retrieval functions, we present a method that explicitly trains to diversify results. In particular, we formulate the learning problem of predicting a diverse subset and derive a training algorithm based on structural SVMs.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/470.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Improved Nystrom Low-Rank Approximation and Error Analysis&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Kai Zhang, Ivor Tsang, and James Kwok&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Low-rank matrix approximation is an effective tool in alleviating the memory and computational burdens of kernel methods and sampling, as the mainstream of such algorithms, has drawn considerable attention in both theory and practice. This paper presents detailed studies on the Nystrom sampling scheme and in particular, an error analysis that directly relates the Nystrom approximation quality with the encoding powers of the landmark points in summarizing the data. The resultant error bound suggests a simple and efficient sampling scheme, the &lt;i&gt;k&lt;/i&gt;-means clustering algorithm, for Nystrom low-rank approximation. We compare it with state-of-the-art approaches that range from greedy schemes to probabilistic sampling. Our algorithm achieves significant performance gains in a number of supervised/unsupervised learning tasks including kernel PCA and least squares SVM.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/476.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Expectation-Maximization for Sparse and Non-Negative PCA&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Christian David Sigg and Joachim M. Buhmann&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We study the problem of finding the dominant eigenvector of the sample covariance matrix, under additional constraints on its elements: a cardinality constraint limits the number of non-zero elements, and non-negativity forces the elements to have equal sign. This problem is known as sparse and non-negative principal component analysis (PCA), and has many applications including dimensionality reduction and feature selection. Based on expectation-maximization for probabilistic PCA, we present an algorithm for any combination of these constraints. Its complexity is at most quadratic in the number of dimensions of the data. We demonstrate significant improvements in performance and computational efficiency compared to the state-of-the-art, using large data sets from biology and computer vision.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/484.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Training SVM with Indefinite Kernels&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Jianhui Chen and Jieping Ye&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Similarity matrices generated from many applications may not be positive semidefinite, and hence can't fit into the kernel machine framework. In this paper, we study the problem of training support vector machines with an indefinite kernel. We consider a regularized SVM formulation, in which the indefinite kernel matrix is treated as a noisy observation of some unknown positive semidefinite one (proxy kernel) and the support vectors and the proxy kernel can be computed simultaneously. We propose a semi-infinite quadratically constrained linear program formulation for the optimization, which can be solved iteratively to find a global optimum solution. We further propose to employ an additional pruning strategy, which significantly improves the efficiency of the algorithm, while retaining the convergence property of the algorithm. In addition, we show the close relationship between the proposed formulation and multiple kernel learning. Experiments on a collection of benchmark data sets demonstrate the efficiency and effectiveness of the proposed algorithm.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/531.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;An RKHS for Multi-View Learning and Manifold Co-Regularization&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Vikas Sindhwani and David Rosenberg&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Inspired by co-training, many multi-view semi-supervised kernel methods implement the following idea: find a function in each of multiple Reproducing Kernel Hilbert Spaces (RKHSs) such that (a) the chosen functions make similar predictions on unlabeled examples, and (b) the average prediction given by the chosen functions performs well on labeled examples. In this paper, we construct a single RKHS with a data-dependent “co-regularization” norm that reduces these approaches to standard supervised learning. The reproducing kernel for this RKHS can be explicitly derived and plugged into any kernel method, greatly extending the theoretical and algorithmic scope of co-regularization. In particular, with this development, the Rademacher complexity bound for co-regularization given in (Rosenberg &amp;amp; Bartlett, 2007) follows easily from well-known results. Furthermore, more refined bounds given by localized Rademacher complexity can also be easily applied. We propose a co-regularization based algorithmic alternative to manifold regularization (Belkin et al., 2006; Sindhwani et al., 2005a) that leads to major empirical improvements on semi-supervised tasks. Unlike the recently proposed transductive approach of (Yu et al., 2008), our RKHS formulation is truly semi-supervised and naturally extends to unseen test data.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/641.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;A Generalization of Haussler's Convolution Kernel - Mapping Kernel&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Kilho Shin and Tetsuji Kuboyama&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Haussler's convolution kernel provides a successful framework for engineering new positive semidefinite kernels, and has been applied to a wide range of data types and applications. In the framework, each data object represents a finite set of finer grained components. Then, Haussler's convolution kernel takes a pair of data objects as input, and returns the sum of the return values of the predetermined primitive kernel calculated for all the possible pairs of the components of the input data objects. Due to the definition, Haussler's convolution kernel is also known as the cross product kernel, and is positive semidefinite, if so is the primitive kernel. On the other hand, the &lt;i&gt;mapping kernel&lt;/i&gt; that we introduce in this paper is a natural generalization of Haussler's convolution kernel, in that the input to the primitive kernel moves over a predetermined subset rather than the entire cross product. Although we have plural instances of the mapping kernel in the literature, their positive semidefiniteness was investigated in case-by-case manners, and worse yet, was sometimes incorrectly concluded. In fact, there exists a simple and easily checkable necessary and sufficient condition, which is generic in the sense that it enables us to investigate the positive semidefiniteness of an arbitrary instance of the mapping kernel. This is the first paper that presents and proves the validity of the condition. In addition, we introduce two important instances of the mapping kernel, which we refer to as the &lt;i&gt;size-of-index-structure-distribution&lt;/i&gt; kernel and the &lt;i&gt;edit-cost-distribution&lt;/i&gt; kernel. Both of them are naturally derived from well known (dis)similarity measurements in the literature (e.g. the maximum agreement tree, the edit distance), and are reasonably expected to improve the performance of the existing measures by evaluating their distributional features rather than their peak (maximum/minimum) features.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/643.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Composite Kernel Learning&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Marie Szafranski, Yves Grandvalet, and Alain Rakotomamonjy&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;The Support Vector Machine (SVM) is an acknowledged powerful tool for building classifiers, but it lacks flexibility, in the sense that the kernel is chosen prior to learning. Multiple Kernel Learning (MKL) enables to learn the kernel, from an ensemble of basis kernels, whose combination is optimized in the learning process. Here, we propose Composite Kernel Learning to address the situation where distinct components give rise to a group structure among kernels. Our formulation of the learning problem encompasses several setups, putting more or less emphasis on the group structure. We characterize the convexity of the learning problem, and provide a general wrapper algorithm for computing solutions. Finally, we illustrate the behavior of our method on multi-channel data where groups correspond to channels.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/665.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Nonnegative Matrix Factorization via Rank-One Downdate&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Michael Biggs, Ali Ghodsi, and Stephen Vavasis&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;Nonnegative matrix factorization (NMF) was popularized as a tool for data mining by Lee and Seung in 1999. NMF attempts to approximate a matrix with nonnegative entries by a product of two low-rank matrices, also with nonnegative entries. We propose an algorithm called rank-one downdate (R1D) for computing a NMF that is partly motivated by singular value decomposition. This algorithm computes the dominant singular values and vectors of adaptively determined submatrices of a matrix. On each iteration, R1D extracts a rank-one submatrix from the dataset according to an objective function. We establish a theoretical result that maximizing this objective function corresponds to correctly classifying articles in a nearly separable corpus. We also provide computational experiments showing the success of this method in identifying features in realistic datasets. The method is much faster than either LSI or other NMF routines.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/667.pdf"&gt;Full paper&lt;/a&gt;]&lt;!-- [&lt;a href="papers/DokuWikiLink#667"&gt;Discussion&lt;/a&gt;]--&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;div class="MsoNormal" style="text-align: center;" align="center"&gt;  &lt;hr size="2" width="100%" align="center"&gt;  &lt;/div&gt;  &lt;p&gt;&lt;a name="668"&gt;&lt;/a&gt;paper ID: 668&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;h3&gt;Closed-form Supervised Dimensionality Reduction with Generalized Linear Models&lt;o:p&gt;&lt;/o:p&gt;&lt;/h3&gt;  &lt;p&gt;&lt;i&gt;Irina Rish, Genady Grabarnilk, Guillermo Cecchi, Francisco Pereira, and Geoffrey J. Gordon&lt;/i&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;We propose a family of supervised dimensionality reduction (SDR) algorithms that combine feature extraction (dimensionality reduction) with learning a predictive model in a unified optimization framework, using data- and class-appropriate generalized linear models (GLMs), and handling both classification and regression problems. Our approach uses simple closed-form update rules and is provably convergent. Promising empirical results are demonstrated on a variety of high-dimensional datasets.&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p&gt;[&lt;a href="http://icml2008.cs.helsinki.fi/papers/668.pdf"&gt;Full paper&lt;/a&gt;]&lt;o:p&gt;&lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt; &lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-4388967597440013025?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/4388967597440013025/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=4388967597440013025' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4388967597440013025'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4388967597440013025'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2008/06/some-interesting-icml-papers.html' title='some interesting ICML papers'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8712124725705613109</id><published>2008-03-17T09:45:00.004-06:00</published><updated>2008-03-17T09:57:42.846-06:00</updated><title type='text'>Tibet Riot</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_5cXrM_tJ7aw/R96SvRE2SKI/AAAAAAAAAAc/pP6J2HdngBw/s1600-h/cnn.jpg"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://4.bp.blogspot.com/_5cXrM_tJ7aw/R96SvRE2SKI/AAAAAAAAAAc/pP6J2HdngBw/s320/cnn.jpg" alt="" id="BLOGGER_PHOTO_ID_5178737962257893538" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;It seems that all the governments and media will do certain degree of censorship. The above figure is just an example. The left one is used in CNN to show that Chinese communist government is trying to overwhelm the riot. The right one shows the original picture. As can be seen, several Tibetan is trying to hurt some innocent people walking by the street.&lt;br /&gt;&lt;br /&gt;DaLai only cares about the independence of Tibet but never morn for those innocent Han or Hui people who were killed during the riot. Ridiculous!&lt;br /&gt;&lt;br /&gt;Here is one foreigner who is traveling in Lhasa  and experienced the riot. &lt;br /&gt;&lt;a href="http://kadfly.blogspot.com/"&gt;http://kadfly.blogspot.com/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;and here is a video took by an Italian to show that those people were attacking an innocent motorcyclist.&lt;br /&gt;&lt;a href="http://rapidshare.de/files/38832674/MVI_0483.AVI.html"&gt;http://rapidshare.de/files/38832674/MVI_0483.AVI.html  &lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Also, for those who support "Tibet Independence", please check out the following videos. Tibet will always be part of China.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.youtube.com/watch?v=x9QNKB34cJo"&gt; http://www.youtube.com/watch?v=x9QNKB34cJo&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8712124725705613109?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8712124725705613109/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8712124725705613109' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8712124725705613109'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8712124725705613109'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2008/03/tibet-riot.html' title='Tibet Riot'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_5cXrM_tJ7aw/R96SvRE2SKI/AAAAAAAAAAc/pP6J2HdngBw/s72-c/cnn.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6138585262099065798</id><published>2008-03-02T17:17:00.002-07:00</published><updated>2008-03-02T17:20:16.832-07:00</updated><title type='text'>Using Samba in Ubuntu</title><content type='html'>Really simple!&lt;br /&gt;&lt;br /&gt;Right click the folder actually has an effect of sharing folders.&lt;br /&gt;&lt;br /&gt;All you need is to add a smbuser by the following command:&lt;br /&gt;smbpasswd -a user&lt;br /&gt;&lt;br /&gt;and then restart the service by the following command:&lt;br /&gt;sudo /etc/init.d/samba restart&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6138585262099065798?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6138585262099065798/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6138585262099065798' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6138585262099065798'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6138585262099065798'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2008/03/using-samba-in-ubuntu.html' title='Using Samba in Ubuntu'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8836826995880922273</id><published>2007-12-12T17:11:00.000-07:00</published><updated>2007-12-12T17:15:45.394-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='linux'/><title type='text'>Daily Linux (1) -- FreeNX, remote access, list files</title><content type='html'>1. Here is a very useful instruction to &lt;a href="http://ubuntu-utah.ubuntuforums.org/showthread.php?t=620057"&gt;install FreeNX server on Ubuntu Gusty&lt;/a&gt;,&lt;br /&gt;&lt;br /&gt; and it works pretty cool! Much much faster than VNC.&lt;br /&gt;&lt;br /&gt;2.  To list files matches certain pattern in one directory, use the following command:&lt;br /&gt;&lt;br /&gt;find . -name "*.c"&lt;br /&gt;&lt;br /&gt;This basically find all the c-files in a directory.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8836826995880922273?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8836826995880922273/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8836826995880922273' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8836826995880922273'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8836826995880922273'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/12/daily-linux-1-freenx-remote-access-list.html' title='Daily Linux (1) -- FreeNX, remote access, list files'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-1781334570014456338</id><published>2007-11-20T10:06:00.000-07:00</published><updated>2007-11-20T10:08:34.350-07:00</updated><title type='text'>dynamic topic models</title><content type='html'>Recently, I'm pretty interested in topic models.&lt;br /&gt;But this is very difficult to follow.&lt;br /&gt;&lt;br /&gt;Instead, I'll read some material about Kalman filtering and Wavelet first.&lt;br /&gt;&lt;br /&gt;Learning is endless...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-1781334570014456338?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/1781334570014456338/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=1781334570014456338' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1781334570014456338'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1781334570014456338'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/11/dynamic-topic-models.html' title='dynamic topic models'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-7471842463954258221</id><published>2007-11-19T17:49:00.001-07:00</published><updated>2007-11-19T17:52:08.610-07:00</updated><title type='text'>NIPS 08  proceeding available</title><content type='html'>Here are some interesting paper I'm planning to read or browse.&lt;br /&gt;My biggest concern is how effective is the work. It seems currently most are beautiful with formulas, but can not even beat the simplest method.&lt;br /&gt;&lt;br /&gt;It's always human who make the world so complicated.&lt;br /&gt;My belief is "&lt;span style="color: rgb(0, 0, 153);font-size:130%;" &gt;The World is Simple!&lt;/span&gt;"&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Heterogeneous Component Analysis&lt;br /&gt;Shigeyuki Oba, Motoaki Kawanabe, Klaus-Robert Müller, Shin Ishii&lt;br /&gt;&lt;br /&gt;Neural characterization in partially observed populations of spiking neurons&lt;br /&gt;Jonathan Pillow, Peter Latham&lt;br /&gt;&lt;br /&gt;Probabilistic Matrix Factorization&lt;br /&gt;Ruslan Salakhutdinov, Andriy Mnih&lt;br /&gt;&lt;br /&gt;Hidden Common Cause Relations in Relational Learning&lt;br /&gt;Ricardo Silva, Wei Chu, Zoubin Ghahramani&lt;br /&gt;&lt;br /&gt;Hierarchical Penalization&lt;br /&gt;Marie Szafranski, Yves Grandvalet, Pierre Morizet-Mahoudeaux&lt;br /&gt;&lt;br /&gt;Learning with Transformation Invariant Kernels&lt;br /&gt;Christian Walder&lt;br /&gt;&lt;br /&gt;A Spectral Regularization Framework for Multi-Task Structure Learning&lt;br /&gt;Andreas Argyriou, Charles A. Micchelli, Massimiliano Pontil, Yiming Ying&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Supervised Topic Models&lt;br /&gt;David Blei, Jon McAuliffe&lt;br /&gt;&lt;br /&gt;Learning Bounds for Domain Adaptation&lt;br /&gt;John Blitzer, Koby Crammer, Alex Kulesza, Fernando Pereira, Jennifer Wortman&lt;br /&gt;&lt;br /&gt;Multi-task Gaussian Process Prediction&lt;br /&gt;Edwin Bonilla, Kian Ming Chai, Chris Williams&lt;br /&gt;&lt;br /&gt;Automatic Generation of Social Tags for Music Recommendation&lt;br /&gt;Douglas Eck, Paul Lamere, Thierry Bertin-Mahieux, Stephen Green&lt;br /&gt;&lt;br /&gt;Kernel Measures of Conditional Dependence&lt;br /&gt;Kenji Fukumizu, Arthur Gretton, Xiaohai Sun, Bernhard Sch??lkopf&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-7471842463954258221?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/7471842463954258221/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=7471842463954258221' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7471842463954258221'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7471842463954258221'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/11/nips-08-proceeding-available.html' title='NIPS 08  proceeding available'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6865271412902176952</id><published>2007-11-05T20:13:00.001-07:00</published><updated>2007-11-05T20:16:02.697-07:00</updated><title type='text'>What do you see?</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_5cXrM_tJ7aw/Ry_cT0YFLaI/AAAAAAAAAAU/IHA3AI7OllA/s1600-h/lispnd7.png"&gt;&lt;img style="margin: 0px auto 10px; display: block; text-align: center; cursor: pointer;" src="http://2.bp.blogspot.com/_5cXrM_tJ7aw/Ry_cT0YFLaI/AAAAAAAAAAU/IHA3AI7OllA/s400/lispnd7.png" alt="" id="BLOGGER_PHOTO_ID_5129560733633555874" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6865271412902176952?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6865271412902176952/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6865271412902176952' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6865271412902176952'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6865271412902176952'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/11/what-do-you-see.html' title='What do you see?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_5cXrM_tJ7aw/Ry_cT0YFLaI/AAAAAAAAAAU/IHA3AI7OllA/s72-c/lispnd7.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-4568200466299906084</id><published>2007-08-27T09:40:00.000-06:00</published><updated>2007-08-27T09:41:16.545-06:00</updated><title type='text'>A funny joke</title><content type='html'>&lt;span style="font-family:楷体_GB2312;font-size:130%;color:#006666;"&gt;狼来了，经过山洞，见兔子面前放台笔记本电脑，噼噼噼的乱敲。 狼说：“兔子！干嘛呢？” “写论文！” 狼想：“世道真变啦，兔子也能写论文？”。 “啥题目？” “论兔子比狼厉害！” “你有病吧？” “你进洞瞧瞧！我收集好多资料！” 狼进洞了，半天没出来。天近黄昏，兔子收拾起笔记本电脑，回山洞了。 洞中一只狮子正在剔牙。 见兔子进来，拍着胸脯狂笑：“早跟你说了，不在乎你写什么题目，什么内容，关键是 ，你导师是谁！&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-4568200466299906084?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/4568200466299906084/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=4568200466299906084' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4568200466299906084'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4568200466299906084'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/08/funny-joke.html' title='A funny joke'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-1367148312138749707</id><published>2007-08-22T22:33:00.000-06:00</published><updated>2007-08-22T22:38:17.534-06:00</updated><title type='text'>Research is a joke?</title><content type='html'>Just read the chapter of Quasi-Newton Methods and came across this paragraph:&lt;br /&gt;"The first quasi-Newton algorithm turned out to be one of the most creative ideas in nonlinear optimization. ....  An interesting historical irony is that Davidon's paper was not accepted for publication; it remained as a technical report for more than thirty years until it appeared in the first issue of the SIAM Journal on Optimization in 1991."&lt;br /&gt;&lt;br /&gt;It seems that high-quality work is more difficult to publish.&lt;br /&gt;&lt;br /&gt;However, computer science still generates thousands of papers each year.  How many paper will be remembered after 5 years?&lt;br /&gt;&lt;br /&gt;Fast food is everywhere. This also includes "fast paper". Ridiculous!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-1367148312138749707?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/1367148312138749707/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=1367148312138749707' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1367148312138749707'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1367148312138749707'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/08/research-is-joke.html' title='Research is a joke?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6721375037299350145</id><published>2007-07-18T00:29:00.000-06:00</published><updated>2007-07-18T00:45:13.226-06:00</updated><title type='text'>SIGIR &amp; KDD papers</title><content type='html'>&lt;p class="MsoNormal"&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;span style="color: rgb(51, 0, 153);"&gt;SIGIR'07&lt;/span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p class="MsoNormal"&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;An InterActive Algorithm For Asking And Incorporating Feature Feedback into Support Vector Machines&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;Hema Raghavan, James Allan&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;Analyzing Feature Trajectories for Event Detection&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;QI HE, Kuiyu Chang, Ee-Peng Lim&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;strong&gt;&lt;span lang="EN-US"&gt;New Event Detection Based on Indexing-tree and Named Entity&lt;/span&gt;&lt;/strong&gt;&lt;span lang="EN-US"&gt;&lt;br /&gt;Kuo ZHANG, JuanZi LI, Gang WU&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal" style="text-align: left;" align="left"&gt;&lt;b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;Session 12: Learning to Rank I (Wed 09:00-10:30)&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;  &lt;p class="MsoNormal" style="text-align: left;" align="left"&gt;&lt;i&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;Chair: Bruce Croft&lt;/span&gt;&lt;/i&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;, Foyer Room &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;ul type="disc"&gt;&lt;li class="MsoNormal" style="text-align: left;"&gt;&lt;b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;A Support      Vector Method for Optimizing Average Precision&lt;/span&gt;&lt;/b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;&lt;br /&gt;     Yisong Yue, Thomas Finley, Filip Radlinski, Thorsten Joachims&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;li class="MsoNormal" style="text-align: left;"&gt;&lt;b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;Ranking with      Multiple Hyperplanes&lt;/span&gt;&lt;/b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;&lt;br /&gt;     Tie-Yan Liu, Tao Qin, Wei Lai, Xu-Dong Zhang, De-Sheng Wang, Hang Li&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;li class="MsoNormal" style="text-align: left;"&gt;&lt;b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;A Regression      Framework for Learning Ranking Functions Using Relative Relevance      Judgments&lt;/span&gt;&lt;/b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;&lt;br /&gt;     Zhaohui Zheng, Hongyuan Zha, Keke Chen, Gordon Sun&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;  &lt;p class="MsoNormal" style="text-align: left;" align="left"&gt;&lt;b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;Session 16: Learning to Rank II (Wed 14:30-16:30)&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/b&gt;&lt;/p&gt;  &lt;p class="MsoNormal" style="text-align: left;" align="left"&gt;&lt;i&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;Chair: Keith van Rijsbergen&lt;/span&gt;&lt;/i&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;, Grand Ballroom &lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;ul type="disc"&gt;&lt;li class="MsoNormal" style="text-align: left;"&gt;&lt;b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;FRank: A      Ranking Method with Fidelity Loss&lt;/span&gt;&lt;/b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;&lt;br /&gt;     Ming-Feng Tsai, Tie-Yan Liu, Tao Qin, Hsin-Hsi Chen, Wei-Ying Ma&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;li class="MsoNormal" style="text-align: left;"&gt;&lt;b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;AdaRank: A      Boosting Algorithm for Information Retrieval&lt;/span&gt;&lt;/b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;&lt;br /&gt;     Jun Xu, Hang Li&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;li class="MsoNormal" style="text-align: left;"&gt;&lt;b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;A Combined      Component Approach for Finding Collection-Adapted Ranking Functions based      on Genetic Programming&lt;/span&gt;&lt;/b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;&lt;br /&gt;     Humberto Almeida, Marcos Goncalves, Marco Cristo, Pavel Calado&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;li class="MsoNormal" style="text-align: left;"&gt;&lt;b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;Feature Selection      for Ranking&lt;/span&gt;&lt;/b&gt;&lt;span style="font-size: 12pt; font-family: 宋体;" lang="EN-US"&gt;&lt;br /&gt;     Tie-Yan Liu, Xiubo Geng, Tao Qin&lt;o:p&gt;&lt;/o:p&gt;&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;  &lt;p class="MsoNormal"&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p style="color: rgb(51, 0, 153);" class="MsoNormal"&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; KDD'07&lt;/o:p&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Active Exploration for Learning Rankings from Clickthrough Data &lt;/span&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;- &lt;i&gt;Filip Radlinski and Thorsten Joachims&lt;o:p&gt;&lt;/o:p&gt;&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Co-clustering based Classification for Out-of-domain Documents &lt;/span&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;- &lt;i&gt;Wenyuan Dai, Gui-Rong Xue, Qiang Yang, and Yong Yu&lt;o:p&gt;&lt;/o:p&gt;&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Enhancing Semi-Supervised Clustering: A Feature Projection Perspective &lt;/span&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;- &lt;i&gt;Wei Tang, Hui Xiong, Shi Zhong, and Jie Wu&lt;o:p&gt;&lt;/o:p&gt;&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Evolutionary Spectral Clustering by Incorporating Temporal Smoothness &lt;/span&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;- &lt;i&gt;Yun Chi, Xiaodan Song, Dengyong Zhou, Koji Hino, and Belle Tseng&lt;o:p&gt;&lt;/o:p&gt;&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Mining Statistically Important Equivalence Classes &lt;/span&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;- &lt;i&gt;Jinyan Li, Guimei Liu, and Limsoon Wong&lt;o:p&gt;&lt;/o:p&gt;&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Model-Shared Subspace Boosting for Multi-label Classification &lt;/span&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;- &lt;i&gt;Rong Yan, Jelena Tesic, and John Smith&lt;o:p&gt;&lt;/o:p&gt;&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;i&gt;&lt;span lang="EN-US"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/span&gt;&lt;/i&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;b&gt;&lt;span lang="EN-US"&gt;Support Feature Machine for Classification of Abnormal Brain Activity &lt;/span&gt;&lt;/b&gt;&lt;span lang="EN-US"&gt;- &lt;i&gt;W. Art Chaovalitwongse, Ya-Ju Fan, and Rajesh Sachdeo&lt;/i&gt;&lt;/span&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6721375037299350145?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6721375037299350145/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6721375037299350145' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6721375037299350145'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6721375037299350145'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/07/sigir-kdd-papers.html' title='SIGIR &amp; KDD papers'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8174898637133705682</id><published>2007-05-27T01:13:00.000-06:00</published><updated>2007-05-27T01:23:32.580-06:00</updated><title type='text'>ICML 07 proceeding is online now</title><content type='html'>&lt;a href="http://oregonstate.edu/conferences/icml2007/paperlist.html"&gt;http://oregonstate.edu/conferences/icml2007/paperlist.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Here are some interesting papers:&lt;br /&gt;&lt;p class="MsoNormal"&gt;&lt;span style="background: yellow none repeat scroll 0% 50%; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;"&gt;Kernel Selection:&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Kernel Selection for Semi-Supervised Kernel Machines &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/237.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/237.pdf"&gt;[Paper]&lt;/a&gt; &lt;/p&gt;  &lt;p class="MsoNormal"&gt;Learning Nonparametric Kernel Matrices from Pairwise Constraints&lt;br /&gt;&lt;/p&gt;&lt;table&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;  &lt;br /&gt;&lt;/td&gt;&lt;td colspan="2"&gt;&lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/537.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/537.pdf"&gt;[Paper]&lt;/a&gt;&lt;/td&gt;&lt;/tr&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;More Efficiency in Multiple Kernel Learning &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/148.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/148.pdf"&gt;[Paper]&lt;/a&gt;  &lt;p class="MsoNormal"&gt;Multiclass Multiple Kernel Learning &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/372.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/372.pdf"&gt;[Paper]&lt;/a&gt;&lt;b style=""&gt;&lt;o:p&gt;&lt;/o:p&gt;&lt;/b&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="background: yellow none repeat scroll 0% 50%; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;"&gt;MTL and Transfer Learning:&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Uncovering Shared Structures in Multiclass Classification &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/229.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/229.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Discriminative Learning for Differing Training and Test Distributions &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/303.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/303.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Boosting for Transfer Learning &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/72.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/72.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Learning a Meta-Level Prior for Feature Relevance from Multiple Related Tasks &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/559.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/559.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Multi-Task Learning for Sequential Data via iHMMs and the Nested Dirichlet Process &lt;span style=""&gt; &lt;/span&gt;&lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/170.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/170.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Self-taught Learning: Transfer Learning from Unlabeled Data &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/515.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/515.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;The Matrix Stick-Breaking Process for Flexible Multi-Task Learning &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/255.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/255.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Asymptotic Bayesian Generalization Error When Training and Test Distributions Are Different &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/124.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/124.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Robust Multi-Task Learning with $t$-Processes &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/378.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/378.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="background: yellow none repeat scroll 0% 50%; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;"&gt;Relational Learning and structured prediction&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Relational Clustering by Symmetric Convex Coding &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/188.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/188.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Fast and Effective Kernels for Relational Learning from Texts &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/461.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/461.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Exponentiated Gradient Algorithms for Log-Linear Structured Prediction &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/472.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/472.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="background: yellow none repeat scroll 0% 50%; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;"&gt;Ranking:&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Learning Random Walks to Rank Nodes in Graphs &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/562.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/562.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;span style="background: yellow none repeat scroll 0% 50%; -moz-background-clip: -moz-initial; -moz-background-origin: -moz-initial; -moz-background-inline-policy: -moz-initial;"&gt;Feature Selection:&lt;/span&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Supervised Feature Selection via Dependence Estimation &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/244.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/244.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Feature Selection in Kernel Space &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/16.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/16.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Minimum Reference Set Based Feature Selection for Small Sample Classifications &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/160.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/160.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;Spectral Feature Selection for Supervised and Unsupervised Learning &lt;a href="http://oregonstate.edu/conferences/icml2007/abstracts/444.htm"&gt;[Abstract]&lt;/a&gt;&lt;a href="http://www.machinelearning.org/proceedings/icml2007/papers/444.pdf"&gt;[Paper]&lt;/a&gt;&lt;/p&gt;  &lt;p class="MsoNormal"&gt;&lt;o:p&gt; &lt;/o:p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8174898637133705682?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8174898637133705682/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8174898637133705682' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8174898637133705682'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8174898637133705682'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/05/icml-07-proceeding-is-online-now.html' title='ICML 07 proceeding is online now'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-5260753709423499070</id><published>2007-05-21T10:02:00.000-06:00</published><updated>2007-05-21T10:07:22.612-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Intern Interview'/><title type='text'>MSR internship offer</title><content type='html'>I just received an offer from MSR for this summer intern. But I decided to reject it since it's a more application oriented intern. Maybe, staying in school can help produce more works. That's more hope and belief.&lt;br /&gt;&lt;br /&gt;BTW: I just checked with Martha and found that we can use CPT twice. Anyhow, this is a good news for ASU student. I'm not sure whether or not my complaint makes the difference. Again, do things proactively can always makes future better.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-5260753709423499070?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/5260753709423499070/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=5260753709423499070' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/5260753709423499070'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/5260753709423499070'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/05/msr-internship-offer.html' title='MSR internship offer'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-3567125087507153432</id><published>2007-05-08T00:23:00.000-06:00</published><updated>2007-05-08T00:44:51.648-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Research'/><title type='text'>Crafting papers in machine learning</title><content type='html'>Pat is like a tutor. &lt;br /&gt;&lt;br /&gt;http://lyonesse.stanford.edu/~langley/papers/craft.ml2k.ps&lt;br /&gt;&lt;br /&gt;I just encountered Pat's interesting paper on how to write a machine learning paper. Not sure whether or not he is a little regretful  about emphasizing the experimental evaluation  too much in the paper. But I think most of the general guide lines are still reasonable.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-3567125087507153432?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/3567125087507153432/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=3567125087507153432' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3567125087507153432'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3567125087507153432'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/05/crafting-papers-in-machine-learning.html' title='Crafting papers in machine learning'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6262199818525470536</id><published>2007-05-04T02:04:00.000-06:00</published><updated>2007-05-04T02:06:02.784-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='linux'/><title type='text'>linux trick</title><content type='html'>I just found the rdesktop tool in linux is better to remote access windows system. Especially in the full-screen mode.&lt;br /&gt;&lt;br /&gt;One trick to quit full screen mode:&lt;br /&gt;Ctrl+Alt+Enter&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6262199818525470536?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6262199818525470536/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6262199818525470536' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6262199818525470536'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6262199818525470536'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/05/linux-trick.html' title='linux trick'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-971025336411073356</id><published>2007-05-01T00:13:00.000-06:00</published><updated>2007-05-04T02:06:19.634-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='misc'/><category scheme='http://www.blogger.com/atom/ns#' term='linux'/><title type='text'>Windows Backup &amp; Norton Ghost</title><content type='html'>After my system crashed, I have to reinstall a windows system on my laptop, which is horrible. It takes almost three days to finish all the installation and configuration process. Suffering from reinstall system, I decided to try to use Norton Ghost to backup the system to make my life easier. Unfortunately,  GHOST requires a floppy drive to make sure your system is bootable when crashed.   More serious, I installed a dual-boot system on my laptop (ubuntu7.04+windows xp pro (sp2)) with the first parition for windows being NTFS.  So the MBR of windows is actually modified by linux, and ghost can not find the proper MBR infor for windows. To backup linux system is easy, just compress all the stuff into one tar file would be OK. &lt;br /&gt;&lt;br /&gt;So I'm wondering whether or not I can backup the whole hard disk under linux system. But the problem is,  even if I can backup the windows partition in linux, I cannot restore it as Ubuntu 7.04 does not support modification on NTFS. Haha, really frustrated with this backup research work.&lt;br /&gt;&lt;br /&gt;It seems that linux backup is way too much easier than windows. But I'm still a newbie to linux. Maybe, after sometime, I can get rid of windows.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-971025336411073356?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/971025336411073356/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=971025336411073356' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/971025336411073356'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/971025336411073356'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/05/windows-backup-norton-ghost.html' title='Windows Backup &amp; Norton Ghost'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-1642237827479434605</id><published>2007-04-03T16:13:00.000-06:00</published><updated>2007-04-03T16:15:56.434-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Progress'/><title type='text'>Feedback of ICML</title><content type='html'>"I think the paper is techincally sound, but I think that it is quite narrow in&lt;br /&gt;scope, and am not sure it will be of great general interest. There are only few&lt;br /&gt;and unsurprising conclusions, and moreover the results seem to indicate that&lt;br /&gt;there is an interacation between IN/OUT and the specific FS algorithms used.&lt;br /&gt;Focussing so strongly on toy data, is also a considerable worry. I recommend&lt;br /&gt;rejection. "&lt;br /&gt;&lt;br /&gt;It seems that this year is not my "lucky" year. This is the same situation for my intern applications. Getting so many interviews(7 interviews), but none of them makes an offer. &lt;br /&gt;&lt;br /&gt;Anyway, just keep going!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-1642237827479434605?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/1642237827479434605/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=1642237827479434605' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1642237827479434605'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1642237827479434605'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/04/feedback-of-icml.html' title='Feedback of ICML'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-9057998695964824987</id><published>2007-03-26T16:32:00.000-06:00</published><updated>2007-03-26T17:01:13.206-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='personal life'/><category scheme='http://www.blogger.com/atom/ns#' term='text categorization'/><category scheme='http://www.blogger.com/atom/ns#' term='Feature Selection'/><category scheme='http://www.blogger.com/atom/ns#' term='Transfer Learning'/><title type='text'>Several Future directions</title><content type='html'>While reckon over the future directions to go, I think the following problems might be interesting concerning text categorization.&lt;br /&gt;&lt;br /&gt;1. Large-number of categories, multi-label classification problem.  Typically, a hierarchy is employed to dissect the problem, thus reduce to learning with structured output.&lt;br /&gt;&lt;br /&gt;2. "Dirty" text categorization. Typical text categorization requires the features to be clean, such as newswire articles, paper abstract etc. However, current fashion extends to "dirty" texts, such as notes(spell errors),  prescription(lots of abbreviations)  customer telephone log (usually with noisy, contradicting facts). Another example is email spam filtering.  Currently, most of spams consists of images rather than just text. However, existing OCR techniques can not extract the characters very correctly. Hence, the final words/terms obtained might not a "proper" feature. Hence, some techniques are required to transform a "image-derived" word into a word in the dictionary.  Such kind of transformation can be done via some algorithm like shortest-path algorithm.  However, when the spammer add noise in purpose in the text within images, this problem seems more complicated.  Is it possible to automatically learn feature similarity? How to extract useful similarities measure between these noisy vectors? How to derive a useful kernel? So, this problem actually is related to feature extraction, kernel learning, and robust learning and uncertainty.&lt;br /&gt;&lt;br /&gt;3.  Event detection and concept drift. I believe such kind of directions has more promising effect. I think the difficulty lays mainly on the lack of benchmark data set. But with the development of Web 2.0, this kind of problem should gain some attention in the future.&lt;br /&gt;&lt;br /&gt;4.  Ambiguous label problem. I really doubt the existence of small sample in text classification. Seems labellings some documents requires very little human labor. Now, some websites already provides some schemes for users to tag some web pages and blog posts. How to effectively employ the tag info seems to be missing in current work. When I tried delicious, only some key-word matching are performed. How to organize the text into more sensible way?&lt;br /&gt;&lt;br /&gt;5. "Universal" text classification. As so many benchmark data sets are online, can we any how use all of them. This might be related to transfer learning. At least, the benchmark data can serve to provide a common prior for the target classification task. But can we extract more? Human beings can immediately classify the documents given very few examples. Existing transfer learning (most actually are doing MTL), in nature, is doing dimensionality reduction. How to related the features of different domains? Is it possible to extract the "structural" information? Zhang Tong's work talks about that, but it actually focus more on semi-supervised learning. &lt;br /&gt;&lt;br /&gt;6. Sentimental classification/author gender identification/ genre identification. Such kind of problems requires new feature extraction techniques.&lt;br /&gt;&lt;br /&gt;Some other concerns:&lt;br /&gt;&lt;br /&gt;Feature selection for text categorization? As far as I can see, I do not think this direction will provide more interesting result.  It works, and efficient. It can be used as a preprocessing step to reduce the computation burden. But some complicated methods (such as kernel learning) can be used to do a better job.&lt;br /&gt;&lt;br /&gt;Active Learning, a greedy method can works fine enough. &lt;br /&gt;&lt;br /&gt;Clustering, not making any sense to me. But as for simple text, clustering might show some potential impact. I believe clustering should be "customized". Different user will ask different clustering results. It seems more interesting to do clustering given some prespecified parameters. Clustering of multi-labels under concept drift can also be explored.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-9057998695964824987?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/9057998695964824987/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=9057998695964824987' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/9057998695964824987'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/9057998695964824987'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/03/several-future-directions.html' title='Several Future directions'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-9078721498893270682</id><published>2007-03-22T22:56:00.000-06:00</published><updated>2007-03-22T22:59:17.331-06:00</updated><title type='text'>Leonhard Euler</title><content type='html'>"Leonhard Euler(1707-1783) A Swiss mathematician and physicist. &lt;br /&gt;......&lt;br /&gt;During the last 17 years of his life, he was almost totally blind, and yet he produced nearly half of his results during that period."&lt;br /&gt;&lt;br /&gt;(Copied from Pattern Recognition and Machine Learning, Page 465)&lt;br /&gt;&lt;br /&gt;I am wondering whether or not he has any chance to win the Fields Award:)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-9078721498893270682?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/9078721498893270682/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=9078721498893270682' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/9078721498893270682'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/9078721498893270682'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/03/leonhard-euler.html' title='Leonhard Euler'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-3542028556780485859</id><published>2007-03-22T12:07:00.000-06:00</published><updated>2007-03-22T12:12:38.950-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Progress'/><title type='text'>Where to Go?</title><content type='html'>Recently, I seems to lose the enthusiasm of research. No mood to read, to discuss, to work on experiments. I know this is not correct way to go, but I just couldn't help become anxious.  I would rather there's a problem let me to just jump in.&lt;br /&gt;&lt;br /&gt;I don't know whether this is related to the frustration of recent progress. Too slow.&lt;br /&gt;&lt;br /&gt;To be frank, I really doubt current development of machine learning field. Too fast. Each year, in the top level conference, there are lots of publications, but very few of them worth reading. Lots of them are just rubbish. Is this common in other fields? &lt;br /&gt;&lt;br /&gt;I am wondering which direction should be a correct way to go. Searching in blind is really difficult:(&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-3542028556780485859?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/3542028556780485859/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=3542028556780485859' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3542028556780485859'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3542028556780485859'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/03/where-to-go.html' title='Where to Go?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8916124829747790001</id><published>2007-03-02T13:36:00.000-07:00</published><updated>2007-03-02T13:57:20.621-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='AI'/><title type='text'>Human Intelligence and Current Artificial Intelligence</title><content type='html'>&lt;span style="color: rgb(51, 0, 153);"&gt;I've take &lt;/span&gt;&lt;a style="color: rgb(51, 0, 153);" href="http://www.isle.org/%7Elangley/"&gt;Pat Langley&lt;/a&gt;&lt;span style="color: rgb(51, 0, 153);"&gt;'s cognitive system class this semester.   &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(51, 0, 153);"&gt;My feeling of intelligence is rather vague. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(51, 0, 153);"&gt;What's intelligence?  If a system is complicated enough, is that intelligent? What should be a component of an intelligence?  Like Deep Blue, is that intelligence. I guess most people won't vote for intelligence. Now, people are trying to develop a  meta-game player, which can automatically train itself as long as the rules of the game are given. Is that intelligent? No. Human beings are actually open-minded. Yep, machines can outperform human beings in one field, but they can never be "artificial intelligence". That's why I feel AI would never come true in my life.  But I agree with Pat that more focus should be on the nature of mind.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(51, 0, 153);"&gt;For a long time, I've been thinking that the brain and a machine's difference is mainly due to the hardware difference (brain's processing speed is low, memory is limited, but highly parallel). Like playing chess, planning, scheduling, while machines can do an exhaustive search, or deeper search, human beings try to prune a lot of branches in each step (I  guess there should be some pattern recognition involved). It's more like beam-search, as I think. The problem for efficient processing is how to extract those useful patterns.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(51, 0, 153);"&gt;Until recently, Pat mentioned this paper:&lt;/span&gt;&lt;a style="color: rgb(51, 0, 153);" href="http://psychclassics.yorku.ca/Miller/"&gt;&lt;br /&gt;The magic of 7 +/- 2&lt;/a&gt;&lt;br /&gt;&lt;span style="color: rgb(51, 0, 153);"&gt;which tries to understand why human beings did a better job to remember 7 digits more easily than 4 or more digits(This is partly the reason why telephone digits are around 7). &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(51, 0, 153);"&gt;Based on information theory, the more digits we have, the more information they contains, thus require more bits to represent the information. But for human beings, this is not the case. This phenomenon is really interesting, and justifies the study for human brains. I guess Pat didn't realize this example is so exciting to me:)&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8916124829747790001?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8916124829747790001/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8916124829747790001' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8916124829747790001'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8916124829747790001'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/03/human-intelligence-and-current.html' title='Human Intelligence and Current Artificial Intelligence'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-887840569979438602</id><published>2007-03-01T10:59:00.000-07:00</published><updated>2007-03-01T11:03:23.020-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Progress'/><title type='text'>KDD submisssion is done</title><content type='html'>Nitin and I coauthored one paper talking about finding influentials in blogosphere. It's an interesting problem. the method seems a little ad hoc. But we finally made it. &lt;br /&gt;&lt;br /&gt;Through a long distance communication (that's the wonder of Web), Nitin is working in India while Dr.Liu and I were working in the united states. The final version seems much better. But whether or not this work can be accepted is really depending on our luck. Part of my concern the technical part is really subjective.  &lt;br /&gt;&lt;br /&gt;I don't know whether the reviewer is obsessed with cross validation. &lt;br /&gt;&lt;br /&gt;Let's see.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-887840569979438602?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/887840569979438602/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=887840569979438602' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/887840569979438602'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/887840569979438602'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/03/kdd-submisssion-is-done.html' title='KDD submisssion is done'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-2937602216353943789</id><published>2007-02-13T13:51:00.000-07:00</published><updated>2007-02-10T14:07:05.097-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Intern Interview'/><title type='text'>No hope for this google interview</title><content type='html'>Just done with the google interview. I believe this this the worst case I've ever done.&lt;br /&gt;It's so bad. The interviewee is a Chinese, but we still talk in English.&lt;br /&gt;&lt;br /&gt;First question ask me to write a SQL query. I haven't touch database for a long time. So after 10 mins, I still can't come up with a good answer. &lt;br /&gt;&lt;br /&gt;Second question: how to implement a merge sort? write the pseudo code. Then, I ask him whether or not I can use recursion. He questioned: without recursion, how do you do that? (Stupid, why not just give the answer with recursion). What's the time complexity and space complexity. I know time complexity is O(nlgn) but space complexity is not sure. I finally figure out the correct solution should be O(n).  I guess he already  kicked me out of his candidate list based on my performance.&lt;br /&gt;&lt;br /&gt;3rd question: 10G large array data, 2G RAM, how do you sort them? I know this question should be merge sort? But how many data should you load each time into the memory? &lt;br /&gt;I answered you can split an array of size n into 2 parts, 3 parts or even more, and then do a combination. So for 2 parts, you need  a working memory of (n/2), 3 parts, you need a working memory of (n/3). but what should be the optimal number?  No idea.&lt;br /&gt;&lt;br /&gt;Too nervous, cannot think any more. And he mentioned we were out of time several times.&lt;br /&gt;&lt;br /&gt;4. it's about the project. How to solve their problem. &lt;br /&gt;&lt;br /&gt;He asked me if I have any questions, but I already gave up. So just ask him some normal questions. &lt;br /&gt;&lt;br /&gt;Really need to know the algorithm and structure book well. But too busy to prepare the interview. Really shame on myself.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-2937602216353943789?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/2937602216353943789/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=2937602216353943789' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2937602216353943789'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2937602216353943789'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/02/no-hope-for-this-google-interview.html' title='No hope for this google interview'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-2931395493602015111</id><published>2007-02-10T14:02:00.000-07:00</published><updated>2007-02-10T14:06:07.337-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Research'/><category scheme='http://www.blogger.com/atom/ns#' term='papers'/><title type='text'>How to collobrate with others?</title><content type='html'>I just read &lt;a href="http://hunch.net"&gt;John's blog &lt;/a&gt;and found it is really very helpful. Thus, I just copy and past his stuff here.&lt;br /&gt;&lt;br /&gt;The full article can be found via the link below:&lt;br /&gt;&lt;a href="http://hunch.net/?p=251"&gt;http://hunch.net/?p=251&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;2/10/2007&lt;br /&gt;Best Practices for Collaboration&lt;br /&gt;Filed under: Papers, Research — jl @ 1:51 pm&lt;br /&gt;&lt;br /&gt;Many people, especially students, haven’t had an opportunity to collaborate with other researchers. Collaboration, especially with remote people can be tricky. Here are some observations of what has worked for me on collaborations involving a few people.&lt;br /&gt;&lt;br /&gt;   1. Travel and Discuss Almost all collaborations start with in-person discussion. This implies that travel is often necessary. We can hope that in the future we’ll have better systems for starting collaborations remotely (such as blogs), but we aren’t quite there yet.&lt;br /&gt;   2. Enable your collaborator. A collaboration can fall apart because one collaborator disables another. This sounds stupid (and it is), but it’s far easier than you might think.&lt;br /&gt;         1. Avoid Duplication. Discovering that you and a collaborator have been editing the same thing and now need to waste time reconciling changes is annoying. The best way to avoid this to be explicit about who has write permission to what. Most of the time, a write lock is held for the entire document, just to be sure.&lt;br /&gt;         2. Don’t keep the write lock unnecessarily. Some people are perfectionists so they have a real problem giving up the write lock on a draft until it is perfect. This prevents other collaborators from doing things. Releasing write lock (at least) when you sleep, is a good idea.&lt;br /&gt;         3. Send all necessary materials. Some people try to save space or bandwidth by not passing ‘.bib’ files or other auxiliary components. Forcing your collaborator to deal with the missing subdocument problem is disabling. Space and bandwidth are cheap while your collaborators time is precious. (Sending may be pass-by-reference rather than attach-to-message in most cases.)&lt;br /&gt;         4. Version Control. This doesn’t mean “use version control software”, although that’s fine. Instead, it means: have a version number for drafts passed back and forth. This means you can talk about “draft 3″ rather than “the draft that was passed last tuesday”. Coupled with “send all necessary materials”, this implies that you naturally backup previous work.&lt;br /&gt;   3. Be Generous. It’s common for people to feel insecure about what they have done or how much “credit” they should get.&lt;br /&gt;         1. Coauthor standing. When deciding who should have a chance to be a coauthor, the rule should be “anyone who has helped produce a result conditioned on previous work”. “Helped produce” is often interpreted too narrowly—a theoretician should be generous about crediting experimental results and vice-versa. Potential coauthors may decline (and senior ones often do so). Control over who is a coauthor is best (and most naturally) exercised by the choice of who you talk to.&lt;br /&gt;         2. Author ordering. Author ordering is the wrong thing to worry about, so don’t. The CS theory community has a substantial advantage here because they default to alpha-by-author ordering, as is understood by everyone.&lt;br /&gt;         3. Who presents. A good default for presentations at a conference is “student presents” (or suitable generalizations). This gives young people a real chance to get involved and learn how things are done. Senior collaborators already have plentiful alternative methods to present research at workshops or invited talks.&lt;br /&gt;   4. Communicate by default Not cc’ing a collaborator is a bad idea. Even if you have a very specific question for one collaborator and not another, it’s a good idea to cc everyone. In the worst case, this is a few-second annoyance for the other collaborator. In the best case, the exchange answers unasked questions. This also prevents “conversation shifts into subjects interesting to everyone, but oops! you weren’t cced” problem.&lt;br /&gt;&lt;br /&gt;These practices are imperfectly followed even by me, but they are a good ideal to strive for.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-2931395493602015111?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/2931395493602015111/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=2931395493602015111' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2931395493602015111'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2931395493602015111'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/02/how-to-collobrate-with-others.html' title='How to collobrate with others?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-4309078574699645886</id><published>2007-02-09T11:53:00.000-07:00</published><updated>2007-02-05T21:29:19.027-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Progress'/><title type='text'>Done with the ICML paper</title><content type='html'>OK. Payam and I just finished the ICML paper. It's good. I believe it should be accepted unless we are really unlucky.&lt;br /&gt;&lt;br /&gt;Anyway, it's a good experience to work with Payam. I believe our cooperation makes the task much easier and efficient.&lt;br /&gt;&lt;br /&gt;Hope we can get good news soon. &lt;br /&gt;&lt;br /&gt;Need to rush for my KDD paper now:)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-4309078574699645886?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/4309078574699645886/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=4309078574699645886' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4309078574699645886'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4309078574699645886'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/02/done-with-icml-paper.html' title='Done with the ICML paper'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-1567751310079090751</id><published>2007-02-05T21:18:00.000-07:00</published><updated>2007-02-05T21:29:19.148-07:00</updated><title type='text'>How to test two data sets from the same distribution?</title><content type='html'>Suppose I have two data sets, how can I tell that they are from two distributions? what's the difference of these two data sets?&lt;br /&gt;&lt;br /&gt;This is actually very generic question emerged in machine learning.&lt;br /&gt;&lt;br /&gt;Some intuitive ideas:&lt;br /&gt;1. estimate the density of data samples for each data set. This method might be very weak. In reality, density estimation is a very difficult task compared with "simple classification". This approach is generally not application in reality.&lt;br /&gt;&lt;br /&gt;2. Estimate the sufficient statistics of each data set. Like the mean, variance of each feature, the class conditional distribution. This can be interpreted as analogy in cognition. An analogy can be derived if the relationship between multiple symbols can be maintained. The problem is when can we conclude that the difference is large enough? It seems some hypothesis testing is required.&lt;br /&gt;&lt;br /&gt;3. Transformation. A data set can be transformed into another data set. But how do you know the feature mapping? A more reasonable way is to enforce equivalence of sufficient statistics in a newly generated space.&lt;br /&gt;&lt;br /&gt;4. Dimensionality reduction.  Assume that two data set shared the same distribution on a fixed number of dimensions. By projecting the two data set into those dimensions, probably we can find some interesting patterns.&lt;br /&gt;&lt;br /&gt;5. Learn the classification function. Use the decision function to measure the difference. this is quite related to transfer learning.&lt;br /&gt;&lt;br /&gt;6. Any other ways? &lt;br /&gt;&lt;br /&gt;OK. To endeavor this direction, where can we find the data set? &lt;br /&gt;&lt;br /&gt;The approach of &lt;a href="http://www.dbs.informatik.uni-muenchen.de/~borgward/MMD/"&gt;Kernel Maximum Mean discrepancy&lt;/a&gt; might be related.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-1567751310079090751?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/1567751310079090751/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=1567751310079090751' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1567751310079090751'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1567751310079090751'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/02/how-to-test-two-data-sets-from-same.html' title='How to test two data sets from the same distribution?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6137401583574451190</id><published>2007-02-05T20:26:00.000-07:00</published><updated>2007-02-05T20:27:36.293-07:00</updated><title type='text'>最牛的博士论文！(转载)</title><content type='html'>最牛的博士论文！教授们紧张到恍惚以为自己才是答辩人&lt;br /&gt;&lt;br /&gt;　 1. 最牛博士论文就是在还没答辩之前已经发表在最好的期刊上，而且鉴于论文很长&lt;br /&gt;，该期刊必须像小说一样连载。&lt;br /&gt;&lt;br /&gt;　 实例：张五常博士论文《佃农理论》，当年在JLE上连载四期。&lt;br /&gt;&lt;br /&gt;2. 最牛博士论文答辩就是答辩人一直在挑战答辩委员会成员，直到问得这些教授们紧&lt;br /&gt;张到恍惚以为自己才是答辩人。&lt;br /&gt;&lt;br /&gt;　 实例：萨缪尔森的博士论文答辩结束后，答辩委员会成员之一的熊彼特（上世纪最&lt;br /&gt;伟大的经济学家之一）转过头去问另一位成员里昂剔夫（诺奖得主）：“瓦西里，我们&lt;br /&gt;通过了么？”&lt;br /&gt;&lt;br /&gt;3. 最牛投稿论文就是让编辑满世界都找不到一个能看懂这篇论文的匿名审稿人，最后&lt;br /&gt;只能发表，根本不需要修改的。&lt;br /&gt;&lt;br /&gt;实例：SIMS1971年发表在《数理统计年鉴》上的论文《无穷维参数空间中的分布滞后估&lt;br /&gt;计》。SIMS写完这篇论文后没投经济学杂志，因为他显然知道没人看的懂。于是投给了&lt;br /&gt;最牛B的数理统计杂志，结果编辑死活找不到审稿人，最后好不容易凑合拉来一个，审&lt;br /&gt;稿报告是这么写的：“我真的不明白这篇论文在说什么，但是我检验了其中的几个定理&lt;br /&gt;，好像是对的。所以我猜应该发表。”&lt;br /&gt;&lt;br /&gt;4. 最牛B的论文没必要长篇大论，千把字足以。实例：德布罗意是个花花公子贵族，本&lt;br /&gt;科是历史学专的，后来实在闲着无聊去读了5年博士，最后交的博士论文是一页纸，还&lt;br /&gt;涉嫌“抄袭”。&lt;br /&gt;&lt;br /&gt;答辩委员会气的都不想让他答辩。他的导师、著名物理学家朗之万感到很没面子，自己&lt;br /&gt;学生毕业不了真是耻辱，于是他鼓动了爱因斯坦一起帮着求情：让这小子过了吧，他老&lt;br /&gt;爸是法国内政部长，咱惹不起。那篇“垃圾”论文后来被薛定谔看到了，薛定谔看着这&lt;br /&gt;页论文苦思冥想了1个月，发表了量子力学里最重要的理论之一的薛定谔方程，薛定谔&lt;br /&gt;猫也成为最有趣的一只猫。&lt;br /&gt;&lt;br /&gt;德布罗意因这篇论文说阐述的观点获得了诺贝尔物理学奖。薛定谔凭借德布罗意的这篇&lt;br /&gt;论文对量子力学作出了杰出贡献，从一名普通而不得志的讲师一跃成为了一名伟大的科&lt;br /&gt;学家并获得了诺贝尔物理学奖。可以说，一篇1页纸的博士论文成就2个诺贝尔物理学奖&lt;br /&gt;可谓前无古人，估计也是后无来者。由此看来，最牛b的论文不必象张五常那样连载，&lt;br /&gt;一页A4的纸足以。不过我想德布罗意要是在中国读博士就惨了，论文因为字数太少，根&lt;br /&gt;本连答辩的资格都没有。&lt;br /&gt;&lt;br /&gt;  不得不说两句：德布罗意幼年即失去双亲，被他的哥哥莫里斯公爵（也是一名杰出的&lt;br /&gt;物理学家）一手养大的，在他1924年的著名博士论文之前一年，德布罗意就已连续发表&lt;br /&gt;三篇论文提出物质波的猜想，至于博士论文是几页纸，这个我还没考证过。&lt;br /&gt;&lt;br /&gt;  关于薛定谔：薛定谔多才多艺，会4种语言，出过诗集。另外他于1944年出版的《生&lt;br /&gt;命是什么》，吸引了一大批物理学家转向生物学研究。其中包括后来双螺旋的发现者沃&lt;br /&gt;森和克里克。所以说，这帮牛人并不一定像人们想象的那样传奇，也不能把其成功单纯&lt;br /&gt;的归结为偶然的因素。正所谓：牛者恒牛。&lt;br /&gt;--&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6137401583574451190?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6137401583574451190/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6137401583574451190' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6137401583574451190'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6137401583574451190'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/02/blog-post.html' title='最牛的博士论文！(转载)'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-3238858165515991676</id><published>2007-01-31T15:28:00.000-07:00</published><updated>2007-01-31T15:49:32.459-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Research'/><title type='text'>science research is not proof</title><content type='html'>Two days ago, we had a very nice discussion with &lt;a href="http://www.isle.org/~langley/"&gt;Pat Langley&lt;/a&gt;. &lt;br /&gt;Pat is more a cognitive scientist than a typical computer scientist, also very talkative:)&lt;br /&gt;&lt;br /&gt;Basically, I question him why he recommend &lt;span style="font-style:italic;"&gt;&lt;a href="http://www.sfu.ca/~jeffpell/Cogs300/RipsKnightsKnaves.pdf"&gt;one paper&lt;/a&gt;&lt;/span&gt; saying that it took longer time for human beings to reason about difficult puzzles. The authors use a forward chaining scheme to build their model. Based on the similar performance of the model and human beings, it concludes that it took longer time for human beings to solve difficult puzzles. &lt;br /&gt;&lt;br /&gt;OK. That paper is a little boring to me as the result is totally not surprising, or even too "obvious". Also, I really doubt human beings reasons like forward chaining or backward chaining.  At the first sight, it seems the result of the paper is totally not convincing to me though the result is obvious.&lt;br /&gt;&lt;br /&gt;So I asked him about the validality of this experiment set up.  Then, I got his question, except this way, what can you do?  Yep, I just post questions but forgot to think out of the box. What would I do if I am the researcher? You propose a model, then what you can do is to find "sufficient" evidence to support your claim. However, in most disciplines, this cannot be proved as mathematics. And even it's proved in someway, it might not work in reality. That's a common case in data mining, also in planning.  The disadvantage of deduction is you have to trust your premise 100%, if one of your assumption is wrong, you mess up. &lt;br /&gt;&lt;br /&gt;After reckon over that problem for a while, I have to sadly admit that's the only possible way or proper way to justify a new claim. So the problem is how to find "enough" evidence? &lt;br /&gt;&lt;br /&gt;It seems that our science research is very weak. Or maybe that's exactly the process of doing research. You propose a model to solve a problem, explain some phenomenon. Then people use one counter example to disprove your model. (Disproof is always much easier than proof, but this seems not the case for refutation in logic proof). Then, a new model or theory is proposed. &lt;br /&gt;&lt;br /&gt;You'll find that almost all science or engineering research are repeating the same cycle.&lt;br /&gt;&lt;br /&gt;When going back to machine learning experiment, I think I've already put too much belief to 10-fold cross validation. But actually, most of tasks have no correct evaluation method. Like information retrieval, data analysis. these tasks requires a reasonable good answer but not an optimal solution. &lt;br /&gt;&lt;br /&gt;Like the recent paper I coauthored with Payam for ICML, we found that two dramatically different feature selection evaluation methods turn out to be almost the same for comparing feature selection methods. There's actually no proof. But I believe it's an interesting research. &lt;br /&gt;&lt;br /&gt;Science Research is pragmatic.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-3238858165515991676?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/3238858165515991676/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=3238858165515991676' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3238858165515991676'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3238858165515991676'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/science-research-is-not-proof.html' title='science research is not proof'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6335470093989669836</id><published>2007-01-25T14:51:00.000-07:00</published><updated>2007-01-25T14:56:05.137-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Progress'/><title type='text'>Just finished the journal paper</title><content type='html'>I've just finished the journal paper for TKDD. &lt;br /&gt;This paper really takes a while to fill in. I even got puke for rewriting it. &lt;br /&gt;&lt;br /&gt;I guess I've looked through it at least 15 times. I really cannot bear to read it any more.&lt;br /&gt;&lt;br /&gt;I am wondering, to do research, should I put more effort to polish a paper or to think about the idea? &lt;br /&gt;&lt;br /&gt;I really hate spending so much time on writing the paper. i think it's not worthwhile.&lt;br /&gt;&lt;br /&gt;The ridiculous thing about current stage of machine learning and data mining is that all these papers tend to foster UNNECESSARY formula to make paper more easy to accept.&lt;br /&gt;&lt;br /&gt;That's not the goal for research. i would like more to motivate the problem well and present some simple, intuitive, sensible and working methods!!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6335470093989669836?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6335470093989669836/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6335470093989669836' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6335470093989669836'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6335470093989669836'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/just-finished-journal-paper.html' title='Just finished the journal paper'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-3065911210807162824</id><published>2007-01-21T22:08:00.000-07:00</published><updated>2007-03-22T12:15:03.520-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Transfer Learning'/><title type='text'>Some experiment result</title><content type='html'>I just finished some demo experiments. Originally, I wanted to find some toy example to show task selection's effect in transfer learning. Unfortunately, all the results I found are very disappointing.&lt;br /&gt;Let me summarize the results a little bit:&lt;br /&gt;(1) If the target task has very limited training data,  transfer learning do help a lot compared with single task learning.&lt;br /&gt;(2) The tasks selected make a very tiny difference (within 1% percent). Actually, it seems that combine all the tasks together is a very robust and reliable strategy for the data set I'm working on.&lt;br /&gt;(3) Combine all the data together seems always better than transfer learning.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;(1) is not surprising and has been approved by existing researchers.&lt;br /&gt;(2) can not justify task selection.&lt;br /&gt;(3) It seems that there's no difference between these tasks in the data set.&lt;br /&gt;&lt;br /&gt;Maybe, one interesting problem is to determine whether the data extracted from multiple sources are actually the same. But I feel that's a more difficult problem than task selection.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-3065911210807162824?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/3065911210807162824/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=3065911210807162824' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3065911210807162824'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3065911210807162824'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/some-experiment-result.html' title='Some experiment result'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8213055317718133567</id><published>2007-01-20T14:10:00.000-07:00</published><updated>2007-01-20T14:13:34.988-07:00</updated><title type='text'>Flexing Muscle, China Destroys Satellite in Test</title><content type='html'>This is great news for  Chinese!!&lt;br /&gt;&lt;br /&gt;http://www.nytimes.com/2007/01/19/world/asia/19china.html?n=Top%2fNews%2fWorld%2fCountries%20and%20Territories%2fChina&lt;br /&gt;&lt;br /&gt;Article Tools Sponsored By&lt;br /&gt;By WILLIAM J. BROAD and DAVID E. SANGER&lt;br /&gt;Published: January 19, 2007&lt;br /&gt;&lt;br /&gt;China successfully carried out its first test of an antisatellite weapon last week, signaling its resolve to play a major role in military space activities and bringing expressions of concern from Washington and other capitals, the Bush administration said yesterday.&lt;br /&gt;&lt;br /&gt;Only two nations — the Soviet Union and the United States — have previously destroyed spacecraft in antisatellite tests, most recently the United States in the mid-1980s.&lt;br /&gt;&lt;br /&gt;Arms control experts called the test, in which the weapon destroyed an aging Chinese weather satellite, a troubling development that could foreshadow an antisatellite arms race. Alternatively, however, some experts speculated that it could precede a diplomatic effort by China to prod the Bush administration into negotiations on a weapons ban.&lt;br /&gt;&lt;br /&gt;“This is the first real escalation in the weaponization of space that we’ve seen in 20 years,” said Jonathan McDowell, a Harvard astronomer who tracks rocket launchings and space activity. “It ends a long period of restraint.”&lt;br /&gt;&lt;br /&gt;White House officials said the United States and other nations, which they did not identify, had “expressed our concern regarding this action to the Chinese.” Despite its protest, the Bush administration has long resisted a global treaty banning such tests because it says it needs freedom of action in space.&lt;br /&gt;&lt;br /&gt;Jianhua Li, a spokesman at the Chinese Embassy in Washington, said that he had heard about the antisatellite story but that he had no statement or information.&lt;br /&gt;&lt;br /&gt;At a time when China is modernizing its nuclear weapons, expanding the reach of its navy and sending astronauts into orbit for the first time, the test appears to mark a new sphere of technical and military competition. American officials complained yesterday that China had made no public or private announcements about its test, despite repeated requests by American officials for more openness about its actions.&lt;br /&gt;&lt;br /&gt;The weather satellite hit by the weapon had circled the globe at an altitude of roughly 500 miles. In theory, the test means that China can now hit American spy satellites, which orbit closer to Earth. The satellites presumably in range of the Chinese missile include most of the imagery satellites used for basic military reconnaissance, which are essentially the eyes of the American intelligence community for military movements, potential nuclear tests and even some counterterrorism, and commercial satellites.&lt;br /&gt;&lt;br /&gt;Experts said the weather satellite’s speeding remnants could pose a threat to other satellites for years or even decades.&lt;br /&gt;&lt;br /&gt;In late August, President Bush authorized a new national space policy that ignored calls for a global prohibition on such tests. The policy said the United States would “preserve its rights, capabilities, and freedom of action in space” and “dissuade or deter others from either impeding those rights or developing capabilities intended to do so.” It declared the United States would “deny, if necessary, adversaries the use of space capabilities hostile to U.S. national interests.”&lt;br /&gt;&lt;br /&gt;The Chinese test “could be a shot across the bow,” said Theresa Hitchens, director of the Center for Defense Information, a private group in Washington that tracks military programs. “For several years, the Russians and Chinese have been trying to push a treaty to ban space weapons. The concept of exhibiting a hard-power capability to bring somebody to the negotiating table is a classic cold war technique.”&lt;br /&gt;&lt;br /&gt;Gary Samore, the director of studies at the Council on Foreign Relations, said in an interview: “I think it makes perfect sense for the Chinese to do this both for deterrence and to hedge their bets. It puts pressure on the U.S. to negotiate agreements not to weaponize space.”&lt;br /&gt;&lt;br /&gt;Ms. Hitchens and other critics have accused the administration of conducting secret research on advanced antisatellite weapons using lasers, which are considered a far speedier and more powerful way of destroying satellites than the weapons of two decades ago.&lt;br /&gt;&lt;br /&gt;The White House statement, issued by the National Security Council, said China’s “development and testing of such weapons is inconsistent with the spirit of cooperation that both countries aspire to in the civil space area.”&lt;br /&gt;&lt;br /&gt;An administration official who had reviewed the intelligence about China’s test said the launching was detected by the United States in the early evening of Jan. 11, which would have been early morning on Jan. 12 in China. American satellites tracked the launching of the medium-range ballistic missile, and later space radars saw the debris.&lt;br /&gt;&lt;br /&gt;The antisatellite test was first reported late Wednesday on the Web site of Aviation Week and Space Technology, an industry magazine. It said intelligence agencies had yet to “complete confirmation of the test.”&lt;br /&gt;&lt;br /&gt;The test, the magazine said, appeared to employ a ground-based interceptor that used the sheer force of impact rather than an exploding warhead to shatter the satellite.&lt;br /&gt;&lt;br /&gt;Dr. McDowell of Harvard said the satellite was known as Feng Yun, or “wind and cloud.” Launched in 1999, it was the third in a series. He said that it was a cube measuring 4.6 feet on each side, and that its solar panels extended about 28 feet. He added that it was due for retirement but that it still appeared to be electronically alive, making it an ideal target.&lt;br /&gt;&lt;br /&gt;“If it stops working,” he said, “you know you have a successful hit.”&lt;br /&gt;&lt;br /&gt;David C. Wright, a senior scientist at the Union of Concerned Scientists, a private group in Cambridge, Mass., said he calculated that the Chinese satellite had shattered into 800 fragments four inches wide or larger, and millions of smaller pieces.&lt;br /&gt;&lt;br /&gt;The Soviet Union conducted roughly a dozen antisatellite tests from 1968 to 1982, Dr. McDowell said, adding that the Reagan administration carried out its experiments in 1985 and 1986.&lt;br /&gt;&lt;br /&gt;The Bush administration has conducted research that critics say could produce a powerful ground-based laser weapon that would be used against enemy satellites.&lt;br /&gt;&lt;br /&gt;The largely secret project, parts of which were made public through Air Force budget documents submitted to Congress last year, appears to be part of a wide-ranging administration effort to develop space weapons, both defensive and offensive.&lt;br /&gt;&lt;br /&gt;The administration’s laser research is far more ambitious than a previous effort by the Clinton administration to develop an antisatellite laser, though the administration denies that it is an attempt to build a laser weapon.&lt;br /&gt;&lt;br /&gt;The current research takes advantage of an optical technique that uses sensors, computers and flexible mirrors to counteract the atmospheric turbulence that seems to make stars twinkle. The weapon would essentially reverse that process, shooting focused beams of light upward with great clarity and force.&lt;br /&gt;&lt;br /&gt;Michael Krepon, co-founder of the Henry L. Stimson Center, a group that studies national security, called the Chinese test very un-Chinese.&lt;br /&gt;&lt;br /&gt;“There’s nothing subtle about this,” he said. “They’ve created a huge debris cloud that will last a quarter century or more. It’s at a higher elevation than the test we did in 1985, and for that one the last trackable debris took 17 years to clear out.”&lt;br /&gt;&lt;br /&gt;Mr. Krepon added that the administration had long argued that the world needed no space-weapons treaty because no such arms existed and because the last tests were two decades ago. “It seems,” he said, “that argument is no longer operative.”&lt;br /&gt;&lt;br /&gt;Mark Mazzetti contributed reporting.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8213055317718133567?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8213055317718133567/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8213055317718133567' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8213055317718133567'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8213055317718133567'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/flexing-muscle-china-destroys-satellite.html' title='Flexing Muscle, China Destroys Satellite in Test'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8127582314616708173</id><published>2007-01-19T22:40:00.000-07:00</published><updated>2007-01-19T22:42:57.019-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Intern Interview'/><title type='text'>Rejected by Google</title><content type='html'>I thought  I could reject Google, but unfortunately Google rejected me!! :(&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style:italic;"&gt;&lt;br /&gt;We would like to thank you for your interest in Google.  After carefully reviewing your experience and qualifications, we have determined that we do not have a 'Software Engineering Intern' position available which is a strong match at this time.&lt;br /&gt;&lt;br /&gt;Thanks again for considering Google.  We wish you well in your endeavors&lt;br /&gt;and hope you might consider us again in the future.&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8127582314616708173?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8127582314616708173/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8127582314616708173' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8127582314616708173'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8127582314616708173'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/rejected-by-google.html' title='Rejected by Google'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-297155161969974204</id><published>2007-01-18T09:57:00.000-07:00</published><updated>2007-01-18T09:58:47.333-07:00</updated><title type='text'>Two TA work again!!!!</title><content type='html'>This semester, I have to work as TA for both AI and Data Mining Class again. So sad...&lt;br /&gt;I am wondering why I am always so unlucky?&lt;br /&gt;&lt;br /&gt;Just returned the new version of journal paper to boss. Feels really tired to make any change to that.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-297155161969974204?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/297155161969974204/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=297155161969974204' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/297155161969974204'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/297155161969974204'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/two-ta-work-again.html' title='Two TA work again!!!!'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-2006982578795433112</id><published>2007-01-17T13:37:00.001-07:00</published><updated>2007-02-13T14:03:39.943-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Intern Interview'/><title type='text'>Google Phone Interview (1st round)</title><content type='html'>(1st round)&lt;br /&gt;&lt;br /&gt;Why do you like google?&lt;br /&gt;What's the difference between process and threads?&lt;br /&gt;What's the difference between Java and C++?&lt;br /&gt;Tell me the basic concepts of object-oriented programming?&lt;br /&gt;How to implement mutiple inherience in Java? Why java use interface while C++ keeps feature of multiple inherience?&lt;br /&gt;&lt;br /&gt;Have you ever involved into any team project? what did you do?&lt;br /&gt;How do you handle the case that you have different opinions with manager?&lt;br /&gt;&lt;br /&gt;Then discuss about my research topic.&lt;br /&gt;&lt;br /&gt;How to find most common word in billions of documents? &lt;br /&gt;1. If memory allowed (hashtable). what's the time complexity?&lt;br /&gt;2. What if multiple machines? What's the bottleneck?&lt;br /&gt;3. What if just one machine and hashtable can not be stored in the memory?&lt;br /&gt;&lt;br /&gt;Then, I asked hime some general questions. &lt;br /&gt;The last task, he asked me to send code to him in 30 mintues. The task is write a function to transform a string into an integer.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-2006982578795433112?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/2006982578795433112/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=2006982578795433112' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2006982578795433112'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2006982578795433112'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/google-phone-interview-1st-round.html' title='Google Phone Interview (1st round)'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8232007846611857476</id><published>2007-01-17T13:14:00.000-07:00</published><updated>2007-01-17T13:39:29.998-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Intern Interview'/><title type='text'>Google Phone Interview(2nd round)</title><content type='html'>Just finished my interview with Google.  This time is an engineer, asking lots of detailed questions.&lt;br /&gt;&lt;br /&gt;He knows a lot about lisp, so we just discuss about the issues about lisp.  like&lt;br /&gt;what's the difference of lisp and other languages?&lt;br /&gt;which kind of lisp complier do you use?  emacs lisp vs. clisp?&lt;br /&gt;&lt;br /&gt;Then, some questions about operating system?&lt;br /&gt;What's the difference between process and thread? What kind of information does thread maintain? its own stack? heap?&lt;br /&gt;How and when  to do a context switch?  How do you handle an time slice interrupt?&lt;br /&gt;What are the possible pitfalls for multi-thread programming?&lt;br /&gt;&lt;br /&gt;How compiler works? &lt;br /&gt;Can regular expression resolve the problem of nested structures?&lt;br /&gt;Tell me something about grammar?&lt;br /&gt;Is type check done before or after parsing? &lt;br /&gt;(I did pretty bad in this session, so he finally stopped)&lt;br /&gt;&lt;br /&gt;Familiar with TCP/IP, RPC, network programming? (NO, skipped) &lt;br /&gt;&lt;br /&gt;Are you familiar with B-tree, red-black tree?  (No, so we switch to binary search tree)&lt;br /&gt;What's the time complexity of insertion or query in a binary search tree?  O(lg n)&lt;br /&gt;Worst case?  (O (n))&lt;br /&gt;How to transform a unbalanced tree into balanced tree? &lt;br /&gt;Are you familiar with TreeMap in Java?&lt;br /&gt;&lt;br /&gt;How hash table works? What if two object have the same key value?    Show me one example of hash function.&lt;br /&gt;What is the innate structure of a hashtable? (I said array)  How do you map a key value to an index?&lt;br /&gt;&lt;br /&gt;Finally, one technical question:&lt;br /&gt;Given a source word, a target word,  and a dictionary, how to transform the source word into target word by changing only one letter in each step.  The word you get in each step must be in the dictionary.&lt;br /&gt;&lt;br /&gt;Then, I asked him about some projects details.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8232007846611857476?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8232007846611857476/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8232007846611857476' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8232007846611857476'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8232007846611857476'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/google-phone-interview2nd-round.html' title='Google Phone Interview(2nd round)'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-8919200683887617139</id><published>2007-01-11T20:51:00.001-07:00</published><updated>2007-01-11T20:51:30.407-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Intern Interview'/><title type='text'>2nd round phone interview from google</title><content type='html'>good news.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-8919200683887617139?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/8919200683887617139/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=8919200683887617139' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8919200683887617139'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/8919200683887617139'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/2nd-round-phone-interview-from-google.html' title='2nd round phone interview from google'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-7224974363996745641</id><published>2007-01-03T13:04:00.000-07:00</published><updated>2007-01-03T13:09:31.514-07:00</updated><title type='text'>Desirable features of blog site</title><content type='html'>I just initiated two blogs, one in google blogspot, the other one in windows live space.  Compare these two blog site is kind of interesting.&lt;br /&gt;&lt;br /&gt;Windows Live:&lt;br /&gt;1. Better handling photos&lt;br /&gt;2. Template seems to be more beautiful.&lt;br /&gt;&lt;br /&gt;Google blogspot:&lt;br /&gt;&lt;br /&gt;1. Support group blog which is absent in windows live space.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Both sucks:&lt;br /&gt;1. Can not set the permission for a specific post;&lt;br /&gt;2. Can not change the layout and template freely, like change the background of template.&lt;br /&gt;&lt;br /&gt;I am wondering how could these two blog sites to be so successful. Is there any other good ones?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-7224974363996745641?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/7224974363996745641/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=7224974363996745641' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7224974363996745641'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/7224974363996745641'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/desirable-features-of-blog-site.html' title='Desirable features of blog site'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-2615597916047115890</id><published>2007-01-02T23:16:00.000-07:00</published><updated>2007-01-02T23:57:18.941-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Feature Selection'/><title type='text'>Redundant Features useful or not?</title><content type='html'>I just browsed one interesting paper published in ICML'06:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://people.csail.mit.edu/gamir/pubs/fdrop_camera_postfix.pdf"&gt;nightmare at test time: robust learning by feature deletion &lt;/a&gt;&lt;br /&gt;&lt;br /&gt;The motivation for the paper can be described as like buying stock.&lt;br /&gt;&lt;br /&gt;Suppose you have several stocks with the same risk, and you have $1000. What would you do?&lt;br /&gt;&lt;br /&gt;Of course, divide all the money evenly into these stocks should be more reliable than putting them into just one.&lt;br /&gt;&lt;br /&gt;This is the same situation for feature selection. Suppose you select some relevant features from training data, but it could be wrong due to small samples or noise or any kind of noise.&lt;br /&gt;&lt;br /&gt;In this process, you probably remove those redundant features as well.   From this point of view, it seems more robust to keep those redundant features rather than remove them.&lt;br /&gt;&lt;br /&gt;But from curse of dimensionality view,  redundant features should be removed.&lt;br /&gt;&lt;br /&gt;How to trade off redundancy and robustness?&lt;br /&gt;I guess this is highly related to the definition of redundancy.&lt;br /&gt;&lt;br /&gt;I'll comment on this issue more in future.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-2615597916047115890?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/2615597916047115890/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=2615597916047115890' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2615597916047115890'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2615597916047115890'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/redundant-features-useful-or-not.html' title='Redundant Features useful or not?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6600675057808146719</id><published>2007-01-01T13:53:00.000-07:00</published><updated>2007-01-01T13:58:08.855-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='information retrieval'/><title type='text'>Some interesting search engines</title><content type='html'>Some interesting search engines: &lt;a href="http://www.hakia.com/"&gt;Hakia&lt;/a&gt;, &lt;a href="http://www.powerset.com/"&gt;powerset&lt;/a&gt;, &lt;a href="http://www.snap.com/"&gt;snap&lt;/a&gt;&lt;br /&gt;there's &lt;a href="http://www.nytimes.com/2007/01/01/technology/01search.html?ei=5088&amp;en=5fb9c9a36e8ccb8d&amp;amp;ex=1325307600&amp;partner=rssnyt&amp;amp;emc=rss&amp;amp;pagewanted=all"&gt;an article&lt;/a&gt; in nytimes talking about these search engines.&lt;br /&gt;&lt;br /&gt;Will these be one of the future google?&lt;br /&gt;&lt;br /&gt;I used to be a google fan. But I just found that google maps sucks. I tried different maps for LA trip, and mapquest seems to do a much better job. Google map finally and always got me lost. To be sure, this is not the first time.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6600675057808146719?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6600675057808146719/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6600675057808146719' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6600675057808146719'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6600675057808146719'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2007/01/some-interesting-search-engines.html' title='Some interesting search engines'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-2032948313323262343</id><published>2006-12-22T14:12:00.000-07:00</published><updated>2006-12-22T14:13:23.361-07:00</updated><title type='text'>Travel to LA (23rd-27th)</title><content type='html'>&lt;span style="color: rgb(204, 255, 255);"&gt;More will come after I come back:) &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-2032948313323262343?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/2032948313323262343/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=2032948313323262343' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2032948313323262343'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/2032948313323262343'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2006/12/travel-to-la-23rd-27th.html' title='Travel to LA (23rd-27th)'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6588309595479348364</id><published>2006-12-19T15:49:00.000-07:00</published><updated>2006-12-21T11:46:33.966-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Application'/><category scheme='http://www.blogger.com/atom/ns#' term='Data Mining'/><title type='text'>Data Mining dead for effective counterterrorism?</title><content type='html'>Jeff Jonas has just posted one paper in &lt;a href="http://jeffjonas.typepad.com/jeff_jonas/2006/12/effective_count.html"&gt;his blog &lt;/a&gt;saying that data mining would fail to detect suspicious for counterterrorism. This kind of voice is really interesting to me.&lt;br /&gt;&lt;br /&gt;In his article, he mentioned the limitations of existing data mining techniques:&lt;br /&gt;&lt;br /&gt;1. typical data mining requires lots of data available to discover the pattern of terrorism. However, terrorism's frequency is so low that data mining would fail to detect it.&lt;br /&gt;&lt;br /&gt;2. the cost of false positive. If the number of population is large, suppose 1 billion people under monitored, even with a very high accuracy(say 99.9%), the false positive would result in 1,000,000 suspicious, which would be too costly to perform furthur investigation and tracking.&lt;br /&gt;&lt;br /&gt;After reading it, I just have several questions in mind:&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Is inductive data mining dead for this case?&lt;/li&gt;&lt;li&gt;Is deductive analysis more suitble for this case?&lt;/li&gt;&lt;li&gt;How to detect novelty? What is an "anomly"?&lt;/li&gt;&lt;li&gt;Terrorists will change their stratgy for next attack. Is it possible to find it?&lt;/li&gt;&lt;/ol&gt;&lt;p&gt;Instead of claiming the limitedness of data mining, I would cheer for the "dead" of statistical data mining. Machine learning, finally have to rethink about its correct direction. &lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6588309595479348364?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6588309595479348364/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6588309595479348364' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6588309595479348364'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6588309595479348364'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2006/12/data-mining-dead-for-effective.html' title='Data Mining dead for effective counterterrorism?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-4067252716995005313</id><published>2006-12-18T20:41:00.000-07:00</published><updated>2006-12-18T20:44:40.286-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Application'/><title type='text'>Data mining to detect perpetrators and accomplices</title><content type='html'>&lt;h2&gt;&lt;span style="font-size:85%;"&gt;&lt;span style="font-weight: normal;"&gt;One article seems interesting. In this article, the researchers tried to use data mining technique to detect perpetrators and accomplices during online auction.  Seems an interesting problem. But really difficult to obtain the data. I guess. Such kind of data always involves lots of private issues.&lt;/span&gt;&lt;/span&gt;&lt;br /&gt;&lt;/h2&gt;&lt;br /&gt;&lt;h2&gt;&lt;span style="font-size:85%;"&gt; Carnegie Mellon Researchers Uncover Online Auction Fraud; Data Mining Software Fingers Both Perpetrators and Accomplices&lt;/span&gt;&lt;/h2&gt;  &lt;p&gt;         PITTSBURGH, Dec. 5 (AScribe Newswire) -- Computer scientists at Carnegie Mellon University are using data mining techniques to identify perpetrators of fraud among online auction users as well as their otherwise unknown accomplices.  &lt;/p&gt;&lt;p&gt;         The new method analyzes publicly available histories of transactions posted by online auction sites such as eBay and identifies suspicious online behaviors and dubious associations among users.  &lt;/p&gt;&lt;p&gt;         Online auction sites are immensely popular. The largest, eBay, reported third quarter revenues of $1.449 billion, up 31 percent from the previous year, and registered 212 million users, up 26 percent. But the popularity of online auction sites also makes them a target for crooks.  Internet auction fraud, such as failure to deliver goods after a sale, accounted for almost two-thirds of the 97,000 complaints referred to law enforcement agencies last year by the federal Internet Crime Complaint Center.  &lt;/p&gt;&lt;p&gt;         Perpetrators of these frauds have distinctive online behaviors that cause them to be readily purged from an online auction site, said Computer Science Professor Christos Faloutsos. The software developed by his research team - Network Detection via Propagation of Beliefs, or NetProbe - could prevent future frauds by identifying their accomplices, who can lurk on a site indefinitely and enable new generations of fraudsters.  &lt;/p&gt;&lt;p&gt;         In a test analysis of about one million transactions between almost 66,000 eBay users, NetProbe correctly detected 10 previously identified perpetrators, as well as more than a dozen probable fraudsters and several dozen apparent accomplices.  &lt;/p&gt;&lt;p&gt;         "To the best of our knowledge, this is the first work that uses a systematic approach to analyze and detect electronic auction frauds," said Faloutsos, who noted that NetProbe could eventually be useful for both law enforcement and security personnel of online sites.  &lt;/p&gt;&lt;p&gt;         The researchers have already adapted the software to provide a trustworthiness score for individual user IDs. Though not yet available to the public, the NetProbe score would complement user reputation scores that many auction sites already provide to help prevent fraud.  &lt;/p&gt;&lt;p&gt;         "We want to help people detect potential fraud before the fraud occurs," said research associate Duen Horng "Polo" Chau, who developed NetProbe with Faloutsos, undergraduate student Samuel Wang and graduate student Shashank Pandit.  &lt;/p&gt;&lt;p&gt;         Many auction sites try to avert fraud with so-called reputation systems. In eBay's case, buyers can report whether they had a positive, neutral or negative experience in a transaction, and that report is then translated into a feedback score for that seller.  &lt;/p&gt;&lt;p&gt;         Unfortunately, a crook can manipulate these feedback scores, obtaining a favorable score by engaging in a number of legitimate sales. But that is costly and time-consuming and, once the fraudster starts cheating buyers, that user identification is quickly red-flagged and shut down.  &lt;/p&gt;&lt;p&gt;         Perpetrating frauds may be sustainable, however, if a fraudster has accomplices or sets up separate user IDs to serve as accomplices. These accomplice accounts conduct legitimate transactions and maintain good reputations. They also have many transactions with the user IDs of fraudsters, using their good reputations to boost the fraudsters' feedback scores. Because accomplices don't perpetrate frauds, they usually escape notice and can keep working to establish new fraudster accounts, Faloutsos said.  &lt;/p&gt;&lt;p&gt;         But an unnatural pattern becomes evident when the transactions are plotted as a graph, with each user represented as a node, or dot, and transactions between individual users represented by lines connecting the nodes.  &lt;/p&gt;&lt;p&gt;         In the resulting graph, transactions between accomplices and fraudsters create a pattern that sticks out like "a guiding light," Chau said. Graph theorists call this pattern a "bipartite core" - members of one group have lots of transactions with members of a second group, but don't have transactions with members of their own group. One group, the accomplices, also deals with honest eBay users, but most of the transactions are with fraudster groups.  &lt;/p&gt;&lt;p&gt;         The researchers tested their method, in part, by accumulating transaction histories from eBay and demonstrating that they could detect the distinctive fraud patterns within these massive data sets.  Chau reported on an analysis involving about 100 eBay users at a September data mining conference in Berlin. The team has since analyzed about a million transactions between almost 66,000 eBay users, and those as-yet unpublished findings have been submitted for presentation at an upcoming scientific conference.  &lt;/p&gt;&lt;p&gt;         "Crooks are extremely ingenious," Faloutsos warned, so identifying accomplices would not eliminate all online auction fraud. But eliminating accomplices would force crooks to resort to more sophisticated, complex schemes. "These schemes will require more effort and cost, so fraud would be increasingly unprofitable," he added.  &lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-4067252716995005313?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/4067252716995005313/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=4067252716995005313' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4067252716995005313'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/4067252716995005313'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2006/12/data-mining-to-detect-perpetrators-and.html' title='Data mining to detect perpetrators and accomplices'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-3814461433885347496</id><published>2006-12-18T10:47:00.000-07:00</published><updated>2006-12-18T11:14:47.859-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Transfer Learning'/><title type='text'>Domain adaptation &amp; learning from multiple sources</title><content type='html'>&lt;span style="color: rgb(204, 204, 255);"&gt;It seems that different people interpret transfer learning in different terms:&lt;/span&gt;&lt;br /&gt;&lt;span style="color: rgb(204, 204, 255);"&gt;Multitask learning, Domain Adaptation, learning from multiple sources, sample selection bias.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(204, 204, 255);"&gt;Here, I just tried to distinguish these terms and discuss about the difference.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="color: rgb(204, 204, 255);"&gt;Multitask learning has been studied a lot. Usually, the objective function is to learn a model for each task such that the overall performance is optimized. These models share some commonality.&lt;/span&gt;&lt;br /&gt;&lt;span style="color: rgb(204, 204, 255);"&gt;A very strong limitation of this method is that&lt;/span&gt;&lt;br /&gt;&lt;span style="color: rgb(204, 204, 255);"&gt;&lt;span style="color: rgb(255, 102, 102); font-weight: bold;font-size:130%;" &gt;overall performance !=  performance on one specific task.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;In my opinion,  MTL actually solves the learning problem in an indirect way. There are lots of issues involved: how to find similar tasks, which tasks should I trust, how much should I trust for each task,  what's the trade-off between data and tasks. All these problems requires additional knowledge from domain or data, which make it rather limited. Actually, MTL can only works if very few data is available for each task.  If there's no training data for one task,  MTL can not work.&lt;br /&gt;&lt;br /&gt;Domain adaptation,  is more like a transfer learning setting. Domain adaptation can be considered as involving two tasks: one is source domain(support task), one is target domain(target task).  This term is coined(as I see) from NLP community.   The goal is exactly the same as transfer learning, to improve the performance of target task.&lt;br /&gt;&lt;br /&gt;Learning from multiple sources, can be a little tricky. There are actually two interpretations: one assume that all the sources are drawn from the same underlying distribution, but with various level noise or uncertainty(If some additional information like the bound of the noise can be obtained, then it's possible to take advantage of it);  The other one assumes that different source have different underlying distribution.  But they are related.  Thus, the former is more like data selection, how to learn ONE model given some noisy data while the latter is exactly like Multitask learning (develope one model for each task(source)).&lt;br /&gt;&lt;br /&gt;Finally,  transfer learning can also be connected to sample selection bias as I mentioned in last post. How ever sample selection bias always deals with the case such that only one biased sample is available, how to obtain an unbiased model.  This situation is more like domain adaptation.  We can effectively apply the method in one field to the other.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-3814461433885347496?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/3814461433885347496/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=3814461433885347496' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3814461433885347496'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/3814461433885347496'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2006/12/domain-adaptation-learning-from.html' title='Domain adaptation &amp; learning from multiple sources'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-5296265074899143586</id><published>2006-12-15T16:47:00.000-07:00</published><updated>2006-12-16T23:54:29.473-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Transfer Learning'/><title type='text'>Sample selection bias = Multitask learning?</title><content type='html'>I just browsed one interesting paper:&lt;br /&gt;Dirichlet enhanced spam filtering based on biased samples. which pose a question to me:&lt;br /&gt;&lt;br /&gt;Is sample selection bias = multitask learning?&lt;br /&gt;&lt;br /&gt;As in this paper, each user's spam filtering can be considered as one task.&lt;br /&gt;&lt;br /&gt;However, there are two very &lt;span style="font-weight: bold;"&gt;"Strong"&lt;/span&gt;  assumptions:&lt;br /&gt;&lt;br /&gt;1.  The sample selection bias of publicly available data  and personalized data is still 0-1 consistent. That is P(x|public) !=0 then P(x|personal) !=0;&lt;br /&gt;&lt;br /&gt;2.  P(y|x, public) = P(y|x, personal).&lt;br /&gt;&lt;br /&gt;The 1st assumption is still OK, since most sample selection bias adopt this one. (Otherwise, I think there's no way to inference something you have no chance to know).&lt;br /&gt;&lt;br /&gt;The 2nd assumption is way too &lt;span style="font-weight: bold;"&gt;UNACCEPTABLE&lt;/span&gt; to me.  Given a message, some users might treat it as a ham, some might treat it as a spam.  &lt;span style="font-weight: bold;"&gt;How could this be possible to be equal?!&lt;br /&gt;&lt;/span&gt;(Is this because it's easy to analyze?)&lt;br /&gt;&lt;br /&gt;In my opinion, MTL can not be considered as a feature bias (in terms of sample selection bias), nor a label bias. A more general model should be a complete bias. That is,  P(s=1|x,y) cannot be decomposed into any simpler form.&lt;br /&gt;&lt;br /&gt;Actually, I am thinking whether  or not it's possible to learn P(s=1|x,y) (as in the paper I mentioned) if biased and unbiased samples are both provided.  OK. Let's start with the ideal case. Then a logistic regression learner can be adopted to estimate the bias. The trick here is treat (x, y) as input feature and s=1 or 0 as output.&lt;br /&gt;&lt;br /&gt;In MTL, only very few labeled data are provided for the unbiased setting. Can we estimate the density reliably?   The dilemma is that: we need "enough" data to improve the bias density estimation, but if we have "enough" data, we can already derive a very good model.&lt;br /&gt;&lt;br /&gt;Any other possible way to enhance the estimation?&lt;br /&gt;&lt;br /&gt;This paper adopt the Dirichlet process to improve the estimation. But why does it work? I couldn't figure it out.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-5296265074899143586?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/5296265074899143586/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=5296265074899143586' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/5296265074899143586'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/5296265074899143586'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2006/12/sample-selection-bias-multitask.html' title='Sample selection bias = Multitask learning?'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-6762332128447597723</id><published>2006-12-14T20:01:00.001-07:00</published><updated>2009-10-21T15:58:04.616-06:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='personal life'/><title type='text'>Finally done with course work</title><content type='html'>&lt;span style="font-family: arial;"&gt;Finally finished all the finals and TA work.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: arial;"&gt;Since this is the end of the semester, I would like to make a summary of the work I have done.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: arial;"&gt;OK, this year, I took 2 courses, one about &lt;/span&gt;&lt;span style="font-family: arial; font-weight: bold;"&gt;numerical linear algebra.&lt;/span&gt;&lt;span style="font-family: arial;"&gt; I think this class is pretty good. I finally do not have scare away from those stupid matrix and SVD stuff.  But unfortunately,  I didn't spend much time on this. I think the projects on this class are pretty cool. Especially the Arnoldi process. I didn't expect such a large sparse matrix can be handled by only few iterations. &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: arial;"&gt;The SVD stuff, LU decomposition, QR decomposition(finally I got a chance to show off in this Blog:) I think I won't use these much in future research work. But combined with convex optimization, it should help.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: arial;"&gt;Well... The 2nd course : &lt;/span&gt;&lt;span style="font-family: arial; font-weight: bold;"&gt;distributed operating system. &lt;/span&gt;&lt;span style="font-family: arial;"&gt;The instructor is a good talker and explains everything very well. Unfortunately,  I am an eager learner(not kNN--"lazy learner"). He speaks so slow that I couldn't help getting asleep?  Is this my fault? &lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: arial;"&gt;Anyway, it's good to learn the concept about lock/unlock, synchronization, multi threading, PVM, messger system, shared memory, net messenger server, RPC, lock server, distributed shared memory, distributed mutual exclusive/snapshot.  All the terms sound very cool... I did really bad in the final. But who cares?  (PHD do not care about course work. That becomes my quote now. hehe...)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: arial;"&gt;Finally, let's talk about the TA work. I am working as a TA for &lt;/span&gt;&lt;span style="font-family: arial; font-size: 85%;"&gt;&lt;span lang="EN-US"&gt;&lt;a href="http://rakaposhi.eas.asu.edu/rao.html"&gt;Subbarao Kambhampati. &lt;/a&gt;&lt;br /&gt;&lt;span style="font-size: 100%;"&gt;To be frank, this is the toughest TA work I've ever have. 4 homeworks (the last one has around 50 questions, which really drove me crazy!!!!) and 5 projects.  I was grading either the homework or the project nearly every weekend. So many stuff.&lt;br /&gt;&lt;br /&gt;But from another perspective, I've never understand AI concept so clear.  Rao connected agent design, search, planning, MDP, logic, Bayesian network, learning so well that I have to admit I learned much more by doing this TA work than taking a course. Actually, I don't think I learned anything really exciting when I took Dr. Liu's AI. Rao did a much better job.  But I don't like his projects. No challenge, though lots of students spent way too much time on lisp. Anyway, I would like to thank to this experience.   (I guess this is due to my gf. She changed me a lot!)&lt;br /&gt;&lt;br /&gt;But...... I have to work as TA again next semester, both AI and data mining. I want to kill someone.....&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;br /&gt;&lt;a href="http://rakaposhi.eas.asu.edu/rao.html"&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-6762332128447597723?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/6762332128447597723/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=6762332128447597723' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6762332128447597723'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/6762332128447597723'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2006/12/finally-done-with-course-work.html' title='Finally done with course work'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-385484053794930263</id><published>2006-12-14T19:50:00.000-07:00</published><updated>2006-12-21T13:46:17.520-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='papers'/><title type='text'>Some interesting papers from NIPS 2006</title><content type='html'>&lt;span style="font-weight: bold; color: rgb(204, 255, 255);font-family:arial;font-size:85%;"  &gt;NIPS'2006 has just released their online proceedings.&lt;br /&gt;&lt;a href="http://books.nips.cc/nips19.html"&gt;http://books.nips.cc/nips19.html&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Here are some interesting papers:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;------------------------------------------------------------&lt;br /&gt;Dirichlet-Enhanced Spam Filtering based on Biased Samples&lt;br /&gt;Steffen Bickel, Tobias Scheffer [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;Denoising and Dimension Reduction in Feature Space&lt;br /&gt;Mikio Braun, Joachim Buhmann, Klaus-Robert Müller [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Sparse Multinomial Logistic Regression via Bayesian L1 Regularisation&lt;br /&gt;Gavin Cawley, Nicola Talbot, Mark Girolami [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;Modeling General and Specific Aspects of Documents with a Probabilistic Topic Model&lt;br /&gt;Chaitanya Chemudugunta, Padhraic Smyth, Mark Steyvers [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;Learning from Multiple Sources&lt;br /&gt;Koby Crammer, Michael Kearns, Jennifer Wortman [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;Optimal Single-Class Classification Strategies&lt;br /&gt;Ran El-Yaniv, Mordechai Nisenson [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;Clustering Under Prior Knowledge with Application to Image Segmentation&lt;br /&gt;Mario Figueiredo, Dong Seon Cheng, Vittorio Murino [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;Data Integration for Classification Problems Employing Gaussian Process Priors&lt;br /&gt;Mark Girolami, Mingjun Zhong [ps.gz][pdf][bibtex][zip]&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Correcting Sample Selection Bias by Unlabeled Data&lt;br /&gt;Jiayuan Huang, Alex Smola, Arthur Gretton, Karsten M. Borgwardt, Bernhard Schölkopf [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;Cross-Validation Optimization for Large Scale Hierarchical Classification Kernel Methods&lt;br /&gt;Matthias Seeger [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;---------------------------------------------------------------------&lt;br /&gt;Interesting:&lt;br /&gt;Image Retrieval and Classification Using Local Distance Functions&lt;br /&gt;Andrea Frome, Yoram Singer, Jitendra Malik [ps.gz][pdf][bibtex][tgz]&lt;br /&gt;&lt;br /&gt;Branch and Bound for Semi-Supervised Support Vector Machines&lt;br /&gt;Olivier Chapelle, Vikas Sindhwani, Sathiya Keerthi [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Max-margin classification of incomplete data&lt;br /&gt;Gal Chechik, Geremy Heitz, Gal Elidan, Pieter Abbeel, Daphne Koller [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;Bayesian Ensemble Learning&lt;br /&gt;Hugh Chipman, Edward George, Robert McCulloch [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Map-Reduce for Machine Learning on Multicore&lt;br /&gt;Cheng-Tao Chu, Sang Kyun Kim, Yi-An Lin, YuanYuan Yu, Gary Bradski, Andrew Ng, Kunle Olukotun [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Hierarchical Dirichlet Processes with Random Effects&lt;br /&gt;Seyoung Kim, Padhraic Smyth [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;Ordinal Regression by Extended Binary Classification&lt;br /&gt;Ling Li, Hsuan-Tien Lin [ps.gz][pdf][bibtex]&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-385484053794930263?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/385484053794930263/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=385484053794930263' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/385484053794930263'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/385484053794930263'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2006/12/some-interesting-papers-from-nips-2006.html' title='Some interesting papers from NIPS 2006'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8369377483851107448.post-1060992035579370175</id><published>2006-12-14T19:48:00.000-07:00</published><updated>2006-12-14T19:49:42.558-07:00</updated><title type='text'>Commercement!!</title><content type='html'>OK. This is the 3rd time I tried to create a Blog.&lt;br /&gt;&lt;br /&gt;I am wondering why I cannot last for blog posting. Who knows?&lt;br /&gt;&lt;br /&gt;Hope this time, I can make it work:)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8369377483851107448-1060992035579370175?l=machinelearner.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://machinelearner.blogspot.com/feeds/1060992035579370175/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8369377483851107448&amp;postID=1060992035579370175' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1060992035579370175'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8369377483851107448/posts/default/1060992035579370175'/><link rel='alternate' type='text/html' href='http://machinelearner.blogspot.com/2006/12/commercement.html' title='Commercement!!'/><author><name>Lei</name><uri>http://www.blogger.com/profile/15170397378072423783</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
