<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
    xmlns:admin="http://webns.net/mvcb/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:content="http://purl.org/rss/1.0/modules/content/">

    <channel>
    
    <title>Stuff we can&apos;t be without &#45; Searchdaimon AS</title>
    <link>http://www.searchdaimon.com/</link>
    <description>From time to time we come across companies and websites that make our life easier, or are just freaking awesome.</description>
    <dc:language>en</dc:language>
    <dc:rights>Copyright 2011</dc:rights>
    <dc:date>2011-10-21T12:48:39+00:00</dc:date>
    <admin:generatorAgent rdf:resource="http://expressionengine.com/" />
    

    <item>
      <title>A beautiful filename, 251 char long</title>
      <link>http://www.searchdaimon.com/a_beautiful_filename_251_char_long/</link>
      <guid>http://www.searchdaimon.com/a_beautiful_filename_251_char_long/#When:12:48:39Z</guid>
      <description><![CDATA[<p>
A customer of us recently complained that he couldn't find a specific file, even when searching for word that he knows was in it. This is't a total uncommon question. Sometime the user don't have permission to the file, or the location is't indexed yet or some other problem. So our cto Runar Buvik asked what the name of the file was, so he could take a look.
</p>

<p>
- It's <b>Protocol_Amending_the_Agreements_Conventions_and_Protocols_on_Narcotic_Drugs _concluded_at_The_Hague_on_23_January_1912_at_Geneva_on_11_February_1925_and_19  _February_1925_and_13_July_1931_at_Bangkok_on_27_November_1931_and_at_Geneva_ on_26_June_1936.doc</b>.<br>
- Protocol_Amending_the_Agree...
</p>

<p>
Yes, the name was in fact <b>"Protocol_Amending_the_Agreements_Conventions_and_Protocols _on_Narcotic_Drugs_concluded_at_The_Hague_on_23_January_1912_at_Geneva_on_11_ February_1925_and_19 _February_1925_and_13_July_1931_at_Bangkok_on_27_November_ 1931_and_at_Geneva_on_26_June_1936.doc
"</b>. That 251 characters long! After some investigation it turned out that the underlying filesystem, ntfs, allow filename as long as 255 characters, but Windows refused to serve this file by SMB. Instead we got a "No such file or directory" error, even if opening the folder as a network share in Windows Explorer and clicking on the file.
</p>

<p>
There actual is such a treaty name according to <a href="http://en.wikipedia.org/wiki/Protocol_Amending_the_Agreements,_Conventions_and_Protocols_on_Narcotic_Drugs_concluded_at_The_Hague_on_23_January_1912,_at_Geneva_on_11_February_1925_and_19_February_1925,_and_13_July_1931,_at_Bangkok_on_27_November_1931_and_at_Geneva_on_26_June_1936">Wikipedia</a>, but that dos't mean that the file need to be named the same. Please keep you filenames below 128 characters people, or you will be in trouble sooner or later! 
<p/>

<p>Bdw, The ES supports filenames up to1024 characters. Longer then that, and is't probably just noise anyway.</p>]]></description> 
      <dc:subject></dc:subject>
      <dc:date>2011-10-21T12:48:39+00:00</dc:date>
      <dc:creator>Runar Buvik</dc:creator>
    </item>

    <item>
      <title>OpenMP, automatic threading</title>
      <link>http://www.searchdaimon.com/openmp_automatic_threading/</link>
      <guid>http://www.searchdaimon.com/openmp_automatic_threading/#When:14:27:50Z</guid>
      <description><![CDATA[<p>Tired of creating threes and writing code to manage deadlocks and work queues? Search is cpu intensive, and we uses a lot of threads. For example indexes are sorted in parallel, and the pages that go on the result page is fetched from the disk and processed in parallel.
We started out creating threads manually, but that i slow going in C.  We have now almost entirely changed to OpenMP, and haven't looked back since.</p>

<p>Initializing a large array in parallel is as easy as this.</p>

<p><span class="sourceCode1">int</span> main(<span class="sourceCode1">int</span> argc, <span class="sourceCode1">char</span> *argv[]) { <br />
  <span class="sourceCode1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;const int</span> N <span class="sourceCode8">=</span> 100000<span class="sourceCode8">;</span> <br />
  <span class="sourceCode1">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;int</span> i, a<span class="sourceCode8">[</span>N<span class="sourceCode8">];</span> <br />
  <br />
  <span class="sourceCode7">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;#pragma omp  parallel for </span><br />
  <span class="sourceCode8">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;for</span> (i <span class="sourceCode8">=</span> 0; i <span class="sourceCode8">&lt;</span> N; i<span class="sourceCode8">++</span>) <br />
  &nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;a<span class="sourceCode8">[</span>i<span class="sourceCode8">]</span> <span class="sourceCode8">=</span> 2 <span class="sourceCode8">*</span> i<span class="sourceCode8">; </span><br />
  <br />
  <span class="sourceCode8">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;return</span> 0<span class="sourceCode8">;</span> <br />
} <br />
<span style="font-size: 10px; ">Example from <a href="http://en.wikipedia.org/wiki/OpenMP">Wikipedia</a>.</span></p>


<p>OpenMP will decide how many threads to use.</p>
]]></description> 
      <dc:subject></dc:subject>
      <dc:date>2010-10-28T14:27:50+00:00</dc:date>
      <dc:creator>Runar Buvik</dc:creator>
    </item>

    <item>
      <title>The Regex Coach, interactive regex testing</title>
      <link>http://www.searchdaimon.com/the_regex_coach_interactive_regex_testing/</link>
      <guid>http://www.searchdaimon.com/the_regex_coach_interactive_regex_testing/#When:11:47:29Z</guid>
      <description><![CDATA[<p>We are using a lot of regular expressions her at Searchdaimon. Regex are used through Lex and Yacc to pars queries, pars html and to make the snippets on the result page. It is also heavily used to extract and validate data, tags and entropies in the crawlers.</p>

<img src="/images/sd_images/RegexCoach,.png" alt="" width="516" height="358">

<p>Her I am testing out a regex to extract email addresses and names from documents. The names and email addresses could then be added as attributes to the document, to enable filtering in the search results. Constructing regexs like this using only a text editor and relaying on try and fail won't be easy.</p>

<p>Link: <a href="http://www.weitz.de/regex-coach/">http://www.weitz.de/regex-coach/</a></p>
]]></description> 
      <dc:subject></dc:subject>
      <dc:date>2010-10-28T11:47:29+00:00</dc:date>
      <dc:creator>Runar Buvik</dc:creator>
    </item>

    <item>
      <title>Blekko, internet search engine</title>
      <link>http://www.searchdaimon.com/blekko_internet_search_engine/</link>
      <guid>http://www.searchdaimon.com/blekko_internet_search_engine/#When:10:52:14Z</guid>
      <description><![CDATA[<p>Start-up company Blekko have made some revolutionary innovation in the field of internet search.  Using their invention “slashtags” you can easy filter and sort your results. For example a search for “Apple Computers” gives you the results you would get in Google. But you can also add slastags to filter the results:</p>

<ul style="line-height: 80%">
<li><b>Apple Computers /shop</b> – Gives prizes and shopping opportunities</li>
<li><b>Apple Computers /history</b> – Gives pages about the history of apple</li>
<li><b>Apple Computers /finance</b> – Hits from forbes.com, businessweek.com</li>
<li><b>Apple Computers /date</b> – Newest pages first</li>
<li><b>http://www.apple.com/ /seo</b> – Graphs showing seo and link info</li>
</ul>

<p>We think slashtags is a great idea, and may change the way we use the web. Read more at <a href="http://searchengineland.com/blekko-a-new-search-engine-that-lets-you-spin-the-web-47215">http://searchengineland.com/blekko-a-new-search-engine-that-lets-you-spin-the-web-47215</a> or get a beta invite at <a href="http://blekko.com/">http://blekko.com/</a>.</p>

<p><a href="http://www.skrenta.com/2009/07/theres_no_such_thing_as_a_goog.html">There’s No Such Thing As A Google Killer</a>, but both Blekko and <a href="http://www.wolframalpha.com/">Wolfram|Alpha</a> have made great leap forward in the field of search technology, and may help to thin out Google’s dominance.</p>
]]></description> 
      <dc:subject></dc:subject>
      <dc:date>2010-10-28T10:52:14+00:00</dc:date>
      <dc:creator>Runar Buvik</dc:creator>
    </item>

    
    </channel>
</rss>
