<?xml version="1.0" encoding="utf-8" ?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">

    <title type="text">Searchdaimon Forum</title>
    <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/" />
    <link rel="self" type="application/atom+xml" href="http://www.searchdaimon.com/forum/atom/" />
    <updated>0</updated>
    <rights>Copyright (c) 2013</rights>
    <generator uri="http://expressionengine.com/" version="2.4.0">ExpressionEngine</generator>
    <id>tag:searchdaimon.com,2013:05:10</id>


    <entry>
      <title>Excellent Product!</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/705/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.705</id>
      <published>2013-05-09T21:47:06Z</published>
      <updated>0</updated>
      <author><name>jkwcape</name></author>
      <content type="html">
      <![CDATA[
        <p>I have been searching for MONTHS for a good intranet file search engine, and I am happy to report that yours is the best I have found after testing out many similar products and online services.</p>

<p>I have deployed your cloud-based solution on Amazon Web Services and thanks to you I am offering my client a multi-server full text index/file search service across multiple websites from one simple search form on their intranet. We are protecting some files using simple htaccess, and Searchdaimon crawled those protected files just fine. I also got it to log in and crawl our WordPress intranet site just fine as well but I am mostly using it to index protected and unprotected file directories in several locations. Nothing else I have found does the job we needed of indexing all the text inside protected documents.&nbsp; Your solution works for us, and I think it is simply the best one out there right now.</p>

<p>Nice job, Searchdaimon!</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Authentication problems when crawling</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/687/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.687</id>
      <published>2013-04-05T11:08:06Z</published>
      <updated>2013-04-05T11:08:37Z</updated>
      <author><name>radman</name></author>
      <content type="html">
      <![CDATA[
        <p>Hi,</p>

<p>I&#8217;m trying out SearchDaimon, with a view to using it for our intranet. I&#8217;ve got a VirtualBox installation working and can successfully crawl sites that don&#8217;t require authentication.</p>

<p>However, when I try to crawl our intranet - Which uses https:// and authentication, I just keep getting the error:</p>

<p>Can&#8217;t get url https://.../: 401 Unauthorized </p>

<p>(Note: ... contains the URL of our intranet, which I have removed before posting here)</p>

<p>I did see the following error in the crawling management log yesterday, but have not seen it since:</p>

<p>Noting &#123;500 Can&#8217;t locate object method &#8220;new&#8221; via package &#8220;LWP::Protocol::https::Socket&#8221;&#125; error at https://.../ </p>

<p>I wondered if the problem was related to a missing Perl-Crypt-SSLeay (as mentioned in <a href="http://www.searchdaimon.com/forum/viewthread/46/">http://www.searchdaimon.com/forum/viewthread/46/</a> ), but if I try to install the RPM I just see the message that it is already installed.</p>

<p>Here&#8217;s what I&#8217;ve done so far then - Within the collection settings page:</p>

<p>I&#8217;m using Fake ad as we only want to create a public search<br />
I&#8217;ve put the URL including https:// within the url text field</p>

<p>Finally, for authentication, I&#8217;ve tried username and password without a user prefix (our Active Directory name) and with a user prefix. Whichever way I do it, it still doesn&#8217;t work.</p>

<p>Our intranet is accessible via the internet, so isn&#8217;t configured to only use Integrated Windows Authentication - I can connect to it using any browser without any problems by just entering my username into the popup login box.</p>

<p>I don&#8217;t really know what else to try!</p>

<p>Any help would be much appreciated.</p>

<p>Thanks,</p>

<p>Adam.</p>


      ]]>
      </content>
    </entry>

    <entry>
      <title>Search result links are sdsmb:// and do not open</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/651/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.651</id>
      <published>2013-03-06T02:05:48Z</published>
      <updated>0</updated>
      <author><name>bechemeko</name></author>
      <content type="html">
      <![CDATA[
        <p>When I perform a search against my SMB collection, the results returned have links that start with sdsmb:// and when I click on any of the result links nothing happens. I read through the document posted on how to  add the chrome extension that allows file:// links to be open but all the results seem to be sdsmb:// links. Please advise.</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Control of the Crawl Process.</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/616/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.616</id>
      <published>2013-02-13T08:12:35Z</published>
      <updated>0</updated>
      <author><name>maxit</name></author>
      <content type="html">
      <![CDATA[
        <p>
1. When crawling a SharePoint site can certain pages or lists be restricted from the crawl?</p>

<p> 2.&nbsp; And at the opposite end can SD follow links on a web page or document and crawl that source too. </p>

<p>&nbsp;  &nbsp;  I realize that one could end up crawling the whole Internet that way . I have heard of people doing that with MS SharePoint Search only discover their SQL DB in the with massive Gigs of data.</p>


      ]]>
      </content>
    </entry>

    <entry>
      <title>The Crawl Process</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/615/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.615</id>
      <published>2013-02-13T08:01:07Z</published>
      <updated>0</updated>
      <author><name>maxit</name></author>
      <content type="html">
      <![CDATA[
        <p>
Just how does SD crawl a site ? Does it internally open each record or page &#8220;read&#8221; through it and index it to its database?&nbsp; is there any possible destructive or invasive effect on any data during the  crawled process?</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>VERY Slow crawl on a SharePoint Page</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/614/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.614</id>
      <published>2013-02-13T07:56:10Z</published>
      <updated>0</updated>
      <author><name>maxit</name></author>
      <content type="html">
      <![CDATA[
        <p> I am crawling a SharePoint site that has a page with an Audit log that keeps track of the transactions on that sub-site and its taking hours to crawl each Audit entry . The Audit entries very simple forms as below . What could be taking so long? </p>

<p>Example of each Audit log item: </p>

<p><br />
ListName &nbsp;  &nbsp;  &nbsp;  &nbsp;  &nbsp; Accounts<br />
Action &nbsp;  &nbsp;  &nbsp;  &nbsp;  &nbsp;  &nbsp;   Edit<br />
Field &nbsp;  &nbsp;  &nbsp;  &nbsp;  &nbsp;  &nbsp;  Name Source<br />
PreviousValue &nbsp;  &nbsp;  &nbsp;   Referral<br />
NewValue &nbsp;  &nbsp;  &nbsp;  &nbsp;  Other<br />
ItemID &nbsp;  &nbsp;  &nbsp;  &nbsp;  &nbsp;  &nbsp; 10,314<br />
ModifiedBy &nbsp;  &nbsp;  &nbsp;  &nbsp;   H2201V01\administrator<br />
ModifiedDateTime &nbsp;  &nbsp;  &nbsp;  11/9/2012 6:09 PM<br />
Ad Agency<br />
 temTitle<br /></p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Tag mismatch:</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/613/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.613</id>
      <published>2013-02-13T07:46:46Z</published>
      <updated>0</updated>
      <author><name>maxit</name></author>
      <content type="html">
      <![CDATA[
        <p>
&nbsp; I entered the number 1966 in the serch window and I got this result: Any ideas?<br />
 </p>

<p>XML parse error: Opening and ending tag mismatch: td line 244 and title Opening and ending tag mismatch: tr line 244 and result Opening and ending tag mismatch: table line 240 and search Premature end of data in tag snippet line 239 Premature end of data in tag description line 239 Premature end of data in tag result line 217 Premature end of data in tag search line 2 at /usr/lib64/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/XML/LibXML/SAX.pm line 64 at /usr/lib/perl5/vendor_perl/5.8.8/XML/Simple.pm line 370</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Cannot  crawl a sharepoint site</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/609/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.609</id>
      <published>2013-02-07T08:12:00Z</published>
      <updated>0</updated>
      <author><name>maxit</name></author>
      <content type="html">
      <![CDATA[
        <p>
I just cannot  crawl a sharepoint no matter what I use&#8212;even if I go the Internat route. </p>

<p>Here is what I keep getting as a result: ( Changed the address here) </p>

<p>NEWMedia  Not fully crawled Service description &#8216;http://server\administrator:M@$4448B@88.88.88.86/_vti_bin/Webs.asmx?WSDL&#8217; can&#8217;t be loaded: 400 Bad Request</p>

<p>Why is there a &#8220;http&#8221; before the domain or server name and the login name i this case Administrator ? </p>

<p>I have tired this a million different ways with different accounts - no go! But, I can reach it as normal SP site in my browser. </p>

<p>I also tired if from 3 different installs of SD&#8212;same result. Crawling other Servers works ok . </p>



<p>&nbsp;</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>Change from port 80.</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/608/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.608</id>
      <published>2013-01-28T10:03:48Z</published>
      <updated>0</updated>
      <author><name>maxit</name></author>
      <content type="html">
      <![CDATA[
        <p>
I would like to change from port 80 to some other port. How is this done ?</p>
      ]]>
      </content>
    </entry>

    <entry>
      <title>New Install and User Access, Shares. Time.&amp;nbsp; too</title>
      <link rel="alternate" type="text/html" href="http://www.searchdaimon.com/forum/viewthread/607/" />      
      <id>tag:searchdaimon.com,2013:forum/viewthread/.607</id>
      <published>2013-01-28T06:03:15Z</published>
      <updated>0</updated>
      <author><name>maxit</name></author>
      <content type="html">
      <![CDATA[
        <p>First great software! But, there are few issues: </p>

<p>1. Trying to run user access without AD looks to be impossible. I tried you FAKE AD but no such User system gets created, therefore no users can be added. I have to use public access. which I do not want to. I have some situations where the Sharepoint server is standalone, not a member of AD so it does a good job of crawling it but no private user access to crawled data. </p>

<p>2. What is the PUSH USER SYSTEM option. Does not seem to do anything? </p>

<p>2. Seems it does not crawl file shares from Windows 7&#8212;does ok on Windows 2003 . Will try 2008 RC next. </p>

<p>3. Any plans for a more Advanced Search options? </p>

<p>4. Where do change the time and date ? I live in North America Pacific Time bit the time is ahead by a few hours.</p>
      ]]>
      </content>
    </entry>


</feed>