Re: [smila-user] mysql crawler

Hello,

 

I followed your instructions (although I used a slightly modified schema) and indexed the contents of my database. It worked.

 

Have you looked at the job run data for the job you’re indexing into?

 

You can find this information by following the “url” link in the current run section of the response from http://localhost:8080/smila/jobmanager/jobs/indexUpate.

 

There you can see the number of records processed in each worker. It should look something like the following:

{
  "jobId" : "20111019-121322624389",
  "runMode" : "STANDARD",
  "startTime" : "2011-10-19T12:13:22.647+0200",
  "state" : "RUNNING",
  "workflowRuns" : {
    "activeWorkflowRunCount" : 0,
    "canceledWorkflowRunCount" : 0,
    "failedWorkflowRunCount" : 0,
    "startedWorkflowRunCount" : 1,
    "successfulWorkflowRunCount" : 1
  },
  "tasks" : {
    "canceledTaskCount" : 0,
    "createdTaskCount" : 2,
    "failedAfterRetryTaskCount" : 0,
    "failedWithoutRetryTaskCount" : 0,
    "obsoleteTaskCount" : 0,
    "retriedAfterErrorTaskCount" : 0,
    "retriedAfterTimeoutTaskCount" : 0,
    "successfulTaskCount" : 2
  },
  "worker" : {
    "bulkbuilder" : {
      "duration.iodata" : 9.239E-5,
      "duration.iodata.close" : 9.239E-5,
      "duration.perform.output" : 0.146584566,
      "duration.perform.output.insertedRecords" : 0.146584566,
      "endTime" : "2011-10-19T12:13:56.000+0200",
      "output.insertedRecords.dataObjectCount" : 1,
      "output.insertedRecords.duration" : 13.614,
      "output.insertedRecords.recordCount" : 5,
      "output.insertedRecords.size" : 657,
      "startTime" : "2011-10-19T12:13:42.375+0200",
      "successfulTaskCount" : 1
    },
    "pipelineProcessor" : {
      "warnCount" : 0,
      "duration" : 0.308177163,
      "duration.iodata" : 0.001329847,
      "duration.iodata.open" : 0.001329847,
      "duration.perform" : 0.306844517,
      "duration.perform.input" : 0.002095212,
      "duration.perform.input.input" : 0.002095212,
      "endTime" : "2011-10-19T12:13:56.517+0200",
      "input.input.dataObjectCount" : 1,
      "input.input.recordCount" : 5,
      "input.input.size" : 657,
      "startTime" : "2011-10-19T12:13:56.147+0200",
      "successfulTaskCount" : 1
    }
  },
  "jobDefinition" : {
    "name" : "indexUpate",
    "timestamp" : "2011-10-19T10:23:29.588+0200",
    "parameters" : {
      "store" : "temp",
      "tempStore" : "temp"
    },
    "workflow" : "indexUpdate"
  }
}

 

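The number of processed records is given by the "recordCount" entries (above: "output.insertedRecords.recordCount" and "input.input.recordCount", both 5). This number should equal the number of rows in your DB.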
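
As a rough sketch of that check, the following Python snippet collects every "recordCount" entry per worker from job run data shaped like the example above and compares it with an expected row count. The function names and the expected_rows parameter are illustrative assumptions, not part of SMILA, and the sample is a trimmed copy of the output above rather than a live response.

```python
import json


def record_counts(job_run):
    """Collect every '*.recordCount' counter, grouped by worker name."""
    counts = {}
    for worker, stats in job_run.get("worker", {}).items():
        per_worker = {
            key: value
            for key, value in stats.items()
            if key.endswith(".recordCount")
        }
        if per_worker:
            counts[worker] = per_worker
    return counts


def all_counts_match(job_run, expected_rows):
    """True only if at least one counter exists and all equal expected_rows."""
    counts = record_counts(job_run)
    values = [v for per_worker in counts.values() for v in per_worker.values()]
    return bool(values) and all(v == expected_rows for v in values)


# Trimmed copy of the job run data shown above (assumed shape).
sample = json.loads("""
{
  "state": "RUNNING",
  "worker": {
    "bulkbuilder": {
      "output.insertedRecords.dataObjectCount": 1,
      "output.insertedRecords.recordCount": 5,
      "successfulTaskCount": 1
    },
    "pipelineProcessor": {
      "input.input.dataObjectCount": 1,
      "input.input.recordCount": 5,
      "successfulTaskCount": 1
    }
  }
}
""")

if __name__ == "__main__":
    print(record_counts(sample))
    # expected_rows is whatever SELECT COUNT(*) reports for the crawled table.
    print(all_counts_match(sample, expected_rows=5))
```

If no worker reports a "recordCount" at all, all_counts_match returns False, which corresponds to the case where the crawler delivered nothing to the job.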

 

If your result differs from this, please inspect the JDBC crawler in jconsole and check whether it reports any errors.

 

Please be sure to clear the delta indexing data via jconsole before you test again.

 

Best regards,

Andreas

 

From: smila-user-bounces@xxxxxxxxxxx [mailto:smila-user-bounces@xxxxxxxxxxx] On Behalf Of Srividya Jagannathan
Sent: Tuesday, 18 October 2011 09:07
To: smila-user@xxxxxxxxxxx
Subject: [smila-user] mysql crawler

 

I configured the MySQL crawler with jdbc.xml and the correct XSD file. However, when I send the following from a REST client, my data does not seem to get indexed:

 

 

{
  "jobName": "indexUpdateJob"
}

 

The following error occurs and nothing seems to get indexed. Can you tell me what the issue could be?

 

KeeperException when processing sessionid:0x13315c329620000 type:create cxid:0x923 zxid:0xfffffffffffffffe txntype:unknown reqpath:n/a Error Path:/smila/tasks Error:KeeperErrorCode = NoNode for /smila/tasks

 2011-10-18 12:12:08,116 INFO  [Thread-19                                    ]  impl.DeltaIndexingManagerImpl                 - obsoleteIdIterator could not find any obsolete ids for source: jdbc

 2011-10-18 12:12:08,225 INFO  [Thread-19                                    ]  jdbc.JdbcCrawler                              - Closing JdbcCrawler...

 2011-10-18 12:12:08,225 INFO  [Thread-19                                    ]  impl.CrawlThread                              - Removing deltaindexing lock on datasource jdbc

 2011-10-18 12:12:08,257 INFO  [Thread-19                                    ]  impl.CrawlThread                              - Finished session edac8322-3bf2-4608-99fc-d47df79314b7 and removed Deltaindexing lock on datasource jdbc

 2011-10-18 12:12:08,257 INFO  [Thread-19                                    ]  impl.CrawlThread                              - Unregistering crawling thread jdbc

 2011-10-18 12:12:08,257 INFO  [Thread-19                                    ]  impl.CrawlThread                              - Crawling thread jdbc unregistered

 2011-10-18 12:12:08,257 INFO  [Thread-19  

