Computation error


Advanced search

Message boards : Windows : Computation error

Sort
Author Message
BLUWOLF

Joined: Oct 27 08
Posts: 1
ID: 3029
Credit: 5,986
RAC: 0
Message 4775 - Posted 22 Jan 2009 19:30:01 UTC

after dealing with download errors. the work units i got all of them have failed. just shortly after boinc started them

ChinookFoehn

Joined: Sep 3 08
Posts: 2
ID: 460
Credit: 2,162,688
RAC: 0
Message 4778 - Posted 22 Jan 2009 21:57:21 UTC - in response to Message ID 4775 .

Consider yourself lucky. Many of mine have either been failing overnight or by me aborting them, after I noticed them, after 8+ hours of calculations showing 0 seconds calculated in the results with a "too many errors" error. Rather annoying at all the wasted effort.

-ChinookFöhn

ChinookFoehn

Joined: Sep 3 08
Posts: 2
ID: 460
Credit: 2,162,688
RAC: 0
Message 4779 - Posted 23 Jan 2009 8:25:55 UTC

There seems to be something very wrong. On all my computers, Windows [2000, XP, Vista] & Linux [Ubuntu], the tasks are failing. Sometimes they run for a bit before they stop showing any time spent yet sit there for hours and hours while others stay at 0 and but calculate forever. I've set the project for No New Work and aborted all remaining tasks. I do not have the luxury of time to sit and monitor all the computers to observe if a task is failing, which it seems every single one now does.

I'll try re-attaching a single computer in a month or two to see if, then, the tasks calculate properly or still fail without cancelling in a quick manner.

At least individual tasks haven't been as bad as WCG - Clean Energy - where one of my laptops was on the road and calculated for over 60 hours before it returned and I could abort it. Why other projects do not follow the pattern of LHC which sends aborts out so as not to waste time and energy on useless tasks as Clean Energy knew of this error days before the laptop was returned.... Cosmology also could do this which let tasks waste hundreds of hours knowing full well every task returned was wasted and would not be accepted as valid. (sigh)

I wish you the best of luck but there have been too many tasks that have done over 8 hours of calculation before I caught them all failing - wasting several times the time wasted by the WCG - Clean Energy task.

-ChinookFoehn

Profile Sumeet S

Joined: Jun 18 09
Posts: 2
ID: 13576
Credit: 37,445
RAC: 0
Message 5377 - Posted 7 Sep 2009 17:01:46 UTC - in response to Message ID 4775 .

after dealing with download errors. the work units i got all of them have failed. just shortly after boinc started them


have you overclocked your CPU
Profile robertmiles

Joined: Apr 16 09
Posts: 96
ID: 9967
Credit: 1,290,747
RAC: 0
Message 5546 - Posted 23 Nov 2009 15:50:35 UTC

If this problem happens again, you may want to check the outputs returned for any sign of a lockfile error - something that starts when a workunit ends without doing anything to delete its lockfile. It then causes all other workunits that try to run in the same slot to crunch for a while, then find that the leftover lockfile makes them unable to write a checkpoint file or possibly an output file, even if those workunits are from some other BOINC project. BOINC then restarts those workunits over and over, using CPU time but not accomplishing much of anything useful. Restarting BOINC (either by a reboot or by more gentle means) makes it automatically delete the lockfiles for all workunits that are already finished, but it will seldom let you delete the lockfiles yourself by any other means.

If this is a common problem under Docking@home, it may be wise to make the next version of the workunit program test its slot for a leftover lockfile from some other workunit soon after it starts, then fail with a numeric error code if it finds one, since it probably won't be able to write much in the way of output files that it can return instead.

Some of the very recent versions of BOINC appear to avoid this problem by simply not reusing slots with a leftover lockfile.

kd55

Joined: Sep 21 08
Posts: 3
ID: 1086
Credit: 40,624
RAC: 0
Message 5572 - Posted 9 Dec 2009 6:24:55 UTC

I have been running BOINC 6.10.18 on Win7 64x. Projects include Climate Prediction, Rosetta, WCG and Docking. All are running well except Docking. For the past 5-6 weeks every Docking file has run for at least 8 hours with 0% progress. I have had to manually abort every one of them. For now I am detaching from the project. Hopefully the bug will be resolved in the future.

Eagle

Joined: May 27 11
Posts: 1
ID: 40935
Credit: 352,397
RAC: 0
Message 6807 - Posted 16 Aug 2012 18:21:36 UTC

16.08.2012 19:18:06 | Docking | Starting task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 using charmm34 version 623 in slot 3
16.08.2012 19:21:18 | Docking | Task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 exited with zero status but no 'finished' file
16.08.2012 19:21:18 | Docking | If this happens repeatedly you may need to reset the project.
16.08.2012 19:21:18 | Docking | Restarting task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 using charmm34 version 623 in slot 3

16.08.2012 19:24:28 | Docking | Task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 exited with zero status but no 'finished' file
16.08.2012 19:24:28 | Docking | If this happens repeatedly you may need to reset the project.

16.08.2012 22:05:28 | Docking | Task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 exited with zero status but no 'finished' file
16.08.2012 22:05:28 | Docking | If this happens repeatedly you may need to reset the project.
16.08.2012 22:05:28 | Docking | Restarting task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 using charmm34 version 623 in slot 3
16.08.2012 22:08:31 | Docking | Task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 exited with zero status but no 'finished' file
16.08.2012 22:08:31 | Docking | If this happens repeatedly you may need to reset the project.
16.08.2012 22:08:31 | Docking | Restarting task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 using charmm34 version 623 in slot 3

16.08.2012 22:11:36 | Docking | Task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 exited with zero status but no 'finished' file
16.08.2012 22:11:36 | Docking | If this happens repeatedly you may need to reset the project.
16.08.2012 22:11:36 | Docking | Restarting task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 using charmm34 version 623 in slot 3
16.08.2012 22:14:42 | Docking | Task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 exited with zero status but no 'finished' file
16.08.2012 22:14:42 | Docking | If this happens repeatedly you may need to reset the project.
16.08.2012 22:14:42 | Docking | Restarting task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 using charmm34 version 623 in slot 3
16.08.2012 22:17:47 | Docking | Task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 exited with zero status but no 'finished' file
16.08.2012 22:17:47 | Docking | If this happens repeatedly you may need to reset the project.
16.08.2012 22:17:47 | Docking | Restarting task 1hih1g35_mod0014crossdockinghiv1_30138_461676_0 using charmm34 version 623 in slot 3

Profile TheFiend

Joined: Apr 7 09
Posts: 70
ID: 9482
Credit: 20,705,527
RAC: 0
Message 6809 - Posted 17 Aug 2012 23:23:40 UTC

Are you running multiple projects on the same PC?

I found running Docking and Malariacontrol together caused compute errors which stopped when I suspended Malariacontrol and just ran Docking.

Try suspending your other projects and see what happens.

Rana Ahmed

Joined: Nov 4 14
Posts: 1
ID: 112235
Credit: 0
RAC: 0
Message 7389 - Posted 4 Nov 2014 17:53:30 UTC

Computer is very essential pert for everyone.
____________
ground penetrating radar companies

Message boards : Windows : Computation error

Database Error
: The MySQL server is running with the --read-only option so it cannot execute this statement
array(3) {
  [0]=>
  array(7) {
    ["file"]=>
    string(47) "/boinc/projects/docking/html_v2/inc/db_conn.inc"
    ["line"]=>
    int(97)
    ["function"]=>
    string(8) "do_query"
    ["class"]=>
    string(6) "DbConn"
    ["object"]=>
    object(DbConn)#14 (2) {
      ["db_conn"]=>
      resource(96) of type (mysql link persistent)
      ["db_name"]=>
      string(7) "docking"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(1) {
      [0]=>
      &string(51) "update DBNAME.thread set views=views+1 where id=388"
    }
  }
  [1]=>
  array(7) {
    ["file"]=>
    string(48) "/boinc/projects/docking/html_v2/inc/forum_db.inc"
    ["line"]=>
    int(60)
    ["function"]=>
    string(6) "update"
    ["class"]=>
    string(6) "DbConn"
    ["object"]=>
    object(DbConn)#14 (2) {
      ["db_conn"]=>
      resource(96) of type (mysql link persistent)
      ["db_name"]=>
      string(7) "docking"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(3) {
      [0]=>
      object(BoincThread)#3 (16) {
        ["id"]=>
        string(3) "388"
        ["forum"]=>
        string(1) "5"
        ["owner"]=>
        string(4) "3029"
        ["status"]=>
        string(1) "0"
        ["title"]=>
        string(17) "Computation error"
        ["timestamp"]=>
        string(10) "1415123610"
        ["views"]=>
        string(3) "277"
        ["replies"]=>
        string(1) "8"
        ["activity"]=>
        string(19) "0.00097511866117458"
        ["sufferers"]=>
        string(1) "0"
        ["score"]=>
        string(1) "0"
        ["votes"]=>
        string(1) "0"
        ["create_time"]=>
        string(10) "1232652601"
        ["hidden"]=>
        string(1) "0"
        ["sticky"]=>
        string(1) "0"
        ["locked"]=>
        string(1) "0"
      }
      [1]=>
      &string(6) "thread"
      [2]=>
      &string(13) "views=views+1"
    }
  }
  [2]=>
  array(7) {
    ["file"]=>
    string(63) "/boinc/projects/docking/html_v2/user/community/forum/thread.php"
    ["line"]=>
    int(184)
    ["function"]=>
    string(6) "update"
    ["class"]=>
    string(11) "BoincThread"
    ["object"]=>
    object(BoincThread)#3 (16) {
      ["id"]=>
      string(3) "388"
      ["forum"]=>
      string(1) "5"
      ["owner"]=>
      string(4) "3029"
      ["status"]=>
      string(1) "0"
      ["title"]=>
      string(17) "Computation error"
      ["timestamp"]=>
      string(10) "1415123610"
      ["views"]=>
      string(3) "277"
      ["replies"]=>
      string(1) "8"
      ["activity"]=>
      string(19) "0.00097511866117458"
      ["sufferers"]=>
      string(1) "0"
      ["score"]=>
      string(1) "0"
      ["votes"]=>
      string(1) "0"
      ["create_time"]=>
      string(10) "1232652601"
      ["hidden"]=>
      string(1) "0"
      ["sticky"]=>
      string(1) "0"
      ["locked"]=>
      string(1) "0"
    }
    ["type"]=>
    string(2) "->"
    ["args"]=>
    array(1) {
      [0]=>
      &string(13) "views=views+1"
    }
  }
}
query: update docking.thread set views=views+1 where id=388