[opa-nightly-tests] jam OPA_Daily_Tests_1206Thu_FAILED!!!
Dave Goodell
goodell at mcs.anl.gov
Fri Dec 7 14:25:54 CST 2012
OK, thanks Neil. If you get stumped, we can see if Kaz has any ideas, since he has a good deal of PPC/POWER knowledge.
-Dave
On Dec 7, 2012, at 1:36 PM CST, Neil Fortner wrote:
> Dave,
>
> ostrich is a Linux/POWER7 machine. The test occasionally fails there when I run it by hand, from both the svn and git checkouts. I don't know why it wasn't failing before the change to git. The failure is in the LL/SC ABA test (2 threads), and the message is "Unexpected success of OPA_SC_int" (often printed multiple times). I will look into this further.
>
> Thanks,
> -Neil
>
> On 12/07/2012 11:29 AM, Dave Goodell wrote:
>> What sort of host is ostrich? The source code should not have changed (besides adding a ".gitignore" file), and the tests are passing on my x86_64 laptop when run by hand.
>>
>> The log output is missing the actual failure message too, so it's hard to diagnose what's going on here.
>>
>> -Dave
>>
>> On Dec 6, 2012, at 7:01 PM PST, Larry Knox wrote:
>>
>>> This is the first run using the new git repository. 1 test failed on
>>> ostrich. This will give us a good test of the git update assuming a code
>>> change is necessary to fix the problem.
>>>
>>> There was another test message just before this one from the previous
>>> night's test. The machines freedom and loyalty are not responding to ssh
>>> due to hdfgroup changes, pending a reboot by Quincey when he returns. I
>>> have removed them from the test machine list until that happens.
>>>
>>> Larry
>>>
>>> -----Original Message-----
>>> From: opa-nightly-tests-bounces at lists.mcs.anl.gov
>>> [mailto:opa-nightly-tests-bounces at lists.mcs.anl.gov] On Behalf Of HDF Tester
>>> Sent: Thursday, December 06, 2012 8:43 PM
>>> To: opa-nightly-tests at lists.mcs.anl.gov
>>> Subject: [opa-nightly-tests] jam OPA_Daily_Tests_1206Thu_FAILED!!!
>>>
>>> *** OPA Tests on 1206Thu ***
>>> =============================
>>> Tests Summary
>>> =============================
>>> ****FAILED ostrich: standard****
>>>
>>> PASSED jam: standard
>>> PASSED koala: standard
>>> PASSED linew: standard
>>>
>>> =============================
>>> Tests Time Summary
>>> =============================
>>> koala: Ran 1(1/0/0) tests, Grand total test time = 0m 32s
>>> jam: Ran 1(1/0/0) tests, Grand total test time = 1m 1s
>>> ostrich: Ran 1(0/1/0) tests, Grand total test time = 1m 40s
>>> linew: Ran 1(1/0/0) tests, Grand total test time = 2m 47s
>>> jam: Ran 6(0/0/0) hosts, Grand total test time = 191m 58s
>>> jam: Ran 6(0/0/0) hosts, Grand total test time = 197m 10s
>>> jam: Ran 6(0/0/0) hosts, Grand total test time = 153m 39s
>>>
>>>
>>> =============================
>>> Timekeeper log
>>> =============================
>>> Timekeeper started at Thu Dec 6 18:09:52 CST 2012
>>> Timekeeper sleeping for 720 seconds
>>> Timekeeper woke up at Thu Dec 6 18:21:52 CST 2012, looking for processes to terminate...
>>> Timekeeper ended at Thu Dec 6 18:21:52 CST 2012
>>>
>>>
>>> =============================
>>> Tests Failures
>>> =============================
>>> =========================
>>> Dumping logfile of ostrich: standard
>>> Last 50 lines of /mnt/scr1/SnapTest/snapshots-opa/log/ostrich_1206Thu_1810
>>> =========================
>>> Testing integer compare-and-swap fairness with 100 threads
>>> PASSED
>>> Testing pointer compare-and-swap fairness with 100 threads
>>> PASSED
>>> Testing integer swap with 100 threads
>>> PASSED
>>> Testing pointer swap with 100 threads
>>> PASSED
>>> Testing integer LL/SC stack with 100 threads
>>> PASSED
>>> Testing pointer LL/SC stack with 100 threads
>>> PASSED
>>> ***** 1 PRIMITIVES TEST FAILED! *****
>>> FAIL: test_primitives
>>> Testing memory barrier sanity
>>> PASSED
>>> Testing memory barriers with linear array with 2 threads
>>> PASSED
>>> Testing memory barriers with local variables with 2 threads
>>> PASSED
>>> Testing memory barriers with scattered array with 2 threads
>>> PASSED
>>> Testing memory barriers with linear array with 4 threads
>>> PASSED
>>> Testing memory barriers with local variables with 4 threads
>>> PASSED
>>> Testing memory barriers with scattered array with 4 threads
>>> PASSED
>>> Testing memory barriers with linear array with 10 threads
>>> PASSED
>>> Testing memory barriers with local variables with 10 threads
>>> PASSED
>>> Testing memory barriers with scattered array with 10 threads
>>> PASSED
>>> Testing memory barriers with linear array with 100 threads
>>> PASSED
>>> Testing memory barriers with local variables with 100 threads
>>> PASSED
>>> Testing memory barriers with scattered array with 100 threads
>>> PASSED
>>> All barriers tests passed.
>>> PASS: test_barriers
>>> Testing queue sanity
>>> PASSED
>>> Testing multithreaded queue with 2 threads
>>> PASSED
>>> Testing multithreaded queue (empty queue) with 2 threads
>>> PASSED
>>> Testing multithreaded queue (full queue) with 2 threads
>>> PASSED
>>> Testing multithreaded queue with 4 threads
>>> PASSED
>>> Testing multithreaded queue (empty queue) with 4 threads
>>> PASSED
>>> Testing multithreaded queue (full queue) with 4 threads
>>> PASSED
>>> Testing multithreaded queue with 10 threads
>>> PASSED
>>> Testing multithreaded queue (empty queue) with 10 threads
>>> PASSED
>>> Testing multithreaded queue (full queue) with 10 threads
>>> PASSED
>>> Testing multithreaded queue with 100 threads
>>> PASSED
>>> Testing multithreaded queue (empty queue) with 100 threads
>>> PASSED
>>> Testing multithreaded queue (full queue) with 100 threads
>>> PASSED
>>> All queue tests passed.
>>> PASS: test_queue
>>> ===================================================================
>>> 1 of 4 tests failed
>>> Please report to https://trac.mcs.anl.gov/projects/openpa/newticket
>>> ===================================================================
>>> gmake[2]: *** [check-TESTS] Error 1
>>> gmake[2]: Leaving directory
>>> `/scr/hdftest/snapshots-opa/TestDir/ostrich/test'
>>> gmake[1]: *** [check-am] Error 2
>>> gmake[1]: Leaving directory
>>> `/scr/hdftest/snapshots-opa/TestDir/ostrich/test'
>>> gmake: *** [check-recursive] Error 1
>>> Failed running make check
>>> ===== Exit bin/snapshot with status=2: Thu Dec 6 18:11:55 CST 2012 =====
>>> Thu Dec 6 18:11:55 CST 2012 ========================= Dumping done
>>> =========================
>>>
>>> Runtest did not exit normally.
>>> Runtest did not exit normally.
>>>
>>> =============================
>>> Watchers List
>>> =============================
>>> OPA Daily test features/platforms watchers and procedure
>>> ---------------------------------------------------------
>>>
>>> Procedure:
>>> The watcher will investigate and report the cause of failure by 11am.
>>> The developer who checked in the error code may report so by then too.
>>> The watcher or the developer should get the failure fixed and report it by
>>> 3pm.
>>>
>>>
>>> Watcher for OPA: Neil
>>>
>>>
>>> ---
>>> updated: 2009/05/05
>>>
>>> =============================
>>> Tests Details
>>> =============================
>>> 18:09:33 up 1 day, 9:03, 18 users, load average: 0.06, 0.26, 0.43
>>> Filesystem           1K-blocks      Used Available Use% Mounted on
>>> /dev/sda3             31738420   3798708  26301484  13% /
>>> /dev/sda1               101086     23185     72682  25% /boot
>>> /dev/sda2             31738420    365712  29734480   2% /tmp
>>> /dev/sda6             31738392   9891148  20209020  33% /var
>>> /dev/sda7             31738392  14854308  15245860  50% /usr
>>> /dev/sda8            124991068   1133300 117406068   1% /var/tmp
>>> /dev/mapper/VolGroup00-home
>>>                      198351840  17664644 170448836  10% /home
>>> /dev/sde1            565688764 407775828 128714000  77% /scr
>>> /dev/sdc1            961432072 816721756  95872316  90% /mnt/scr1
>>> /dev/sdd1            961432072 759854124 152739948  84% /mnt/hdf
>>> tmpfs                  8313628         0   8313628   0% /dev/shm
>>> gumund:/data/ftp     480719104 361001344  95298560  80% /mnt/ftp
>>> gumund:/data/web     480719104 361001344  95298560  80% /mnt/web
>>> STANDARD_OPT=op-configure --prefix=${PWD}/opainstall TEST_TYPES=standard
>>>
>>> Running source repository checkout with output saved in
>>> /mnt/scr1/SnapTest/snapshots-opa/log/REPO_LOG_1206Thu
>>> Checking MANIFEST file ...
>>> cat: /mnt/scr1/SnapTest/snapshots-opa/log/#runtest.1206Thu.18700: No such
>>> file or directory
>>> rm: cannot remove
>>> `/mnt/scr1/SnapTest/snapshots-opa/log/#runtest.1206Thu.18700': No such file
>>> or directory
>>>
>>> Thu Dec 6 18:09:52 CST 2012
>>> *** launching tests from jam ***
>>>
>>> TESTHOST is linew
>>> jam
>>> koala
>>> ostrich
>>> loyalty
>>> freedom
>>> Fork off timekeeper 12
>>> Killed
>>> cannot remote command with loyalty
>>> Killed
>>> cannot remote command with freedom
>>> cat: #linew_linew.out: No such file or directory
>>> tail: cannot open `#linew_linew.out' for reading: No such file or directory
>>> tail: cannot open `#linew_linew.out' for reading: No such file or directory
>>> *************************************
>>> Thu Dec 6 20:43:22 CST 2012
>>> ****INCOMPLETE linew: snaptest did not complete****
>>> *************************************
>>>
>>> cat: #jam_jam.out: No such file or directory
>>> tail: cannot open `#jam_jam.out' for reading: No such file or directory
>>> tail: cannot open `#jam_jam.out' for reading: No such file or directory
>>> *************************************
>>> Thu Dec 6 20:43:22 CST 2012
>>> ****INCOMPLETE jam: snaptest did not complete****
>>> *************************************
>>>
>>> cat: #koala_koala.out: No such file or directory
>>> tail: cannot open `#koala_koala.out' for reading: No such file or directory
>>> tail: cannot open `#koala_koala.out' for reading: No such file or directory
>>> *************************************
>>> Thu Dec 6 20:43:22 CST 2012
>>> ****INCOMPLETE koala: snaptest did not complete****
>>> *************************************
>>>
>>> cat: #ostrich_ostrich.out: No such file or directory
>>> tail: cannot open `#ostrich_ostrich.out' for reading: No such file or
>>> directory
>>> tail: cannot open `#ostrich_ostrich.out' for reading: No such file or
>>> directory
>>> *************************************
>>> Thu Dec 6 20:43:22 CST 2012
>>> ****INCOMPLETE ostrich: snaptest did not complete****
>>> *************************************
>>>
>>> cat: #loyalty_loyalty.out: No such file or directory
>>> tail: cannot open `#loyalty_loyalty.out' for reading: No such file or
>>> directory
>>> tail: cannot open `#loyalty_loyalty.out' for reading: No such file or
>>> directory
>>> *************************************
>>> Thu Dec 6 20:43:22 CST 2012
>>> ****INCOMPLETE loyalty: snaptest did not complete****
>>> *************************************
>>>
>>> freedom does not accept Remote Command (Thu Dec 6 20:42:52 CST 2012)
>>> freedom does not accept Remote Command (Thu Dec 6 20:42:52 CST 2012)
>>> *************************************
>>> *************************************
>>> Thu Dec 6 20:42:52 CST 2012
>>> ****SYSTEM ERROR: freedom does not accept Remote Command (Thu Dec 6
>>> 20:42:52 CST 2012)
>>> *************************************
>>> Thu Dec 6 20:42:52 CST 2012
>>> ****SYSTEM ERROR: freedom does not accept Remote Command (Thu Dec 6
>>> 20:42:52 CST 2012)
>>> *************************************
>>> *************************************
>>> Thu Dec 6 20:43:22 CST 2012
>>> ****INCOMPLETE freedom: snaptest did not complete****
>>> *************************************
>>>
>>> *** finished tests in jam ***
>>> Thu Dec 6 20:43:22 CST 2012
>>> jam: Ran 6(0/0/0) hosts, Grand total test time = 153m 39s
>>>
>>> _______________________________________________
>>> opa-nightly-tests mailing list
>>> opa-nightly-tests at lists.mcs.anl.gov
>>> https://lists.mcs.anl.gov/mailman/listinfo/opa-nightly-tests
>>
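For context on the "Unexpected success of OPA_SC_int" message reported in this thread: an LL/SC ABA check presumably verifies that a store-conditional fails when another thread has written to the location since the load-linked, even if the original value was written back. The sketch below is a toy single-threaded model of that property in Python. It is not OpenPA's test code; the class LLSCCell, its methods, and aba_check are hypothetical names invented for illustration, and real hardware reservations (for example on POWER) have more ways to be lost than this model shows.

```python
class LLSCCell:
    """Toy memory cell with load-linked / store-conditional semantics.

    Hypothetical model for illustration only; not part of OpenPA.
    """

    def __init__(self, value):
        self.value = value
        self._reserved = set()  # ids of threads holding a live reservation

    def load_linked(self, tid):
        # Read the value and record a reservation for this thread.
        self._reserved.add(tid)
        return self.value

    def store(self, value):
        # A plain store by any thread cancels every outstanding reservation.
        self.value = value
        self._reserved.clear()

    def store_conditional(self, tid, value):
        # Succeeds only if this thread's reservation is still intact.
        if tid not in self._reserved:
            return False
        self.value = value
        self._reserved.clear()
        return True


def aba_check():
    """Run the A -> B -> A interleaving and report what SC did."""
    cell = LLSCCell(0)
    seen = cell.load_linked(tid=1)   # thread 1 links the cell and sees A
    cell.store(1)                    # thread 2 writes B
    cell.store(seen)                 # thread 2 writes A back
    sc_ok = cell.store_conditional(tid=1, value=42)
    return sc_ok, cell.value


if __name__ == "__main__":
    sc_ok, final = aba_check()
    # A value-comparing CAS would succeed here; a correct SC must not.
    print("SC succeeded:", sc_ok, "final value:", final)
```

In this model the store-conditional returns False and the cell keeps its restored value. A True result would correspond to the "unexpected success" case, where the reservation survived an intervening write.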