[opa-nightly-tests] jam OPA_Daily_Tests_1206Thu_FAILED!!!

Neil Fortner nfortne2 at hdfgroup.org
Fri Dec 7 13:36:49 CST 2012


Dave,

ostrich is a Linux/POWER7 machine.  The test seems to fail occasionally
when I run it by hand, from both the svn and git checkouts.  I don't
know why it wasn't failing before the change to git.  The failure is in
the LL/SC ABA test (2 threads), and the message is "Unexpected success
of OPA_SC_int" (often printed multiple times).  I will look into this
further.
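
For context, the message suggests the test expects a store-conditional to
fail when another thread changed the value and then changed it back
between the load-linked and the store-conditional.  The Python sketch
below is only a toy model of that property.  ReservedCell and
aba_scenario are made-up names for illustration; this is not OPA code,
and it does not reproduce the real test or the POWER7 behavior.

```python
class ReservedCell:
    """Toy one-word memory cell with LL/SC-style reservations (hypothetical)."""

    def __init__(self, value):
        self.value = value
        self._reservations = set()  # ids of threads holding a live reservation

    def load_linked(self, tid):
        # LL: read the value and record that `tid` is watching this cell.
        self._reservations.add(tid)
        return self.value

    def store(self, value):
        # Any store clears every outstanding reservation,
        # even when it writes back the value that was there before.
        self.value = value
        self._reservations.clear()

    def store_conditional(self, tid, value):
        # SC: succeeds only if no store hit the cell since this thread's LL.
        if tid not in self._reservations:
            return False
        self.store(value)
        return True

    def compare_and_swap(self, expected, value):
        # CAS compares values only, so an A -> B -> A history is invisible to it.
        if self.value != expected:
            return False
        self.store(value)
        return True


def aba_scenario():
    """Return (sc_succeeded, cas_succeeded) after an A -> B -> A interleaving."""
    cell = ReservedCell(0)
    seen = cell.load_linked(tid=1)  # thread 1 reads A
    cell.store(1)                   # thread 2 writes B
    cell.store(0)                   # thread 2 restores A
    sc_ok = cell.store_conditional(tid=1, value=seen + 10)
    cas_ok = cell.compare_and_swap(expected=seen, value=seen + 10)
    return sc_ok, cas_ok
```

In this model the SC is rejected while a plain compare-and-swap on the
same history goes through.  An "unexpected success" would correspond to
the SC being accepted despite the intervening stores.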

Thanks,
-Neil

On 12/07/2012 11:29 AM, Dave Goodell wrote:
> What sort of host is ostrich?  The source code should not have changed (besides adding a ".gitignore" file), and the tests are passing on my x86_64 laptop when run by hand.
>
> The log output is missing the actual failure message too, so it's hard to diagnose what's going on here.
>
> -Dave
>
> On Dec 6, 2012, at 7:01 PM PST, Larry Knox wrote:
>
>> This is the first run using the new git repository.  1 test failed on
>> ostrich.  This will give us a good test of the git update assuming a code
>> change is necessary to fix the problem.
>>
>>   There was another test message just before this one from the previous
>> night's test.  The machines freedom and loyalty are not responding to ssh
>> due to hdfgroup changes, pending a reboot by Quincey when he returns.  I
>> have removed them from the test machine list until that happens.
>>
>> Larry
>>
>> -----Original Message-----
>> From: opa-nightly-tests-bounces at lists.mcs.anl.gov
>> [mailto:opa-nightly-tests-bounces at lists.mcs.anl.gov] On Behalf Of HDF Tester
>> Sent: Thursday, December 06, 2012 8:43 PM
>> To: opa-nightly-tests at lists.mcs.anl.gov
>> Subject: [opa-nightly-tests] jam OPA_Daily_Tests_1206Thu_FAILED!!!
>>
>> *** OPA Tests on 1206Thu ***
>> =============================
>>    Tests Summary
>> =============================
>> ****FAILED ostrich: standard****
>>
>> PASSED jam: standard
>> PASSED koala: standard
>> PASSED linew: standard
>>
>> =============================
>>    Tests Time Summary
>> =============================
>> koala: Ran 1(1/0/0) tests, Grand total test time =  0m 32s
>> jam: Ran 1(1/0/0) tests, Grand total test time =  1m 1s
>> ostrich: Ran 1(0/1/0) tests, Grand total test time =  1m 40s
>> linew: Ran 1(1/0/0) tests, Grand total test time =  2m 47s
>> jam: Ran 6(0/0/0) hosts, Grand total test time =  191m 58s
>> jam: Ran 6(0/0/0) hosts, Grand total test time =  197m 10s
>> jam: Ran 6(0/0/0) hosts, Grand total test time =  153m 39s
>>
>>
>> =============================
>>    Timekeeper log
>> =============================
>> Timekeeper started at Thu Dec  6 18:09:52 CST 2012
>> Timekeeper sleeping for 720 seconds
>> Timekeeper woke up at Thu Dec  6 18:21:52 CST 2012, looking for processes to terminate...
>> Timekeeper ended at Thu Dec  6 18:21:52 CST 2012
>>
>>
>> =============================
>>    Tests Failures
>> =============================
>> =========================
>> Dumping logfile of ostrich: standard
>> Last 50 lines of /mnt/scr1/SnapTest/snapshots-opa/log/ostrich_1206Thu_1810
>> =========================
>> Testing integer compare-and-swap fairness with 100 threads
>> PASSED
>> Testing pointer compare-and-swap fairness with 100 threads
>> PASSED
>> Testing integer swap with 100 threads
>> PASSED
>> Testing pointer swap with 100 threads
>> PASSED
>> Testing integer LL/SC stack with 100 threads
>> PASSED
>> Testing pointer LL/SC stack with 100 threads
>> PASSED
>> ***** 1 PRIMITIVES TEST FAILED! *****
>> FAIL: test_primitives
>> Testing memory barrier sanity
>> PASSED
>> Testing memory barriers with linear array with 2 threads
>> PASSED
>> Testing memory barriers with local variables with 2 threads
>> PASSED
>> Testing memory barriers with scattered array with 2 threads
>> PASSED
>> Testing memory barriers with linear array with 4 threads
>> PASSED
>> Testing memory barriers with local variables with 4 threads
>> PASSED
>> Testing memory barriers with scattered array with 4 threads
>> PASSED
>> Testing memory barriers with linear array with 10 threads
>> PASSED
>> Testing memory barriers with local variables with 10 threads
>> PASSED
>> Testing memory barriers with scattered array with 10 threads
>> PASSED
>> Testing memory barriers with linear array with 100 threads
>> PASSED
>> Testing memory barriers with local variables with 100 threads
>> PASSED
>> Testing memory barriers with scattered array with 100 threads
>> PASSED
>> All barriers tests passed.
>> PASS: test_barriers
>> Testing queue sanity
>> PASSED
>> Testing multithreaded queue with 2 threads
>> PASSED
>> Testing multithreaded queue (empty queue) with 2 threads
>> PASSED
>> Testing multithreaded queue (full queue) with 2 threads
>> PASSED
>> Testing multithreaded queue with 4 threads
>> PASSED
>> Testing multithreaded queue (empty queue) with 4 threads
>> PASSED
>> Testing multithreaded queue (full queue) with 4 threads
>> PASSED
>> Testing multithreaded queue with 10 threads
>> PASSED
>> Testing multithreaded queue (empty queue) with 10 threads
>> PASSED
>> Testing multithreaded queue (full queue) with 10 threads
>> PASSED
>> Testing multithreaded queue with 100 threads
>> PASSED
>> Testing multithreaded queue (empty queue) with 100 threads
>> PASSED
>> Testing multithreaded queue (full queue) with 100 threads
>> PASSED
>> All queue tests passed.
>> PASS: test_queue
>> ===================================================================
>> 1 of 4 tests failed
>> Please report to https://trac.mcs.anl.gov/projects/openpa/newticket
>> ===================================================================
>> gmake[2]: *** [check-TESTS] Error 1
>> gmake[2]: Leaving directory
>> `/scr/hdftest/snapshots-opa/TestDir/ostrich/test'
>> gmake[1]: *** [check-am] Error 2
>> gmake[1]: Leaving directory
>> `/scr/hdftest/snapshots-opa/TestDir/ostrich/test'
>> gmake: *** [check-recursive] Error 1
>> Failed running make check
>> ===== Exit bin/snapshot with status=2: Thu Dec  6 18:11:55 CST 2012 =====
>> Thu Dec  6 18:11:55 CST 2012 ========================= Dumping done
>> =========================
>>
>> Runtest did not exit normally.
>> Runtest did not exit normally.
>>
>> =============================
>>    Watchers List
>> =============================
>> OPA Daily test features/platforms watchers and procedure
>> ---------------------------------------------------------
>>
>> Procedure:
>> The watcher will investigate and report the cause of failure by 11am.
>> The developer who checked in the error code may report so by then too.
>> The watcher or the developer should get the failure fixed and report it by
>> 3pm.
>>
>>
>> Watcher for OPA:	 	Neil
>>
>>
>> ---
>> updated: 2009/05/05
>>
>> =============================
>>    Tests Details
>> =============================
>> 18:09:33 up 1 day,  9:03, 18 users,  load average: 0.06, 0.26, 0.43
>> Filesystem           1K-blocks      Used Available Use% Mounted on
>> /dev/sda3             31738420   3798708  26301484  13% /
>> /dev/sda1               101086     23185     72682  25% /boot
>> /dev/sda2             31738420    365712  29734480   2% /tmp
>> /dev/sda6             31738392   9891148  20209020  33% /var
>> /dev/sda7             31738392  14854308  15245860  50% /usr
>> /dev/sda8            124991068   1133300 117406068   1% /var/tmp
>> /dev/mapper/VolGroup00-home
>>                      198351840  17664644 170448836  10% /home
>> /dev/sde1            565688764 407775828 128714000  77% /scr
>> /dev/sdc1            961432072 816721756  95872316  90% /mnt/scr1
>> /dev/sdd1            961432072 759854124 152739948  84% /mnt/hdf
>> tmpfs                  8313628         0   8313628   0% /dev/shm
>> gumund:/data/ftp     480719104 361001344  95298560  80% /mnt/ftp
>> gumund:/data/web     480719104 361001344  95298560  80% /mnt/web
>> STANDARD_OPT=op-configure --prefix=${PWD}/opainstall TEST_TYPES=standard
>>
>> Running source repository checkout with output saved in
>>    /mnt/scr1/SnapTest/snapshots-opa/log/REPO_LOG_1206Thu
>> Checking MANIFEST file ...
>> cat: /mnt/scr1/SnapTest/snapshots-opa/log/#runtest.1206Thu.18700: No such
>> file or directory
>> rm: cannot remove
>> `/mnt/scr1/SnapTest/snapshots-opa/log/#runtest.1206Thu.18700': No such file
>> or directory
>>
>> Thu Dec  6 18:09:52 CST 2012
>> *** launching tests from jam ***
>>
>> TESTHOST is linew
>> jam
>> koala
>> ostrich
>> loyalty
>> freedom
>>     Fork off timekeeper 12
>> Killed
>> cannot remote command with loyalty
>> Killed
>> cannot remote command with freedom
>> cat: #linew_linew.out: No such file or directory
>> tail: cannot open `#linew_linew.out' for reading: No such file or directory
>> tail: cannot open `#linew_linew.out' for reading: No such file or directory
>> 	*************************************
>> 	Thu Dec  6 20:43:22 CST 2012
>> 	****INCOMPLETE linew: snaptest did not complete****
>> 	*************************************
>>
>> cat: #jam_jam.out: No such file or directory
>> tail: cannot open `#jam_jam.out' for reading: No such file or directory
>> tail: cannot open `#jam_jam.out' for reading: No such file or directory
>> 	*************************************
>> 	Thu Dec  6 20:43:22 CST 2012
>> 	****INCOMPLETE jam: snaptest did not complete****
>> 	*************************************
>>
>> cat: #koala_koala.out: No such file or directory
>> tail: cannot open `#koala_koala.out' for reading: No such file or directory
>> tail: cannot open `#koala_koala.out' for reading: No such file or directory
>> 	*************************************
>> 	Thu Dec  6 20:43:22 CST 2012
>> 	****INCOMPLETE koala: snaptest did not complete****
>> 	*************************************
>>
>> cat: #ostrich_ostrich.out: No such file or directory
>> tail: cannot open `#ostrich_ostrich.out' for reading: No such file or
>> directory
>> tail: cannot open `#ostrich_ostrich.out' for reading: No such file or
>> directory
>> 	*************************************
>> 	Thu Dec  6 20:43:22 CST 2012
>> 	****INCOMPLETE ostrich: snaptest did not complete****
>> 	*************************************
>>
>> cat: #loyalty_loyalty.out: No such file or directory
>> tail: cannot open `#loyalty_loyalty.out' for reading: No such file or
>> directory
>> tail: cannot open `#loyalty_loyalty.out' for reading: No such file or
>> directory
>> 	*************************************
>> 	Thu Dec  6 20:43:22 CST 2012
>> 	****INCOMPLETE loyalty: snaptest did not complete****
>> 	*************************************
>>
>> freedom does not accept Remote Command (Thu Dec  6 20:42:52 CST 2012)
>> freedom does not accept Remote Command (Thu Dec  6 20:42:52 CST 2012)
>> 	*************************************
>> 	*************************************
>> 	Thu Dec  6 20:42:52 CST 2012
>> 	****SYSTEM ERROR: freedom does not accept Remote Command (Thu Dec  6
>> 20:42:52 CST 2012)
>> 	*************************************
>> 	Thu Dec  6 20:42:52 CST 2012
>> 	****SYSTEM ERROR: freedom does not accept Remote Command (Thu Dec  6
>> 20:42:52 CST 2012)
>> 	*************************************
>> 	*************************************
>> 	Thu Dec  6 20:43:22 CST 2012
>> 	****INCOMPLETE freedom: snaptest did not complete****
>> 	*************************************
>>
>> *** finished tests in jam ***
>> Thu Dec  6 20:43:22 CST 2012
>> jam: Ran 6(0/0/0) hosts, Grand total test time =  153m 39s
>>
>> _______________________________________________
>> opa-nightly-tests mailing list
>> opa-nightly-tests at lists.mcs.anl.gov
>> https://lists.mcs.anl.gov/mailman/listinfo/opa-nightly-tests
>>
>

