Hi,<br><br>I am trying to run grep on newslab data on teraport. While everything works fine for a small number of patterns and files(e.g. ~20 patterns, ~1000 files), I get errors for larger numbers of files (~10k). I would be very grateful for any help.<br>
<br>The main files I'm using are (at CI) :<br>/home/gabri/swift-0.8/examples/swift/newslabex/count/tc.data<br>/home/gabri/swift-0.8/examples/swift/newslabex/count/sites.xml (-using the fast queue)<br>/home/gabri/swift-0.8/examples/swift/newslabex/count/count.swift<br>
/home/gabri/swift-0.8/examples/swift/newslabex/count/grp<br><br>For number of files=10k and number of patterns=2<br>- I'm getting an "java.lang.OutOfMemoryError". I have tried increasing the heap size by runnig Swift with (-Xms1536m -Xmx4096m) in the command line, but that seemed to just push the failure point a little further. Is this at all the way to go?<br>
- The corresponding logs are at: /home/gabri/swift-0.8/examples/swift/newslabex/count/errmanyfiles/<br><br>Thank you very much for any suggestions.<br>Best,<br>Gabri<br>