Error with Gravitational Wave S6 Directed Search (CasA)

robertmiles
Joined: 8 Oct 09
Posts: 127
Credit: 21748042
RAC: 37757

I found some documentation on vsnprintf:

http://www.cplusplus.com/reference/cstdio/vsnprintf/

http://linux.about.com/library/cmd/blcmdl3_vsnprintf.htm

http://www.tin.org/bin/man.cgi?section=3&topic=vsnprintf

http://www.delorie.com/djgpp/doc/libc/libc_855.html

http://en.cppreference.com/w/cpp/io/c/vfprintf

http://pubs.opengroup.org/onlinepubs/9699919799/functions/vfprintf.html

http://stackoverflow.com/questions/3362994/vsnprintf-and-gcc

This suggests that the call to vsnprintf is likely to be in your source code, rather than something automatically added by the compiler.

If your source code is under Linux, you should be able to grep for all references to vsnprintf within a whole directory of source code.

If the number of references is small but not zero, it shouldn't take long to add some debugging output to show what will be sent to each call to vsnprintf just before it is called.
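A minimal sketch of such debugging output, assuming the calls can be routed through a small wrapper. The names `debug_vsnprintf` and `debug_snprintf` and the `where` label are hypothetical and not taken from the BOINC or application sources; only `vsnprintf`, `fprintf` and `fflush` are standard C.

```c
#include <stdarg.h>
#include <stdio.h>

/* Hypothetical helper (not part of BOINC or the application sources):
 * print what is about to be passed to vsnprintf, then make the call.
 * The log is flushed first so it survives a crash inside vsnprintf. */
static int debug_vsnprintf(const char *where, char *buf, size_t size,
                           const char *fmt, va_list ap)
{
    /* %lu with a cast is used instead of %zu, which older C runtimes
     * may not understand. */
    fprintf(stderr, "[vsnprintf at %s] buf=%p size=%lu fmt=%s\n",
            where, (void *)buf, (unsigned long)size,
            fmt != NULL ? fmt : "(NULL)");
    fflush(stderr);
    return vsnprintf(buf, size, fmt, ap);
}

/* Variadic front end, so an existing snprintf-style call can be
 * redirected by adding one label argument. */
static int debug_snprintf(const char *where, char *buf, size_t size,
                          const char *fmt, ...)
{
    va_list ap;
    int n;

    va_start(ap, fmt);
    n = debug_vsnprintf(where, buf, size, fmt, ap);
    va_end(ap);
    return n;
}
```

A call such as `debug_snprintf("some_file.c:123", buf, sizeof buf, "%d", value)` would then leave one line in stderr per call, and the last line before a crash points at the suspect call site.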

One of these web pages says that vsnprintf is new for C++11; I've found nothing on whether the Windows Vista version of msvcrt.dll supports C++11.

I still have the gravitational wave application enabled for Albert@Home on my Windows Vista computer, so it can test any new version of this application.

Bernd Machenschalk
Moderator
Administrator
Joined: 15 Oct 04
Posts: 4274
Credit: 245382343
RAC: 11409

Currently the 64-bit Windows applications don't give proper stack dumps, which AFAIK is a limitation of the compiler we use.

I'll try to manually resolve the addresses to something useful.

The error most likely happens in BOINC API code, in one of the hundred or so s(n)printf calls being made with a NULL buffer, format string or string argument. BOINC doesn't do a lot of error-checking.

I don't know for sure yet, but some time later it might help if you could capture an init_data.xml file from the slot directory of such a task on your system and make it available to me.
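For illustration only, a sketch of the kind of check that would catch these three cases. `checked_snprintf` and `SAFE_STR` are hypothetical names, not existing BOINC functions, and this is an assumption about how such a guard could look rather than a description of the actual code.

```c
#include <stdarg.h>
#include <stdio.h>

/* Replace a NULL string argument with a visible placeholder. */
#define SAFE_STR(s) ((s) != NULL ? (s) : "(null)")

/* Hypothetical guard: refuse a NULL format string, or a NULL buffer
 * with a non-zero size, instead of handing them to vsnprintf. */
static int checked_snprintf(char *buf, size_t size, const char *fmt, ...)
{
    va_list ap;
    int n;

    if (fmt == NULL || (buf == NULL && size > 0)) {
        fprintf(stderr, "checked_snprintf: NULL %s\n",
                fmt == NULL ? "format string" : "buffer");
        return -1;
    }
    va_start(ap, fmt);
    n = vsnprintf(buf, size, fmt, ap);
    va_end(ap);
    return n;
}
```

String arguments would be wrapped at the call site, for example `checked_snprintf(buf, sizeof buf, "%s", SAFE_STR(name))`, since the wrapper cannot inspect the variadic arguments itself.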

BM

Darrell
Joined: 11 Nov 04
Posts: 32
Credit: 15397991
RAC: 0

Sorry, I haven't been paying attention to my system and only just noticed that some recent CasA tasks have thrown an exception. Here is an example:

http://einsteinathome.org/task/451810894

It seems they don't like BOINC version 7.4.12. I will upgrade to 7.4.18 and see whether they continue to fail, if I get any more tasks.

Valery Astafyev
Joined: 27 Feb 14
Posts: 1
Credit: 1719
RAC: 0

Hello. There was a problem with task h1_0991.65_S6Directed__S6CasAf40a_992.8Hz_186_1

After reaching 90.909% it seemed to be stuck in an endless loop (as I understand it), because it stayed at 90.909% for a few hours and went no further. So I aborted it.

Claggy
Joined: 29 Dec 06
Posts: 560
Credit: 2694028
RAC: 0

Quote:

Hello. There was a problem with task h1_0991.65_S6Directed__S6CasAf40a_992.8Hz_186_1

After reaching 90.909% it seemed to be stuck in an endless loop (as I understand it), because it stayed at 90.909% for a few hours and went no further. So I aborted it.


A number of Einstein apps update their progress only infrequently. If you'd had more patience, it would eventually have reached 100%, and might even have spent some time at 100%.

Claggy

[AF>Le_Pommier] Jerome_C2005
Joined: 1 May 10
Posts: 37
Credit: 111531916
RAC: 3141

Hello

I have one such WU on my Mac (Intel): http://einsteinathome.org/task/452691711

It has not failed, but it is behaving very strangely: it has run for more than 10 minutes and is using only 6% of one core (CPU time is 50 seconds after almost 15 minutes of elapsed time).

In stderr.txt I see:

Quote:

2014-09-03 22:40:38.9239 (79813) [debug]: Flags: LAL_NDEBUG, OPTIMIZE, HS_OPTIMIZATION, GC_SSE2_OPT, X64, SSE, SSE2, GNUC X86 GNUX86
2014-09-03 22:40:38.9261 (79813) [debug]: Set up communication with graphics process.
Code-version: %% LAL: 6.10.0.1 (CLEAN a8f4bc45afff592cb0b364305d838a3e4674209f)
%% LALApps: 6.12.0.1 (CLEAN a8f4bc45afff592cb0b364305d838a3e4674209f)

2014-09-03 22:40:39.2324 (79813) [normal]: Reading input data ...

So it's been "reading input data" for more than 10 minutes?

Edit: it seems I was too impatient; things look much better now:

Quote:
2014-09-03 22:40:39.2324 (79813) [normal]: Reading input data ... done.
% --- GPS reference time = 960541454.5000 , GPS data mid time = 960541454.5000
2014-09-03 22:56:59.7140 (79813) [normal]: dFreqStack = 5.351900e-07, df1dot = 8.228100e-12, df2dot = 1.932800e-18
% --- Setup, N = 44, T = 503831 s, Tobs = 22160773 s, gammaRefine = 90, gamma2Refine = 60
2014-09-03 22:56:59.7148 (79813) [normal]: INFO: No checkpoint checkpoint.cpt found - starting from scratch
% --- Cpt:0, total:11, sky:1/1, f1dot:1/11
2014-09-03 22:56:59.7149 (79813) [normal]: Coarse grid sky:1/1 f1dot:1/11 f2dot:1/12
% --- CG:4157252 FG:93425 f1dotmin_fg:-6.589369911806e-08 df1dot_fg:9.041868131868e-14 f2dotmin_fg:1.584262295082e-20 df2dot_fg:3.168524590164e-20

And the CPU is fully used, so there is a long "reading input data" step in the app, OK :)

Sorry for the useless post!
