New Optimised Executables Links - a READ ONLY thread

log in

Advanced search

Message boards : Cruncher's Corner : New Optimised Executables Links - a READ ONLY thread

1 · 2 · Next
Author Message
Profile Mike Hewson
Volunteer moderator
Avatar
Send message
Joined: 1 Dec 05
Posts: 4612
Credit: 39,741,295
RAC: 15,560
Message 28095 - Posted: 14 Mar 2006, 21:20:04 UTC
Last modified: 14 Mar 2006, 22:09:50 UTC

Only akosf to post here, please, if he has any other links to download the new gear....

For any discussion, please start a new thread or add to an existing one. :-) :-)

The following was provided by akosf. I have tested by successfully downloading from the link given. It points to a zip file ( 1,044,480 bytes packed down to 412,787 bytes ) containing his optimised version of 'albert_4.37_windows_intel86.exe'.



C37 - optimised x86 compatible windows executable

Install:

1, download file
2, stop BOINC
3, unzip file to ../BOINC/projects/einstein.phys.uwm.edu/ directory
4, restart BOINC


____________
"I have made this letter longer than usual, because I lack the time to make it short." - Blaise Pascal
Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 28394 - Posted: 20 Mar 2006, 5:15:21 UTC

S38 - optimised x86-SSE compatible executable

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 28547 - Posted: 22 Mar 2006, 5:11:27 UTC

S39 - optimised x86-SSE compatible executable (for windows)

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 28668 - Posted: 23 Mar 2006, 3:52:04 UTC
Last modified: 23 Mar 2006, 4:16:55 UTC

S39L - optimised SSE compatible windows executable

Change: S39L needs less L1 cache (~11kB) than S39 (~33kB) for its important datas.

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 28909 - Posted: 26 Mar 2006, 6:49:27 UTC

C40 - optimised x86 compatible windows executable

Changes: sin/cos interpolator, AGI optimisations
Speedup: ~10% faster than C37

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 29080 - Posted: 28 Mar 2006, 19:34:45 UTC

D40 - 3DNow! optimised windows executable

Comment: 3DNow! supported on these cpus:
K6-2,K6-III,Athlon,Duron,AthlonXP,Sempron,Athlon64,C3,Samuel-II,Ezra,Nehemiah,Winchip2

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 29529 - Posted: 3 Apr 2006, 3:46:53 UTC
Last modified: 3 Apr 2006, 4:14:51 UTC

S40 - SSE optimised windows executable

Changes: better prefetching, AGI optimisations

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 29534 - Posted: 3 Apr 2006, 8:41:10 UTC

S40 is not perfect! Don't use it!
Sometimes gives back an "access violation" on one of my Durons.
(I will check the code and the PC.)

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 30006 - Posted: 8 Apr 2006, 19:49:03 UTC

S40.03 - SSE optimised windows executable

Comment: S40 was good, just one of my Durons is in poor health.

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 30445 - Posted: 16 Apr 2006, 10:30:07 UTC

S40.04 - SSE optimised windows executable

Comment: bugfixed version of S40.03 (zero credit problem)

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 30632 - Posted: 20 Apr 2006, 16:10:07 UTC

S40.12 - SSE optimised windows executable

- double size sin/cos look-up table ( 2 more valuable bits but worse speed )
- 4 cache-lines are freed up ;-)

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 30995 - Posted: 25 Apr 2006, 4:10:25 UTC

C41.00 - 386 compatible windows executable

- some tricks from S40.xx
- 2kB size sin/cos look-up table

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 31383 - Posted: 28 Apr 2006, 15:11:55 UTC

C41.01 - 386 compatible windows executable

- better look-up table handling

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 31532 - Posted: 30 Apr 2006, 8:25:38 UTC

D41.12 - 3DNow! compatible windows executable

- 3DNow! based trigonometry (2kB look-up table)
- reorganized address generation
- newton-raphson iteration
- mathematical reductions

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 31695 - Posted: 1 May 2006, 13:05:13 UTC

D41.13 - 3DNow! compatible windows executable

- increased accuracy
- FPU based trigonometry with 4kB data
- common denominator for 4 pairs

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 31731 - Posted: 1 May 2006, 16:27:02 UTC
Last modified: 1 May 2006, 16:39:34 UTC

D41.13 is refreshed (status bit bug is fixed).

Why did you give negative markings for these messages?

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 31732 - Posted: 1 May 2006, 16:30:22 UTC

S41.06 - SSE compatible windows executable

- FPU based trigonomety with 4kB data
- common denominator for 4 pairs
- better FPU/SSE overlapping

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 31767 - Posted: 1 May 2006, 19:42:06 UTC

U41.01 - SSE3 compatible windows executable

- SSE3 truncation

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 31915 - Posted: 2 May 2006, 9:52:29 UTC
Last modified: 2 May 2006, 9:57:49 UTC

Hello! Don't use U41.01 because it doesn't work well. Sorry...

edit: I will look after and correct it.

Akos Fekete
Volunteer developer
Avatar
Send message
Joined: 13 Nov 05
Posts: 562
Credit: 4,410,312
RAC: 3
Message 32152 - Posted: 3 May 2006, 18:30:16 UTC

C41.02 - 386 compatible windows executable (386,486,Pentium,Pentium2,K5,K6,...)
D41.14 - 3DNow! compatible windows executable (K6-2,K6-3,Spitfire,Thunderbird,Cyrix3,Samuel-II,Ezra,...)
S41.07 - SSE compatible windows executable (Pentium-III,Pentium4,AthlonXP,Duron(1GHz+),...)
U41.04 - SSE3 compatible windows executable (AMD: from E3 stepping, Intel: Prescott? and newer)

- special rounding method (~2% speedup)

comments:
1,use at your own risk
2,use the observation threads for bug report and for your questions

1 · 2 · Next

Message boards : Cruncher's Corner : New Optimised Executables Links - a READ ONLY thread


Home · Your account · Message boards

This material is based upon work supported by the National Science Foundation (NSF) under Grants PHY-1104902, PHY-1104617 and PHY-1105572 and by the Max Planck Gesellschaft (MPG). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the investigators and do not necessarily reflect the views of the NSF or the MPG.

Copyright © 2016 Bruce Allen