VCH888 Posted August 26, 2009 Share Posted August 26, 2009 It didn't work on my ATI HD4870 after try to compute GPU. can move only mouse & got to press a reset button. Also, see http://netkas.org/?p=164 Link to comment Share on other sites More sharing options...
RaZZe Posted August 26, 2009 Share Posted August 26, 2009 Link to comment Share on other sites More sharing options...
mitch_de Posted August 26, 2009 Author Share Posted August 26, 2009 It didn't work on my ATI HD4870 after try to compute GPU. can move only mouse & got to press a reset button. Also, see http://netkas.org/?p=164 Have you used the lastest V020 - which added lost of error handling code ? Please look / write down reported errors / error messages. Link to comment Share on other sites More sharing options...
music-anderson Posted August 26, 2009 Share Posted August 26, 2009 My test ***** Last login: Wed Aug 26 16:57:16 on console /Users/peterdavidanderson/Desktop/OpenCLBench_as_terminal_tool/OpenCL2_Bench_V020 ; exit; noname:~ peterdavidanderson$ /Users/peterdavidanderson/Desktop/OpenCLBench_as_terminal_tool/OpenCL2_Bench_V020 ; exit; ........................................................... .................. OpenCL Bench V 0.15 by mitch ........... ...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ...... ....... ....... ........My test code (simple adds) is cpu friedly.......... .more gpu friedly+complexer code (raytracing/video encod.). ... may give much more speed advantage - at least on C2Ds . ........................................................... ....CL_DEVICE_NAME: Intel® Xeon® CPU 5150 @ 2.66GHz ..... CL_DEVICE_VENDOR: Intel CL_DEVICE_MAX_CLOCK_FREQUENCY: 2660 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 4 Now computing - please be patient.... time used: 16.817684 Number of elements computed: 2097152 ....CL_DEVICE_NAME: GeForce 8800 GT ..... CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_MAX_CLOCK_FREQUENCY: 1500 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 112 Now computing - please be patient.... time used: 2.608059 Number of elements computed: 2097152 Now checking if results are valid - please be patient.... Validate results test passed - GPU=CPU logout [Prozess beendet] Link to comment Share on other sites More sharing options...
nofearl Posted August 26, 2009 Share Posted August 26, 2009 cpu + 2xgpu ........................................................... .................. OpenCL Bench V 0.15 by mitch ........... ...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ...... ....... ....... ........My test code (simple adds) is cpu friedly.......... .more gpu friedly+complexer code (raytracing/video encod.). ... may give much more speed advantage - at least on C2Ds . ........................................................... ....CL_DEVICE_NAME: Intel® Core2 Quad CPU @ 2.40GHz ..... CL_DEVICE_VENDOR: Intel CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 4 Now computing - please be patient.... time used: 28.956915 Number of elements computed: 2097152 ....CL_DEVICE_NAME: GeForce 9600 GT ..... CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_MAX_CLOCK_FREQUENCY: 1750 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 64 Now computing - please be patient.... time used: 2.694709 Number of elements computed: 2097152 ....CL_DEVICE_NAME: GeForce 9600 GT ..... CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_MAX_CLOCK_FREQUENCY: 1750 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 64 Now computing - please be patient.... time used: 2.797374 Number of elements computed: 2097152 Now checking if results are valid - please be patient.... Validate results test passed - GPU=CPU logout Link to comment Share on other sites More sharing options...
mitch_de Posted August 26, 2009 Author Share Posted August 26, 2009 THANKS ! Question: Do you have 2 indentical GPUs´? Also (will not change the times) always use the lastest BUILD, ist V020. Has much error handling code for ATI users (NVIDIAs seems to run without errors so far ) Link to comment Share on other sites More sharing options...
cmf Posted August 26, 2009 Share Posted August 26, 2009 mbp/late 2008 result: ....CL_DEVICE_NAME: Intel(R) Core(TM)2 Duo CPU P8600 @ 2.40GHz ..... CL_DEVICE_VENDOR: Intel CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 2 Now computing - please be patient.... time used: 56.190952 Number of elements computed: 2097152 ....CL_DEVICE_NAME: GeForce 9600M GT ..... CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 32 Now computing - please be patient.... time used: 10.169043 Number of elements computed: 2097152 ....CL_DEVICE_NAME: GeForce 9600M GT ..... CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 32 Now computing - please be patient.... time used: 10.120525 Number of elements computed: 2097152 Now checking if results are valid - please be patient.... :) Validate results test passed - GPU=CPU :) seems to be a bug, you are testing the same gpu twice (or just printing out the info of the first gpu device twice?). the second gpu should be a 9400M. This program seems to multi-thread very well according to SL's CPU Usage monitor. Thanks for that detail ! I think the 10.6 changes "in the deep" will exspecially use much Cores better than 10.5 - even without special source coding changes. But recompiling source with newest Xcode & using 10.6 dev framework needed , i think. thats an opencl feature, or the purpose of opencl scale a small program/kernel well to many cores - be it cpu or gpu. Link to comment Share on other sites More sharing options...
morfy Posted August 26, 2009 Share Posted August 26, 2009 Upgrade from Open CL Bench V 020. ........................................................... .................. OpenCL Bench V 0.15 by mitch ........... ...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ...... ....... ....... ........My test code (simple adds) is cpu friedly.......... .more gpu friedly+complexer code (raytracing/video encod.). ... may give much more speed advantage - at least on C2Ds . ........................................................... ....CL_DEVICE_NAME: Pentium(R) Dual-Core CPU E5200 @ 2.50GHz ..... CL_DEVICE_VENDOR: Intel CL_DEVICE_MAX_CLOCK_FREQUENCY: 3129 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 2 Now computing - please be patient.... time used: 28.777699 Number of elements computed: 2097152 ....CL_DEVICE_NAME: GeForce 8800 GT ..... CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_MAX_CLOCK_FREQUENCY: 1600 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 112 Now computing - please be patient.... time used: 2.618950 Number of elements computed: 2097152 Now checking if results are valid - please be patient.... :) Validate results test passed - GPU=CPU :) Link to comment Share on other sites More sharing options...
netkas Posted August 26, 2009 Share Posted August 26, 2009 ./OpenCL2_Bench_V020 ........................................................... .................. OpenCL Bench V 0.15 by mitch ........... ...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ...... ....... ....... ........My test code (simple adds) is cpu friedly.......... .more gpu friedly+complexer code (raytracing/video encod.). ... may give much more speed advantage - at least on C2Ds . ........................................................... ....CL_DEVICE_NAME: Intel� Coreâ„¢2 Quad CPU Q9450 @ 2.66GHz ..... CL_DEVICE_VENDOR: Intel CL_DEVICE_MAX_CLOCK_FREQUENCY: 3072 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 4 Now computing - please be patient.... time used: 14.658403 Number of elements computed: 2097152 ....CL_DEVICE_NAME: Radeon HD 4870 ..... CL_DEVICE_VENDOR: AMD CL_DEVICE_MAX_CLOCK_FREQUENCY: 750 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 4 Now computing - please be patient.... ///here gui freezes immedeatly time used: 27.399342 Number of elements computed: 2097152 Now checking if results are valid - please be patient.... Validate results test passed - GPU=CPU x2000.kexts dumps ** GPU Debug Info ** to dmesg maybe its too much loops and so too much memory used by arrays, decreased loops number to 1000(edited source inside bin), now no crash. Link to comment Share on other sites More sharing options...
mitch_de Posted August 26, 2009 Author Share Posted August 26, 2009 mbp/late 2008 result: ....CL_DEVICE_NAME: Intel(R) Core(TM)2 Duo CPU P8600 @ 2.40GHz ..... CL_DEVICE_VENDOR: Intel CL_DEVICE_MAX_CLOCK_FREQUENCY: 2400 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 2 Now computing - please be patient.... time used: 56.190952 Number of elements computed: 2097152 ....CL_DEVICE_NAME: GeForce 9600M GT ..... CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 32 Now computing - please be patient.... time used: 10.169043 Number of elements computed: 2097152 ....CL_DEVICE_NAME: GeForce 9600M GT ..... CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_MAX_CLOCK_FREQUENCY: 1250 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 32 Now computing - please be patient.... time used: 10.120525 Number of elements computed: 2097152 Now checking if results are valid - please be patient.... :) Validate results test passed - GPU=CPU :) seems to be a bug, you are testing the same gpu twice (or just printing out the info of the first gpu device twice?). the second gpu should be a 9400M. Thanks. I will fix that bug soon. A workaround for that bug: Please post result again after you disabled the 9600M GT ( so 9400M is the alone GPU). The 10 sec results is for the 9600MGT - 9400M will run slower. Link to comment Share on other sites More sharing options...
tommix1968 Posted August 26, 2009 Share Posted August 26, 2009 This is my result: ........................................................... .................. OpenCL Bench V 0.15 by mitch ........... ...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ...... ....... ....... ........My test code (simple adds) is cpu friedly.......... .more gpu friedly+complexer code (raytracing/video encod.). ... may give much more speed advantage - at least on C2Ds . ........................................................... ....CL_DEVICE_NAME: Intel(R) Core(TM)2 Quad CPU Q9550 @ 2.83GHz ..... CL_DEVICE_VENDOR: Intel CL_DEVICE_MAX_CLOCK_FREQUENCY: 2836 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 4 Now computing - please be patient.... time used: 15.836717 Number of elements computed: 2097152 ....CL_DEVICE_NAME: GeForce 9600 GT ..... CL_DEVICE_VENDOR: NVIDIA CL_DEVICE_MAX_CLOCK_FREQUENCY: 1625 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 64 Now computing - please be patient.... time used: 2.700367 Number of elements computed: 2097152 Now checking if results are valid - please be patient.... :) Validate results test passed - GPU=CPU :) Link to comment Share on other sites More sharing options...
mitch_de Posted August 26, 2009 Author Share Posted August 26, 2009 ./OpenCL2_Bench_V020........................................................... .................. OpenCL Bench V 0.15 by mitch ........... ...... C2D 3GHz = 30 sec vs Nvidia 9600GT = 3.10 sec ...... ....... ....... ........My test code (simple adds) is cpu friedly.......... .more gpu friedly+complexer code (raytracing/video encod.). ... may give much more speed advantage - at least on C2Ds . ........................................................... ....CL_DEVICE_NAME: Intel� Core�„�2 Quad CPU Q9450 @ 2.66GHz ..... CL_DEVICE_VENDOR: Intel CL_DEVICE_MAX_CLOCK_FREQUENCY: 3072 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 4 Now computing - please be patient.... time used: 14.658403 Number of elements computed: 2097152 ....CL_DEVICE_NAME: Radeon HD 4870 ..... CL_DEVICE_VENDOR: AMD CL_DEVICE_MAX_CLOCK_FREQUENCY: 750 MHz CL_DEVICE_MAX_COMPUTE_UNITS: 4 Now computing - please be patient.... ///here gui freezes immedeatly time used: 27.399342 Number of elements computed: 2097152 Now checking if results are valid - please be patient.... Validate results test passed - GPU=CPU x2000.kexts dumps ** GPU Debug Info ** to dmesg maybe its too much loops and so too much memory used by arrays, decreased loops number to 1000(edited source inside bin), now no crash. Thanks, i also thougt about this memory problem. Will compile an ATI_debug version soon and post it below other DL Link. Sure, if i will decrease the loop down from 5000 to 1000, the time results of very fast GPUs like GTX 285 will also decrease from 0.8 down to 0,0xy I am working on an other solution, which does more complex work but not in such an huge loop. Link to comment Share on other sites More sharing options...
netkas Posted August 26, 2009 Share Posted August 26, 2009 smth like for(i=0;i<5;i++) for(loop.... should be enough to add just one line (and one for int i;) Link to comment Share on other sites More sharing options...
vidkidd Posted August 26, 2009 Share Posted August 26, 2009 With Version: OpenCL2_Bench_V020 Application still hangs on 4870, MacPro 1,1 3.0ghz CPU is calculated at 15 seconds. Application crashes at GPU Please Wait. Thx, Vidkidd Link to comment Share on other sites More sharing options...
nofearl Posted August 26, 2009 Share Posted August 26, 2009 THANKS !Question: Do you have 2 indentical GPUs´? Also (will not change the times) always use the lastest BUILD, ist V020. Has much error handling code for ATI users (NVIDIAs seems to run without errors so far ) yep 2 palit nvidia 9600 gt on 2 pciex x16 ports Link to comment Share on other sites More sharing options...
El.Pilote Posted August 27, 2009 Share Posted August 27, 2009 Hi all here is mine : Link to comment Share on other sites More sharing options...
grue Posted August 27, 2009 Share Posted August 27, 2009 Doesn't seem to test my setup correctly. CL_DEVICE_NAME: Intel® Xeon® CPU X5365 @ 3.00GHz CL_DEVICE_VENDOR: Intel Now computing - please be patient.... time used: 7.710562 Number of elements computed: 2097152 CL_DEVICE_NAME: GeForce 8800 GT CL_DEVICE_VENDOR: NVIDIA Now computing - please be patient.... time used: 2.492461 Number of elements computed: 2097152 CL_DEVICE_NAME: GeForce 8800 GT CL_DEVICE_VENDOR: NVIDIA Now computing - please be patient.... time used: 2.489143 Number of elements computed: 2097152 Now checking if results are valid - please be patient.... Validate results test passed - GPU=CPU I have an 8800GT in Slot 3 working as the helper card to a GTX260. Looks like it's testing the 8800GT twice. Link to comment Share on other sites More sharing options...
mitch_de Posted August 27, 2009 Author Share Posted August 27, 2009 Doesn't seem to test my setup correctly. I have an 8800GT in Slot 3 working as the helper card to a GTX260. Looks like it's testing the 8800GT twice. NEW VERSION on the road - DL V025, lots of changes (and hopefully fixed ATI + > 2 gpu tests) Link to comment Share on other sites More sharing options...
grue Posted August 27, 2009 Share Posted August 27, 2009 BINGO Number of OpenCL devices found: 3 OpenCL Device # 0 = GeForce 8800 GT Device 0 is an: GPU with max. 1500 MHz and 112 units/cores Now computing - please be patient.... time used: 0.683 seconds OpenCL Device # 1 = GeForce GTX 260 Device 1 is an: GPU with max. 1400 MHz and 216 units/cores Now computing - please be patient.... time used: 0.365 seconds OpenCL Device # 2 = Intel® Xeon® CPU X5365 @ 3.00GHz Device 2 is an: CPU with max. 3000 MHz and 8 units/cores Now computing - please be patient.... time used: 3.094 seconds Now checking if results are valid - please be patient.... Validate test passed - GPU results=CPU results Link to comment Share on other sites More sharing options...
mitch_de Posted August 27, 2009 Author Share Posted August 27, 2009 BINGO Number of OpenCL devices found: 3 OpenCL Device # 0 = GeForce 8800 GT Device 0 is an: GPU with max. 1500 MHz and 112 units/cores Now computing - please be patient.... time used: 0.683 seconds OpenCL Device # 1 = GeForce GTX 260 Device 1 is an: GPU with max. 1400 MHz and 216 units/cores Now computing - please be patient.... time used: 0.365 seconds OpenCL Device # 2 = Intel® Xeon® CPU X5365 @ 3.00GHz Device 2 is an: CPU with max. 3000 MHz and 8 units/cores Now computing - please be patient.... time used: 3.094 seconds Now checking if results are valid - please be patient.... Validate test passed - GPU results=CPU results YEAH ! Hope also ATI users will not get an freezed system anymore - their GPUs got overloaded with old code - NVIDIAs not Remember : the V025 time used results of V025 cant be compared 1:1 with the old version ones. Thats because of code changes for ATI users Link to comment Share on other sites More sharing options...
proengin Posted August 27, 2009 Share Posted August 27, 2009 Here are my scores from V0.25 script: ........................................................... .................. OpenCL Bench V 0.25 by mitch ........... ...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ...... ... time results are not comparable to older version! ..... ........................................................... Number of OpenCL devices found: 2 OpenCL Device # 0 = GeForce GTX 285 Device 0 is an: GPU with max. 1584 MHz and 240 units/cores Now computing - please be patient.... time used: 0.231 seconds OpenCL Device # 1 = Intel® Core i7 CPU 920 @ 2.67GHz Device 1 is an: CPU with max. 4280 MHz and 8 units/cores Now computing - please be patient.... time used: 1.296 seconds Now checking if results are valid - please be patient.... Validate test passed - GPU results=CPU results Link to comment Share on other sites More sharing options...
gzfelix Posted August 27, 2009 Share Posted August 27, 2009 ........................................................... .................. OpenCL Bench V 0.25 by mitch ........... ...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ...... ... time results are not comparable to older version! ..... ........................................................... Number of OpenCL devices found: 3 OpenCL Device # 0 = Radeon HD 4870 Device 0 is an: GPU with max. 750 MHz and 4 units/cores Now computing - please be patient.... time used: 4.126 seconds OpenCL Device # 1 = GeForce GT 120 Device 1 is an: GPU with max. 1400 MHz and 32 units/cores Error: clBuildProgram for device # 1 ERROR NUMBER = -11 Link to comment Share on other sites More sharing options...
vidkidd Posted August 27, 2009 Share Posted August 27, 2009 This just shows how SAD the ATI Drivers currently are!!! OUCH!!!! _tool-1/OpenCL2_Bench_V025 ; exit; ........................................................... .................. OpenCL Bench V 0.25 by mitch ........... ...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ...... ... time results are not comparable to older version! ..... ........................................................... Number of OpenCL devices found: 2 OpenCL Device # 0 = Radeon HD 4870 Device 0 is an: GPU with max. 750 MHz and 4 units/cores Now computing - please be patient.... time used: 4.065 seconds OpenCL Device # 1 = Intel® Xeon® CPU 5160 @ 3.00GHz Device 1 is an: CPU with max. 3000 MHz and 4 units/cores Now computing - please be patient.... time used: 6.079 seconds Now checking if results are valid - please be patient.... Validate test passed - GPU results=CPU results logout [Process completed] Link to comment Share on other sites More sharing options...
real3x Posted August 27, 2009 Share Posted August 27, 2009 tool-1/OpenCL2_Bench_V025; exit; ........................................................... .................. OpenCL Bench V 0.25 by mitch ........... ...... C2D 3GHz = 12 sec vs Nvidia 9600GT = 0,93 sec ...... ... time results are not comparable to older version! ..... ........................................................... Number of OpenCL devices found: 2 OpenCL Device # 0 = GeForce GTX 260 Device 0 is an: GPU with max. 1242 MHz and 192 units/cores Now computing - please be patient.... time used: 0.357 seconds OpenCL Device # 1 = Intel(R) Core(TM)2 Duo CPU E8400 @ 3.00GHz Device 1 is an: CPU with max. 3600 MHz and 2 units/cores Now computing - please be patient.... time used: 10.433 seconds Now checking if results are valid - please be patient.... :) Validate test passed - GPU results=CPU results :) logout Link to comment Share on other sites More sharing options...
moondark Posted August 27, 2009 Share Posted August 27, 2009 Hello mitch, can you make available the source code? Thanks!! My results: Number of OpenCL devices found: 3 OpenCL Device # 0 = GeForce 9600M GT Device 0 is an: GPU with max. 1250 MHz and 32 units/cores Now computing - please be patient.... time used: 2.798 seconds OpenCL Device # 1 = GeForce 9400M Device 1 is an: GPU with max. 1100 MHz and 16 units/cores Now computing - please be patient.... time used: 9.549 seconds OpenCL Device # 2 = Intel® Core2 Duo CPU P8600 @ 2.40GHz Device 2 is an: CPU with max. 2400 MHz and 2 units/cores Now computing - please be patient.... time used: 15.800 seconds Now checking if results are valid - please be patient.... Validate test passed - GPU results=CPU results Link to comment Share on other sites More sharing options...
Recommended Posts