
libccurl, win64, Intel Core i5 Graphics issue

Open mettenian opened this issue 7 years ago • 10 comments

Using libccurl to process a tx takes significantly more time on Intel Core i5 Graphics than without GPU support. I checked libccurl with an AMD card, and it is very fast; CPU only is fast too, but the integrated Intel i5 graphics is very slow (GPU usage is >90%, monitored with SIV64X).

mettenian avatar Jun 14 '17 07:06 mettenian

Processing a simple spam tx (mwm 15) took <4 sec on CPU, ~1 sec on AMD RX 480, but >20 minutes(!) on Intel Core i5 Graphics.

mettenian avatar Jun 14 '17 08:06 mettenian

Can you share the output of clinfo so that I have an idea of what kind of architecture is there?

paulhandy avatar Jun 14 '17 13:06 paulhandy

Number of platforms:				 1
  Platform Profile:				 FULL_PROFILE
  Platform Version:				 OpenCL 2.0 
  Platform Name:				 Intel(R) OpenCL
  Platform Vendor:				 Intel(R) Corporation
  Platform Extensions:				 cl_intel_dx9_media_sharing cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_khr_gl_sharing cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_spir


  Platform Name:				 Intel(R) OpenCL
Number of devices:				 2
  Device Type:					 CL_DEVICE_TYPE_GPU
  Device ID:					 32902
  Max compute units:				 24
  Max work items dimensions:			 3
    Max work items[0]:				 256
    Max work items[1]:				 256
    Max work items[2]:				 256
  Max work group size:				 256
  Preferred vector width char:			 1
  Preferred vector width short:			 1
  Preferred vector width int:			 1
  Preferred vector width long:			 1
  Preferred vector width float:			 1
  Preferred vector width double:		 0
  Max clock frequency:				 1000Mhz
  Address bits:					 14757395255531667488
  Max memory allocation:			 390280806
  Image support:				 Yes
  Max number of images read arguments:		 128
  Max number of images write arguments:		 128
  Max image 2D width:				 16384
  Max image 2D height:				 16384
  Max image 3D width:				 16384
  Max image 3D height:				 16384
  Max image 3D depth:				 2048
  Max samplers within kernel:			 16
  Max size of kernel argument:			 1024
  Alignment (bits) of base address:		 1024
  Minimum alignment (bytes) for any datatype:	 128
  Single precision floating point capability
    Denorms:					 Yes
    Quiet NaNs:					 Yes
    Round to nearest even:			 Yes
    Round to zero:				 Yes
    Round to +ve and infinity:			 Yes
    IEEE754-2008 fused multiply-add:		 No
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 524288
  Global memory size:				 1561123226
  Constant buffer size:				 65536
  Max number of constant args:			 8
  Local memory type:				 Scratchpad
  Local memory size:				 65536
  Error correction support:			 0
  Profiling timer resolution:			 83
  Device endianess:				 Little
  Available:					 Yes
  Compiler available:				 Yes
  Execution capabilities:				 
    Execute OpenCL kernels:			 Yes
    Execute native function:			 No
  Queue properties:				 
    Out-of-Order:				 Yes
    Profiling :					 Yes
  Platform ID:					 0051E508
  Name:						 Intel(R) HD Graphics 520
  Vendor:					 Intel(R) Corporation
  Driver version:				 20.19.15.4463
  Profile:					 FULL_PROFILE
  Version:					 OpenCL 2.0 
  Extensions:					 cl_intel_accelerator cl_intel_advanced_motion_estimation cl_intel_ctz cl_intel_d3d11_nv12_media_sharing cl_intel_dx9_media_sharing cl_intel_motion_estimation cl_intel_simultaneous_sharing cl_intel_subgroups cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_depth_images cl_khr_dx9_media_sharing cl_khr_fp16 cl_khr_gl_depth_images cl_khr_gl_event cl_khr_gl_msaa_sharing cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_gl_sharing cl_khr_icd cl_khr_image2d_from_buffer cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_khr_spir 


  Device Type:					 CL_DEVICE_TYPE_CPU
  Device ID:					 32902
  Max compute units:				 4
  Max work items dimensions:			 3
    Max work items[0]:				 8192
    Max work items[1]:				 8192
    Max work items[2]:				 8192
  Max work group size:				 8192
  Preferred vector width char:			 1
  Preferred vector width short:			 1
  Preferred vector width int:			 1
  Preferred vector width long:			 1
  Preferred vector width float:			 1
  Preferred vector width double:		 1
  Max clock frequency:				 2400Mhz
  Address bits:					 14757395255531667488
  Max memory allocation:			 536838144
  Image support:				 Yes
  Max number of images read arguments:		 480
  Max number of images write arguments:		 480
  Max image 2D width:				 16384
  Max image 2D height:				 16384
  Max image 3D width:				 2048
  Max image 3D height:				 2048
  Max image 3D depth:				 2048
  Max samplers within kernel:			 480
  Max size of kernel argument:			 3840
  Alignment (bits) of base address:		 1024
  Minimum alignment (bytes) for any datatype:	 128
  Single precision floating point capability
    Denorms:					 Yes
    Quiet NaNs:					 Yes
    Round to nearest even:			 Yes
    Round to zero:				 No
    Round to +ve and infinity:			 No
    IEEE754-2008 fused multiply-add:		 No
  Cache type:					 Read/Write
  Cache line size:				 64
  Cache size:					 262144
  Global memory size:				 2147352576
  Constant buffer size:				 131072
  Max number of constant args:			 480
  Local memory type:				 Global
  Local memory size:				 32768
  Error correction support:			 0
  Profiling timer resolution:			 410
  Device endianess:				 Little
  Available:					 Yes
  Compiler available:				 Yes
  Execution capabilities:				 
    Execute OpenCL kernels:			 Yes
    Execute native function:			 Yes
  Queue properties:				 
    Out-of-Order:				 Yes
    Profiling :					 Yes
  Platform ID:					 0051E508
  Name:						 Intel(R) Core(TM) i5-6300U CPU @ 2.40GHz
  Vendor:					 Intel(R) Corporation
  Driver version:				 5.2.0.10094
  Profile:					 FULL_PROFILE
  Version:					 OpenCL 2.0 (Build 10094)
  Extensions:					 cl_khr_icd cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_depth_images cl_khr_3d_image_writes cl_intel_exec_by_local_thread cl_khr_spir cl_khr_dx9_media_sharing cl_intel_dx9_media_sharing cl_khr_d3d11_sharing cl_khr_gl_sharing cl_khr_fp64 cl_khr_image2d_from_buffer 



mettenian avatar Jun 15 '17 03:06 mettenian
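
A minimal sketch, assuming clinfo output in the same "Key: value" layout as the dump above, for pulling out the fields most relevant to this issue (device type, name, compute units, clock, driver version). The names `parse_clinfo_devices` and `summarize` are hypothetical helpers, not part of clinfo or libccurl, and `SAMPLE` is a hand-trimmed excerpt of the dump rather than full clinfo output.

```python
def parse_clinfo_devices(text):
    """Split clinfo-style text into one dict of fields per device.

    Assumption: each device section starts with a "Device Type" line,
    as in the dump above. Lines before the first device are ignored.
    """
    devices = []
    current = None
    for line in text.splitlines():
        # Split on the first colon only, so values may contain colons.
        key, sep, value = line.partition(":")
        if not sep:
            continue
        key, value = key.strip(), value.strip()
        if key == "Device Type":
            current = {}
            devices.append(current)
        if current is not None and value:
            current[key] = value
    return devices


def summarize(devices):
    """Return (type, name, compute units, clock) for each device."""
    return [
        (
            d.get("Device Type", "?"),
            d.get("Name", "?"),
            int(d.get("Max compute units", 0)),
            d.get("Max clock frequency", "?"),
        )
        for d in devices
    ]


# Hand-trimmed excerpt of the dump above (illustrative input only).
SAMPLE = """\
Number of devices:                 2
  Device Type:                     CL_DEVICE_TYPE_GPU
  Max compute units:               24
  Max clock frequency:             1000Mhz
  Name:                            Intel(R) HD Graphics 520
  Driver version:                  20.19.15.4463
  Device Type:                     CL_DEVICE_TYPE_CPU
  Max compute units:               4
  Max clock frequency:             2400Mhz
  Name:                            Intel(R) Core(TM) i5-6300U CPU @ 2.40GHz
  Driver version:                  5.2.0.10094
"""

for dev in parse_clinfo_devices(SAMPLE):
    print(dev["Device Type"], "|", dev["Name"], "|", dev["Driver version"])
```

Printing the driver version next to each device makes it easier to compare reports from different machines with the same GPU.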

https://github.com/iotaledger/wallet/issues/116

On my Surface Pro 4 (i5) attaching or sending never finishes.

ghost avatar Jun 21 '17 13:06 ghost

iotaledger/wallet#116 => I was not able to attach an address to the tangle. I have a Dell XPS 13 9350 with Intel HD Graphics 520.

Thanks to rage1337 on iotatangle.slack.com I could resolve my problems by updating to the newest Intel driver.

Now everything works fine with the IOTA light wallet.

roarwrecker avatar Jun 23 '17 10:06 roarwrecker

Unfortunately I can't test that on my Surface since it won't let me install generic graphics drivers.

ghost avatar Jun 23 '17 10:06 ghost

But I also have the Intel HD Graphics 520

ghost avatar Jun 23 '17 10:06 ghost

https://communities.intel.com/thread/115377 (Surface Pro 4). Worked for me.

ghost avatar Jul 05 '17 14:07 ghost

Same issue here with Intel HD Graphics 530 (1536 MB), 2.9 GHz Intel Core i7, on OS X Sierra 10.13.4. How can I switch to CPU?

mehranshakeri avatar May 04 '18 10:05 mehranshakeri

@mettenian How did you use only CPU?

mehranshakeri avatar May 07 '18 09:05 mehranshakeri