搜索 - 腾讯云开发者社区-腾讯云

文章/答案/技术大牛

发布

来自专栏GPUS开发者
DAY34:阅读算术指令
Single-Precision Floating-Point Division __fdividef(x, y) (see Intrinsic Functions) provides faster single-precision Single-Precision Floating-Point Reciprocal Square Root To preserve IEEE-754 semantics the compiler can Single-Precision Floating-Point Square Root Single-precision floating-point square root is implemented At present, 28 bytes of local memory are used by single-precision functions, and 44 bytes are used by This last case can be avoided by using single-precision floating-point constants, defined with an f suffix
78130发布于 2018-06-22
来自专栏程序员IT圈
百度跨平台 AI 推理加速引擎 -- Anakin
图2 Public model on NV 测试平台Nvidia-P4信息： GPU Architecture NVIDIA Pascal™ Single-Precision Performance 图5 MI8 and P4 on VGG16 model 测试平台信息： MI8: AMD Radeon Instinct MI8 single-Precision Performance 8.192 TFLOPS GPU Memory 4 GB P4: GPU Architecture NVIDIA Pascal™ Single-Precision Performance 5.5 TFLOPS GPU
1.6K40发布于 2018-08-16
来自专栏GPUS开发者
DAY15：阅读CUDA C runtime之纹理内存
· The type of a texel, which is restricted to the basic integer and single-precision floating-point defined in char, short, int, long, longlong, float, double that are derived from the basic integer and single-precision 可能就慢点，所以说CUDA Array比普通的显存有可能有性能优势 The type of a texel, which is restricted to the basic integer and single-precision defined in char, short, int, long, longlong, float, double that are derived from the basic integer and single-precision
1K30发布于 2018-06-25
来自专栏光城(guangcity)
C++那些事之高性能SIMD
诸如： _mm512_set1_ps Broadcast single-precision (32-bit) floating-point value a to all elements of dst. 于是，找到了下面这个表格： Abbreviation Full Name C/C++ Equivalent ps packed single-precision float ph packed half-precision
1.6K30编辑于 2023-09-02
来自专栏人工智能LeadAI
Tesla P100
NVIDIA TESLA P100 加速器性能规格 Double-Precision Performance 4.7 TeraFLOPS Single-Precision Performance 9.3
90200编辑于 2023-03-28
来自专栏计算机视觉理论及其实现
tf.dtypes
tf.float16: 16-bit half-precision floating-point. tf.float32: 32-bit single-precision floating-point. double-precision floating-point. tf.bfloat16: 16-bit truncated floating-point. tf.complex64: 64-bit single-precision
1.2K10编辑于 2022-09-28
来自专栏全栈程序员必看
跟踪算法(一)光流法跟踪
The point coordinates must be single-precision floating-point numbers. nextPts – Output vector of 2D points (with single-precision floating-point coordinates) containing
1.6K20编辑于 2022-09-15
来自专栏全栈程序员必看
Opencv学习笔记（九）光流法
The point coordinates must be single-precision floating-point numbers. nextPts – Output vector of 2D points (with single-precision floating-point coordinates) containing the calculated new positions of
61820编辑于 2022-09-15
来自专栏JNing的专栏
opencv: 颜色空间转换(cv2.cvtColor) 探究(图示+源码)
dst, code) → None Parameters: src – input image: 8-bit unsigned, 16-bit unsigned ( CV_16UC… ), or single-precision
3.7K20发布于 2018-09-28
来自专栏云服务器活动
腾讯云NVIDIA GPU实例配置性能使用场景及注意事项
分子建模基因组学及其他 NVIDIA GPU云服务器硬件规格 NVIDIA GPU云服务器硬件规格规格说明： GPU 性能：主要指标为 GPU 的浮点运行能力，TF 代表 T Flops，SP 代表 single-precision
3.8K00发布于 2019-06-01
来自专栏嵌入式程序猿
号外号外：无规矩不成方圆
浮点类型的定义提供了一个注释所用浮点标准的机会，如： /* IEEE 754 single-precision floating-point */ typedef float float32_t; 一天不用学习很多
86170发布于 2018-04-10
来自专栏Android必知必会
Android 必知必会 - 使用 ADB 操作 Clipboard
f: Write the 32-bit single-precision number N into the send parcel.
2.6K30发布于 2019-07-01
来自专栏GPUS开发者
DAY79：阅读 Compute Capabilities
while SNaN encodings are supported, they are not signaling and are handled as quiet; The result of a single-precision Regardless of the setting of the compiler flag -ftz, Atomic single-precision floating-point adds on global memory always operate in flush-to-zero mode, i.e., behave equivalent to FADD.F32.FTZ.RN, Atomic single-precision
1.1K20发布于 2018-10-23
来自专栏嵌入式与Linux那些事
《深入理解计算机系统》（CSAPP）实验一 —— Data Lab
passed as unsigned int's, but * they are to be interpreted as the bit-level representation of * single-precision passed as unsigned int, but * it is to be interpreted as the bit-level representation of a * single-precision
2.9K20发布于 2021-05-20
来自专栏GPUS开发者
DAY83：阅读Compute Capability 7.x
Architecture A multiprocessor consists of: 64 FP32 cores for single-precision arithmetic operations, math, 8 mixed-precision Tensor Cores for deep learning matrix arithmetic 16 special function units for single-precision
1.3K20发布于 2018-11-06
来自专栏每月技术成长
RVA23 profile
RVA22U64包含以下强制 extensionsM Integer multiplication and division.M 整数乘法和除法A Atomic instructions.A 原子指令F Single-precision Extension for Integer Multiplication and DivisionA Extension for Atomic InstructionsF Extension for Single-Precision Half-Precision Floating-PointZfhmin Minimal Extension for Half-Precision Floating-PointZfinx Extension for Single-Precision
1.2K10编辑于 2025-03-06
RVB 23 profile
M Integer multiplication and division.M 整数乘除A Atomic instructions.A 原子指令F Single-precision floating-point Extension for Integer Multiplication and DivisionA Extension for Atomic InstructionsF Extension for Single-Precision Half-Precision Floating-PointZfhmin Minimal Extension for Half-Precision Floating-PointZfinx Extension for Single-Precision
81200编辑于 2025-03-15
来自专栏程序那些事
NumPy之:数据类型
Platform-defined extended-precision float np.csingle float complex Complex number, represented by two single-precision
80450发布于 2021-04-23
来自专栏程序那些事
NumPy之:数据类型
Platform-defined extended-precision float np.csingle float complex Complex number, represented by two single-precision
80420发布于 2021-05-11
来自专栏机器之心
学界 | 减少模型半数内存用量：百度&英伟达提出混合精度训练法
研究人员在多种深度学习模型上运行了新方法，并与常用方法进行了对比：基线（FP32）：单精度（Single-precision）存储用于激活、权重和梯度。计算也使用单精度单元。
1.4K90发布于 2018-05-08

第 2 页第 3 页第 4 页

点击加载更多

DAY34:阅读算术指令

百度跨平台 AI 推理加速引擎 -- Anakin

DAY15：阅读CUDA C runtime之纹理内存

C++那些事之高性能SIMD

Tesla P100

tf.dtypes

跟踪算法(一)光流法跟踪

Opencv学习笔记（九）光流法

opencv: 颜色空间转换(cv2.cvtColor) 探究(图示+源码)

腾讯云NVIDIA GPU实例配置性能使用场景及注意事项

号外号外：无规矩不成方圆

Android 必知必会 - 使用 ADB 操作 Clipboard

DAY79：阅读 Compute Capabilities

《深入理解计算机系统》（CSAPP）实验一 —— Data Lab

DAY83：阅读Compute Capability 7.x

RVA23 profile

RVB 23 profile

NumPy之:数据类型

NumPy之:数据类型

学界 | 减少模型半数内存用量：百度&英伟达提出混合精度训练法

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐