Search found 6 matches
- 2018-07-18, 9:06:56
- Forum: forwardcom forum
- Topic: What to do with unclear standards?
- Replies: 8
- Views: 23254
Re: What to do with unclear standards?
Mhmm... Question: is there any reason for C to be this highly abstract machine, rather than the real world "sum-of-x86-ARM-MIPS-PPC-68k-SPARC-ALPHA-ITANIUM-PARISC-SUPERH-32/64bit-compiled-on-GCC/LLVM/MSVC/ICC" de-facto architecture (for lack of a better term), OTHER than for compiler opti...
- 2018-05-25, 9:09:58
- Forum: forwardcom forum
- Topic: NAN propagation instead of fault trapping. Can we avoid speculative execution?
- Replies: 3
- Views: 17294
Re: NAN propagation instead of fault trapping. Can we avoid speculative execution?
Hi Agner, I really think that NaN propagation is much better than fault trapping. However, I fail to see how avoiding fault trapping would remove the need for speculative execution... I see why it reduces the complexity, but I have the feeling speculative execution is still needed: traps are not the...
- 2018-04-14, 8:47:03
- Forum: forwardcom forum
- Topic: Emulating multiple output instructions with caching
- Replies: 6
- Views: 18329
Re: Emulating multiple output instructions with caching
Actually, the multiple instruction scheme is also applicable to the division (just call div and rem) The only thing: in that case it will be slow. That's were I started to think about caching to not pay for the second divsion (required for rem). I wonder: is there any performance critical codes that...
- 2018-04-13, 13:53:55
- Forum: forwardcom forum
- Topic: Singularities handling
- Replies: 0
- Views: 31967
Singularities handling
I was looking in the intel documentation and I found the function _mm512_fixupimm_ps: https://software.intel.com/sites/landingpage/IntrinsicsGuide/#expand=2380,4470,2357,2360,2380&text=fixup Basically, this function allows to map some value type to other value type. The primary use is to handle ...
- 2018-04-13, 13:28:40
- Forum: forwardcom forum
- Topic: Emulating multiple output instructions with caching
- Replies: 6
- Views: 18329
Re: Emulating multiple output instructions with caching
What I like with the "caching" approach: it is completely transparent. If it is not implemented by the hardware, the software still behaves exactly the same (only slower). And it does not require any cross-lane communication. After some thinking, such a caching would be beneficial only for...
- 2018-04-13, 10:01:26
- Forum: forwardcom forum
- Topic: Emulating multiple output instructions with caching
- Replies: 6
- Views: 18329
Emulating multiple output instructions with caching
Hi Agner, Some operations have naturally multiple outputs: like multiplication (low part and high part), division (quotient and remainder), addition (result and carry)... As far as I understand, you want to avoid multiple outputs to simplify the hardware. What if those multiple outputs were single o...