PS5 Pro updated rumors/leaks & technical/specs discussion |OT| PS5 Pro Enhanced Requirements Detailed.

Gediminas

Boy...
Founder
21 Jun 2022
5,678
7,246
In any case, at over $600, it's going to be hard to convince gamers. And let's hope it will also bring the price of the slim down permanently.
not me. but i can see that scenario. but is depends, what kind of advantages it will offer? we have to remember, that it doesn't targeting casuals.
 
  • Like
Reactions: Entropi

John Elden Ring

The Thread Maker
Content Creator
5 Jul 2022
4,821
5,531
United States
ps5_future_of_gaming.0-1.jpg


Insider Gaming Details a New PSSR Solution (PlayStation Super Resolution) which will be coming to PS5 and Future Consoles:

PlayStation’s new Spectral Super Resolution (PSSR), which will first be integrated into the PlayStation 5 Pro is internally aiming for 4K 120 FPS and 8K 60FPS console gaming, Insider Gaming has learned.
 

Satoru

Limitless
Founder
20 Jun 2022
6,799
10,242
This math is not possible for the PS5 pro.

For the PS5 you have:

36 CUs x 64 shaders per CU x 2.23 GHz x 2 instructions per clock = 10.275 tflops

Assuming the PS5 pro has working CUs, it will pull

60 CUs x 64 shaders per CU (basically RDNA standard) x 2.18 GHz G x 2 instructions per clock = 16.742 tflops

You have to double this for fp16, but the number we always talked about is fp32.

Not that this matters as teraflops are a single measure amongst an ocean of variables that impact performance.

I'm much more interested in the supposed RNDA 4 features, which I think are just more bull from supposed leakers.
 
  • they're_right_you_know
Reactions: Entropi

Gediminas

Boy...
Founder
21 Jun 2022
5,678
7,246

ToTTenTranz

Well-known member
Icon Extra
4 Aug 2023
483
545
ps5_future_of_gaming.0-1.jpg


Insider Gaming Details a New PSSR Solution (PlayStation Super Resolution) which will be coming to PS5 and Future Consoles:



Details here are scarce, but it looks like my prediction was correct:


PSSR that will probably allow for the PS5 Pro to render at a lower native resolution than a PS5 using temporal upscaling


Here's what they're saying on the first example:

Target – image quality close to Fidelity Mode (1800p) with Performance Mode FPS (60 FPS)

Standard PlayStation 5 –

  • Performance Mode – 1080p at 60FPS
  • Fidelity Mode – 1800p at 30FPS

PlayStation 5 Pro –

  • 1440p at 60FPS (PSSR used)

So for the same/similar output image quality in a 4K presentation, the PS5 Pro renders at 2560*1440 (3.7MPix) where the PS5 renders at 3200*1800 (5.7MPix).
We now know Sony is aiming for PSSR to get the same IQ as other temporal upscalers like FSR2 or TSR but at 64% of the base resolution which is quite impressive.

It's bit like saying we're getting FSR2 Quality IQ but with FSR2 Performance framerate.
 
  • Like
Reactions: Entropi

Entropi

Veteran
Icon Extra
22 Jan 2023
2,247
3,367

Again, I’m starting to believe these hacks are purposely posting misinformation to create drama and engagement artificially, or someone needs to explain how they came up with the 33.3 TF.

Also, GPU clocks are lower? Very odd.
 

AllBizness

Veteran
22 Jul 2023
678
721
Again, I’m starting to believe these hacks are purposely posting misinformation to create drama and engagement artificially, or someone needs to explain how they came up with the 33.3 TF.

Also, GPU clocks are lower? Very odd.
Its 33tf in FP32, if yall naysayers want people to take tall seriously on your lowball teraflops claims then you need to list the dual issue teraflops for the current gen AMD RDNA 3 GPU's. Dont just single out PS5 Pro, please enlighten us with the true lower teraflops numbers for the 7900XT, 7800 ect..
 
  • Like
Reactions: Entropi

Evilms

Graph Master
21 Jun 2022
1,130
2,435
image.png


Again, I’m starting to believe these hacks are purposely posting misinformation to create drama and engagement artificially, or someone needs to explain how they came up with the 33.3 TF.

Also, GPU clocks are lower? Very odd.

33.5 TF it's with dual-issue shaders.
 
  • Like
Reactions: Entropi

Gediminas

Boy...
Founder
21 Jun 2022
5,678
7,246
Again, I’m starting to believe these hacks are purposely posting misinformation to create drama and engagement artificially, or someone needs to explain how they came up with the 33.3 TF.

Also, GPU clocks are lower? Very odd.
Cuda cores X gpu speed X 2 (for single precision) divided by tera

7680 x 2180 x 2 / 1.000.000 = 33,48tf
 
  • Shake
Reactions: Entropi

ToTTenTranz

Well-known member
Icon Extra
4 Aug 2023
483
545
Again, I’m starting to believe these hacks are purposely posting misinformation to create drama and engagement artificially, or someone needs to explain how they came up with the 33.3 TF.

- 30 WGP = 60 Compute Units
- 64 ALUs per Compute Unit on RDNA: 60x64
- FP32 ALUs in RDNA3 are 2-wide (or double-pumped): 60x64x2
- ALU operations can be MUL (multiply), ADD (sum) or FMAD/MADD (fused multiply add). Since multiply and add are counted as 2 operations, you double the count again: 60x64x2x2
- To get total theoretical FLOP output, you multiply that by the clock. TFLOPs = 60x64x2x2 x clocks.

- So, to achieve the GPU clock for the rumored 33.5 TFLOPs FP32, we calculate clock = (33 500 000) / (60x64x2x2). Therefore clock is 2181 MHz.


Also, GPU clocks are lower? Very odd.

It's indeed odd. I think 2181MHz is base clock without boost, like 2GHz is the base clock for the PS5's GPU.
It that's the case and the GPU averages at say 2600MHz, the actual throughput will be closer to 40 TFLOPs.
 

Entropi

Veteran
Icon Extra
22 Jan 2023
2,247
3,367
- 30 WGP = 60 Compute Units
- 64 ALUs per Compute Unit on RDNA: 60x64
- FP32 ALUs in RDNA3 are 2-wide (or double-pumped): 60x64x2
- ALU operations can be MUL (multiply), ADD (sum) or FMAD/MADD (fused multiply add). Since multiply and add are counted as 2 operations, you double the count again: 60x64x2x2
- To get total theoretical FLOP output, you multiply that by the clock. TFLOPs = 60x64x2x2 x clocks.

- So, to achieve the GPU clock for the rumored 33.5 TFLOPs FP32, we calculate clock = (33 500 000) / (60x64x2x2). Therefore clock is 2181 MHz.




It's indeed odd. I think 2181MHz is base clock without boost, like 2GHz is the base clock for the PS5's GPU.
It that's the case and the GPU averages at say 2600MHz, the actual throughput will be closer to 40 TFLOPs.

Game engines would need to be rewritten for that new feature though, right? And with the base PS5 not having it, it is dead weight or maybe not even included to cut costs.

Thanks for the explanation, I knew the math to get to 16 but was not sure why it was 2x that number.
 

Satoru

Limitless
Founder
20 Jun 2022
6,799
10,242
Its 33tf in FP32, if yall naysayers want people to take tall seriously on your lowball teraflops claims then you need to list the dual issue teraflops for the current gen AMD RDNA 3 GPU's. Dont just single out PS5 Pro, please enlighten us with the true lower teraflops numbers for the 7900XT, 7800 ect..
No, it isn't, unless the console has 128 shader per compute unit. I've done the math, just look for my previous post.

Edit: it's 16.7 at fp32 and 33.4 at fp16.
 

Satoru

Limitless
Founder
20 Jun 2022
6,799
10,242
This math is not possible for the PS5 pro.

For the PS5 you have:

36 CUs x 64 shaders per CU x 2.23 GHz x 2 instructions per clock = 10.275 tflops

Assuming the PS5 pro has working CUs, it will pull

60 CUs x 64 shaders per CU (basically RDNA standard) x 2.18 GHz G x 2 instructions per clock = 16.742 tflops

You have to double this for fp16, but the number we always talked about is fp32.

Not that this matters as teraflops are a single measure amongst an ocean of variables that impact performance.

I'm much more interested in the supposed RNDA 4 features, which I think are just more bull from supposed leakers.

I'm going to quote my post again and clarify: unless the PS5 pro has 128 shaders per CU or has a GPU running at 4+ GHz it will have 16.7tflop at fp32. You double teraflops fp16, not halve.
 
  • Like
Reactions: peter42O

Entropi

Veteran
Icon Extra
22 Jan 2023
2,247
3,367
Its 33tf in FP32, if yall naysayers want people to take tall seriously on your lowball teraflops claims then you need to list the dual issue teraflops for the current gen AMD RDNA 3 GPU's. Dont just single out PS5 Pro, please enlighten us with the true lower teraflops numbers for the 7900XT, 7800 ect..

Isn’t dual-issue very limited in current AMD cards? Unless it works like magic and developers don’t have to rewrite code for it, it is useless on the PS5 Pro, hence, 16 TF is the most accurate number.
 

historia

Veteran
Icon Extra
29 Jun 2023
2,818
2,717
I just said PS5 Pro PS5 Pro PS5 Pro three-time in front of a mirror at midnight.

Now it is true right?
 

ToTTenTranz

Well-known member
Icon Extra
4 Aug 2023
483
545
Game engines would need to be rewritten for that new feature though, right?

The number of instructions in AMD's ISA that support the per-ALU dual-FP32 feature is limited, yes. The guys over at Chips and Cheese couldn't find a way to measure the dual-issue output until they tested Nemes' compute benchmark for Vulkan. I think they don't know what Nemes is using in her benchmark to trigger the ability but the feature is there nonetheless, so it can be used. Assuming it all fits in the caches and registers, of course.


And with the base PS5 not having it, it is dead weight or maybe not even included to cut costs.

It would probably be dead weight in the PC space except for stuff they fully control like driver-level features (FSR1, AFMF, etc.) or wherever they can hand-tune driver optimizations. Sony however owns the platform and the SDK so they can implement these optimizations directly into the compiler, and they can automate a list of optimizations for their 1st party engines as well as the widely used ones (UE5, Anvil, etc.).
 
  • Shake
Reactions: Entropi

Entropi

Veteran
Icon Extra
22 Jan 2023
2,247
3,367
The number of instructions in AMD's ISA that support the per-ALU dual-FP32 feature is limited, yes. The guys over at Chips and Cheese couldn't find a way to measure the dual-issue output until they tested Nemes' compute benchmark for Vulkan. I think they don't know what Nemes is using in her benchmark to trigger the ability but the feature is there nonetheless, so it can be used. Assuming it all fits in the caches and registers, of course.




It would probably be dead weight in the PC space except for stuff they fully control like driver-level features (FSR1, AFMF, etc.) or wherever they can hand-tune driver optimizations. Sony however owns the platform and the SDK so they can implement these optimizations directly into the compiler, and they can automate a list of optimizations for their 1st party engines as well as the widely used ones (UE5, Anvil, etc.).

Again, thanks for the info; I need to catch up on what is happening in the PC space.
 
  • Shake
Reactions: ToTTenTranz