Definitely, although they're using basically none of the logic resources. You could probably get the bulk of the result with an Artix 7 and a memory controller (instead of using all those large BRAMs). Or just an Artix 7 and not allocating 64kB of BRAM for each core.
$129 for an Artix 7 development board is a steal [1].
Yeah, if you just replaced the 64KB of RAM with a 4KB cache backed by DRAM, you could probably fit 64 cores in one of those no problem, but 1) the performance wouldn't be quite as good, and 2) this project was mostly an exercise in excess since I was trying to find something to do with my giant FPGA board.
Sorry, to be clear, that was by no means a criticism of what you did. I was just pointing out that this kind of thing is more accessible to people than they may think when they get sticker shock re: the cost of that particular FPGA you built on.
It's current manifestation is very much a 'good enough' solution. I just wanted a scheme for simple all-to-all connectivity to map the serial ports to, and the only traffic it carries at the moment is either terminal traffic or disk data from the SD card, so it's not even close to being taxed. The obvious improvements are to add deep buffers to hide the latency of credit return, crank up the clock and to make the bus wider. This board could probably handle a 200-bit wide bus at 100 MHz+ speeds with no problems. Computer architecture is a fun hobby =)
$129 for an Artix 7 development board is a steal [1].
[1] https://store.digilentinc.com/arty-a7-artix-7-fpga-developme...