Implement a cache oblivious multiplication algorithm for huge inputs, too large for main memory. The benefit of cache oblivious is that swap disk (virtual memory, paging file) can be transparently and efficiently used in conjunction with main memory. Hopefully the address space is large enough. This is in contrast to current implementations for calculating pi which manually manage disk. The implementation becomes simpler.
Cache oblivious FFT already seems to exist, using cache oblivious matrix transpose to twiddle.
No comments :
Post a Comment