They are very small, since the structures are recursive - about 2-4 cells per layer (for now). So the final production code is highly efficient, a bunch of multiply-adds followed by nonlinearities, all of which can be further optimized with SSE or AVX intrinsics.
Richard