Toward a multi-level parallel framework on GPU cluster with PetSC-CUDA for PDE-based Optical Flow computation