Spatio-Temporal Convolutional Neural Network for Enhanced Inter Prediction in Video Coding