In recent years, COVID-19 has made it difficult for people to interact face-to-face, yet various kinds of social interaction are still needed. We have therefore developed an online interactive system, based on image processing, that merges the human regions of two images captured in different places into a single image in real time. The system can be used in a variety of situations, which extends its range of interactive applications. It is based mainly on human segmentation performed with a CNN (convolutional neural network). The images from the different locations are transmitted to a computing server over the Internet. Our design ensures that the CNN runs in real time, so that users on both sides can see the integrated image at up to 30 FPS when the network is running smoothly.
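The merging step can be illustrated with a minimal sketch. The abstract does not specify how the segmented human region is combined with the other image, so the following assumes simple per-pixel alpha compositing with NumPy. The function name `merge_human_region`, its parameters, and the assumption that both frames are same-sized 8-bit color images are hypothetical; the mask is assumed to come from the segmentation CNN, which is not shown here.

```python
import numpy as np

def merge_human_region(base_frame, remote_frame, remote_mask):
    """Paste the human region of remote_frame onto base_frame.

    Assumptions (not taken from the paper):
      - both frames are uint8 arrays of shape (H, W, 3)
      - remote_mask has shape (H, W), with values in [0, 1],
        where 1 marks a pixel that belongs to a person
    """
    if base_frame.shape != remote_frame.shape:
        raise ValueError("frames must have the same shape")
    if remote_mask.shape != base_frame.shape[:2]:
        raise ValueError("mask must match the frame height and width")

    # Add a trailing axis so the mask broadcasts over the color channels.
    alpha = np.clip(remote_mask.astype(np.float32), 0.0, 1.0)[..., None]

    # Per-pixel blend: person pixels come from remote_frame,
    # everything else is kept from base_frame.
    merged = (alpha * remote_frame.astype(np.float32)
              + (1.0 - alpha) * base_frame.astype(np.float32))
    return np.rint(merged).astype(np.uint8)
```

A soft mask (values between 0 and 1) blends the two frames along the boundary of the person, which may reduce visible edges; a hard binary mask copies pixels unchanged.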