Cyberphysical Smart Cities Infrastructures: Optimal Operation and Intelligent Decision Making
33 Amirian, S., Rasheed, K., Taha, T.R., and Arabnia, H.R. (2020). Automatic generation of descriptive titles for video clips using deep learning. In: Springer Nature ‐ Research Book Series: Transactions on Computational Science & Computational Intelligence, Hamid R. Arabnia, Springer. 17–28.
34 Gao, L., Guo, Z., Zhang, H. et al. (2017). Video captioning with attention‐based LSTM and semantic consistency. IEEE Transactions on Multimedia 19 (9): 2045–2055.
35 Yang, Y., Zhou, J., Ai, J. et al. (2018). Video captioning by adversarial LSTM. IEEE Transactions on Image Processing 27 (11): 5600–5611.
36 Singh, A., Natarajan, V., Shah, M. et al. (2019). Towards VQA models that can read. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8317–8326.
37 Jayaraman, D. and Grauman, K. (2017). Learning image representations tied to egomotion from unlabeled video. International Journal of Computer Vision 125 (1–3): 136–161.
38 Jayaraman, D., Gao, R., and Grauman, K. (2018). Shapecodes: self‐supervised feature learning by lifting views to viewgrids. Proceedings of the European Conference on Computer Vision (ECCV), pp. 120–136.