Busca360: A Search Application in the Context of Top-Side Asset Integrity Management in the Oil & Gas Industry
Resumo
Oil and gas industry applications often require querying data of various types and integrating the query results. Data range from structured tables stored in databases to documents and images organized in digital libraries. The users typically have technical training but are not necessarily versed in Information Technology, meaning the data processing tasks may burden them significantly. This article introduces a multimodal search application, called Busca360, designed to alleviate this burden and discusses the main challenges that emerged during the research, implementation, and user experience. The application uses structured data in the context of asset integrity management and 360º images of equipment and installation locations. Finally, this article concludes with real-world use cases that show how the proposed multimodal search application helps perform planning and maintenance tasks.
Palavras-chave:
multimodal search, industrial data integration, relational database, sql query
Referências
Bergamaschi, S., Guerra, F., Interlandi, M., Trillo-Lado, R., and Velegrakis, Y. (2016). Combining user and database perspective for solving keyword queries over relational databases. Information Systems, 55:1–19.
Boehm, K. M., Khosravi, P., Vanguri, R., Gao, J., and Shah, S. P. (2022). Harnessing multimodal data integration to advance precision oncology. Nature Reviews Cancer, 22:114–126.
de Oliveira, P., da Silva, A., and de Moura, E. (2015). Ranking candidate networks of relations to improve keyword search over relational databases. In 2015 IEEE 31st International Conference on Data Engineering, pages 399–410.
Doan, A., Halevy, A. Y., and Ives, Z. G. (2012). Principles of Data Integration. Morgan Kaufmann, San Francisco, CA, USA, 1st edition.
Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., et al. (2021). An image is worth 16x16 words: Transformers for image recognition at scale. In 9th International Conference ICLR 2021, page 21. OpenReview.net.
Espíndola, D. B., Fumagalli, L., Garetti, M., Pereira, C. E., Botelho, S. S., and Ventura Henriques, R. (2013). A model-based approach for data integration to improve main-tenance management by mixed reality. Computers in Industry, 64(4):376–391.
Garcia, G. M. (2020). A Keyword-based Query Processing Method for Datasets with Schemas. PhD thesis, Graduate Program in Informatics, PUC-Rio.
García, G. M., Izquierdo, Y. T., Menendez, E., Dartayre, F., and Casanova, M. A. (2017). Rdf keyword-based query technology meets a real-world dataset. In Proceedings of the International Conference on Extending Database Technology, pages 656–667.
Izquierdo, Y. T., Garcia, G. M., Lemos, M., Novello, A., Novelli, B., Damasceno, C., Leme, L. A. P. P., and Casanova, M. A. (2021). A platform for keyword search and its application for covid-19 pandemic data. Journal of Information and Data Management, 12(5).
Izquierdo, Y. T., García, G. M., Menendez, E. S., Casanova, M. A., Dartayre, F., and Levy, C. H. (2018). Quiow: a keyword-based query processing tool for rdf datasets and relational databases. In International Conference on Database and Expert Systems Applications (DEXA), pages 259–269. Springer.
Li, X., Yang, J., and Ma, J. (2021). Recent developments of content-based image retrieval (cbir). Neurocomputing, 452:675–689.
Molina, E., Hamazaki, G., Izquierdo, Y., Lemos, M., Britto, P., Corseuil, E., and Garcia, R. (2024). A proposal of a knowledge graph for digital engineering systems integration for operation and maintenance activities in industrial plants. In XX Brazilian Symposium on Information Systems (SBSI).
Nascimento, E., García, G., Victorio, W., Lemos, M., Izquierdo, Y., Garcia, R., Leme, L., and Casanova, M. (2023). A family of natural language interfaces for databases based on chatgpt and langchain. In 42nd International Conference on Conceptual Modeling – Posters&Demos, pages 1–5.
Nascimento, E., Izquierdo, Y., García, G., Coelho, G., Feijó, L., Lemos, M., Leme, L., and Casanova, M. (2024). My database user is a large language model. In 26th International Conference on Enterprise Information Systems, pages 800–806.
Nguyen, T. H., Prinz, A., Friisø, T., Nossum, R., and Tyapin, I. (2013). A framework for data integration of offshore wind farms. Renewable Energy, 60:150–161.
Pinheiro, J., Victorio, W., Nascimento, E., Seabra, A., Izquierdo, Y., Garcıa, G., Coelho, G., Lemos, M., Leme, L. A. P. P., Furtado, A., et al. (2023). On the construction of database interfaces based on large language models. In 19th International Conference on WEBIST, pages 373–380.
Ramada, M. S., da Silva, J. C., and de Sá Leitão-Júnior, P. (2020). From keywords to relational database content: A semantic mapping method. Information Systems, 88:101460.
Raposo, A., Santos, I., Soares, L., Wagner, G., Corseuil, E., and Gattass, M. (2009). Environ: Integrating vr and cad in engineering projects. In IEEE Computer Graphics and Applications, volume 29.
Boehm, K. M., Khosravi, P., Vanguri, R., Gao, J., and Shah, S. P. (2022). Harnessing multimodal data integration to advance precision oncology. Nature Reviews Cancer, 22:114–126.
de Oliveira, P., da Silva, A., and de Moura, E. (2015). Ranking candidate networks of relations to improve keyword search over relational databases. In 2015 IEEE 31st International Conference on Data Engineering, pages 399–410.
Doan, A., Halevy, A. Y., and Ives, Z. G. (2012). Principles of Data Integration. Morgan Kaufmann, San Francisco, CA, USA, 1st edition.
Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., et al. (2021). An image is worth 16x16 words: Transformers for image recognition at scale. In 9th International Conference ICLR 2021, page 21. OpenReview.net.
Espíndola, D. B., Fumagalli, L., Garetti, M., Pereira, C. E., Botelho, S. S., and Ventura Henriques, R. (2013). A model-based approach for data integration to improve main-tenance management by mixed reality. Computers in Industry, 64(4):376–391.
Garcia, G. M. (2020). A Keyword-based Query Processing Method for Datasets with Schemas. PhD thesis, Graduate Program in Informatics, PUC-Rio.
García, G. M., Izquierdo, Y. T., Menendez, E., Dartayre, F., and Casanova, M. A. (2017). Rdf keyword-based query technology meets a real-world dataset. In Proceedings of the International Conference on Extending Database Technology, pages 656–667.
Izquierdo, Y. T., Garcia, G. M., Lemos, M., Novello, A., Novelli, B., Damasceno, C., Leme, L. A. P. P., and Casanova, M. A. (2021). A platform for keyword search and its application for covid-19 pandemic data. Journal of Information and Data Management, 12(5).
Izquierdo, Y. T., García, G. M., Menendez, E. S., Casanova, M. A., Dartayre, F., and Levy, C. H. (2018). Quiow: a keyword-based query processing tool for rdf datasets and relational databases. In International Conference on Database and Expert Systems Applications (DEXA), pages 259–269. Springer.
Li, X., Yang, J., and Ma, J. (2021). Recent developments of content-based image retrieval (cbir). Neurocomputing, 452:675–689.
Molina, E., Hamazaki, G., Izquierdo, Y., Lemos, M., Britto, P., Corseuil, E., and Garcia, R. (2024). A proposal of a knowledge graph for digital engineering systems integration for operation and maintenance activities in industrial plants. In XX Brazilian Symposium on Information Systems (SBSI).
Nascimento, E., García, G., Victorio, W., Lemos, M., Izquierdo, Y., Garcia, R., Leme, L., and Casanova, M. (2023). A family of natural language interfaces for databases based on chatgpt and langchain. In 42nd International Conference on Conceptual Modeling – Posters&Demos, pages 1–5.
Nascimento, E., Izquierdo, Y., García, G., Coelho, G., Feijó, L., Lemos, M., Leme, L., and Casanova, M. (2024). My database user is a large language model. In 26th International Conference on Enterprise Information Systems, pages 800–806.
Nguyen, T. H., Prinz, A., Friisø, T., Nossum, R., and Tyapin, I. (2013). A framework for data integration of offshore wind farms. Renewable Energy, 60:150–161.
Pinheiro, J., Victorio, W., Nascimento, E., Seabra, A., Izquierdo, Y., Garcıa, G., Coelho, G., Lemos, M., Leme, L. A. P. P., Furtado, A., et al. (2023). On the construction of database interfaces based on large language models. In 19th International Conference on WEBIST, pages 373–380.
Ramada, M. S., da Silva, J. C., and de Sá Leitão-Júnior, P. (2020). From keywords to relational database content: A semantic mapping method. Information Systems, 88:101460.
Raposo, A., Santos, I., Soares, L., Wagner, G., Corseuil, E., and Gattass, M. (2009). Environ: Integrating vr and cad in engineering projects. In IEEE Computer Graphics and Applications, volume 29.
Publicado
14/10/2024
Como Citar
T. IZQUIERDO, Yenier et al.
Busca360: A Search Application in the Context of Top-Side Asset Integrity Management in the Oil & Gas Industry. In: SIMPÓSIO BRASILEIRO DE BANCO DE DADOS (SBBD), 39. , 2024, Florianópolis/SC.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2024
.
p. 104-116.
ISSN 2763-8979.
DOI: https://doi.org/10.5753/sbbd.2024.240793.