When Lucene first hit the scene five years ago, it was nothing short of amazing. By using this open-source, highly scalable, super-fast search engine, developers could integrate search into applications quickly and efficiently. A lot has changed since then-search has grown from a "nice-to-have" feature into an indispensable part of most enterprise applications. Lucene now powers search in diverse companies including Akamai, Netflix, LinkedIn, Technorati, HotJobs, Epiphany, FedEx, Mayo Clinic, MIT, New Scientist Magazine, and many others. Some things remain the same, though. Lucene still delivers high-performance search features in a disarmingly easy-to-use API. Due to its vibrant and diverse open-source community of developers and users, Lucene is relentlessly improving, with evolutions to APIs, significant new features such as payloads, and a huge increase (as much as 8x) in indexing speed with Lucene 2.3. And with clear writing, reusable examples, and unmatched advice on best practices, Lucene in Action, Second Edition is still the definitive guide to developing with Lucene.
Cited By
- Liu C, Xia X, Lo D, Gao C, Yang X and Grundy J (2021). Opportunities and Challenges in Code Search Tools, ACM Computing Surveys, 54:9, (1-40), Online publication date: 31-Dec-2022.
- Kim J, Feng K, Cong G, Zhu D, Yu W and Miao C (2022). ABC, Proceedings of the VLDB Endowment, 15:10, (2134-2147), Online publication date: 1-Jun-2022.
- Florez J, Perry J, Wei S and Marcus A Retrieving data constraint implementations using fine-grained code patterns Proceedings of the 44th International Conference on Software Engineering, (1893-1905)
- Ren M, Huang H and Gao Y (2021). SKR-QA, Neurocomputing, 459:C, (142-151), Online publication date: 12-Oct-2021.
- Vizcarra J, Kozaki K, Torres Ruiz M and Quintero R (2021). Knowledge-Based Sentiment Analysis and Visualization on Social Networks, New Generation Computing, 39:1, (199-229), Online publication date: 1-Apr-2021.
- Chen Z, Cheng X, Dong S, Dou Z, Guo J, Huang X, Lan Y, Li C, Li R, Liu T, Liu Y, Ma J, Qin B, Wang M, Wen J, Xu J, Zhang M, Zhang P and Zhang Q (2020). Information retrieval: a view from the Chinese IR community, Frontiers of Computer Science: Selected Publications from Chinese Universities, 15:1, Online publication date: 1-Feb-2021.
- Jiang Y (2020). Semantically-enhanced information retrieval using multiple knowledge sources, Cluster Computing, 23:4, (2925-2944), Online publication date: 1-Dec-2020.
- Slimani H, Hamal O, El faddouli N, Bennani S and Amrous N The hybrid recommendation of digital educational resources in a distance learning environment Proceedings of the 13th International Conference on Intelligent Systems: Theories and Applications, (1-9)
- Kolbe N, Vandenbussche P, Kubler S and Le Traon Y LOVBench: Ontology Ranking Benchmark Proceedings of The Web Conference 2020, (1750-1760)
- Bendechache M, Svorobej S, Endo P, Mario M, Ares M, Byrne J and Lynn T Modelling and simulation of ElasticSearch using CloudSim Proceedings of the 23rd IEEE/ACM International Symposium on Distributed Simulation and Real Time Applications, (223-230)
- Rodriguez-Cancio M, Combemale B and Baudry B Approximate loop unrolling Proceedings of the 16th ACM International Conference on Computing Frontiers, (94-105)
- Cagliero L, Garza P and Baralis E (2019). ELSA, ACM Transactions on Information Systems, 37:2, (1-33), Online publication date: 30-Apr-2019.
- Bruno R, Patricio D, Simão J, Veiga L and Ferreira P Runtime Object Lifetime Profiler for Latency Sensitive Big Data Applications Proceedings of the Fourteenth EuroSys Conference 2019, (1-16)
- Lokoč J, Kovalčík G, Münzer B, Schöffmann K, Bailer W, Gasser R, Vrochidis S, Nguyen P, Rujikietgumjorn S and Barthel K (2019). Interactive Search or Sequential Browsing? A Detailed Analysis of the Video Browser Showdown 2018, ACM Transactions on Multimedia Computing, Communications, and Applications, 15:1, (1-18), Online publication date: 28-Feb-2019.
- Melucci M and Paggiaro A (2019). Evaluation of information retrieval systems using structural equation modeling, Computer Science Review, 31:C, (1-18), Online publication date: 1-Feb-2019.
- Deveaud R, Mothe J, Ullah M and Nie J (2018). Learning to Adaptively Rank Document Retrieval System Configurations, ACM Transactions on Information Systems, 37:1, (1-41), Online publication date: 31-Jan-2019.
- Klarman U, Flores M and Kuzmanovic A Mining the web with webcoin Proceedings of the 14th International Conference on emerging Networking EXperiments and Technologies, (165-177)
- Sriraman A and Wenisch T µtune Proceedings of the 13th USENIX conference on Operating Systems Design and Implementation, (177-194)
- Maas M, Asanović K and Kubiatowicz J A hardware accelerator for tracing garbage collection Proceedings of the 45th Annual International Symposium on Computer Architecture, (138-151)
- Kim K, Kim D, Bissyandé T, Choi E, Li L, Klein J and Traon Y FaCoY Proceedings of the 40th International Conference on Software Engineering, (946-957)
- Prabhune A, Stotzka R, Sakharkar V, Hesser J and Gertz M (2018). MetaStore, Distributed and Parallel Databases, 36:1, (153-194), Online publication date: 1-Mar-2018.
- Rekik R, Kallel I, Casillas J and Alimi A (2018). Assessing web sites quality, International Journal of Information Management: The Journal for Information Professionals, 38:1, (201-216), Online publication date: 1-Feb-2018.
- Bruno R and Ferreira P POLM2 Proceedings of the 18th ACM/IFIP/USENIX Middleware Conference, (147-160)
- Grover A, Arya D and Venkataraman G Latency Reduction via Decision Tree Based Query Construction Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, (1399-1407)
- Bruno R, Oliveira L and Ferreira P (2017). NG2C: pretenuring garbage collection with dynamic generations for HotSpot big data applications, ACM SIGPLAN Notices, 52:9, (2-13), Online publication date: 31-Oct-2017.
- Araújo J, Araújo J, Magalhães C, Andrade J and Mota A Feasibility of using Source Code Changes on the Selection of Text-based Regression Test Cases Proceedings of the 2nd Brazilian Symposium on Systematic and Automated Software Testing, (1-6)
- Magalhães C, Andrade J, Perrusi L and Mota A Evaluating an Automatic Text-based Test Case Selection using a Non-Instrumented Code Coverage Analysis Proceedings of the 2nd Brazilian Symposium on Systematic and Automated Software Testing, (1-9)
- Hassen F and Amel G (2017). An efficient synchronous indexing technique for full-text retrieval in distributed databases, Procedia Computer Science, 112:C, (811-821), Online publication date: 1-Sep-2017.
- Jang H, Oh Y, Jin S, Jung H, Kong H, Lee D, Jeon D and Kim W KBQA Proceedings of the International Conference on Electronic Commerce, (1-8)
- Mitra B, Diaz F and Craswell N Luandri Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, (1221-1223)
- Bruno R, Oliveira L and Ferreira P NG2C: pretenuring garbage collection with dynamic generations for HotSpot big data applications Proceedings of the 2017 ACM SIGPLAN International Symposium on Memory Management, (2-13)
- Liang J, Jiang L, Meng D and Hauptmann A Leveraging Multi-modal Prior Knowledge for Large-scale Concept Learning in Noisy Web Data Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, (32-40)
- Siddiqa A, Karim A and Chang V (2017). SmallClient for big data, Cluster Computing, 20:2, (1193-1208), Online publication date: 1-Jun-2017.
- Grunzke R, Krüger J, Jäkel R, Nagel W, Herres-Pawlis S and Hoffmann A (2017). Metadata Management in the MoSGrid Science Gateway - Evaluation and the Expansion of Quantum Chemistry Support, Journal of Grid Computing, 15:1, (41-53), Online publication date: 1-Mar-2017.
- Ramos G, Sales V and Teixeira C LETRAS Proceedings of the 22nd Brazilian Symposium on Multimedia and the Web, (215-218)
- Zhang Y, Prekas G, Fumarola G, Fontoura M, Goiri Í and Bianchini R History-based harvesting of spare cycles and storage in large-scale datacenters Proceedings of the 12th USENIX conference on Operating Systems Design and Implementation, (755-770)
- Reuss P, Stram R, Juckenack C, Althoff K, Henkel W, Fischer D and Henning F FEATURE-TAK - Framework for Extraction, Analysis, and Transformation of Unstructured Textual Aircraft Knowledge Case-Based Reasoning Research and Development, (327-341)
- (2016). An efficient method to evaluate intersections on big data sets, Theoretical Computer Science, 647:C, (1-21), Online publication date: 27-Sep-2016.
- Magalhães C, Barros F, Mota A and Maia E Automatic Selection of Test Cases for Regression Testing Proceedings of the 1st Brazilian Symposium on Systematic and Automated Software Testing, (1-8)
- Yu H, Song W and Mine T APIBook Proceedings of the 8th Asia-Pacific Symposium on Internetware, (45-53)
- Guy I, Ronen I, Kravi E and Barnea M (2016). Increasing Activity in Enterprise Online Communities Using Content Recommendation, ACM Transactions on Computer-Human Interaction, 23:4, (1-28), Online publication date: 1-Sep-2016.
- Ibrahim O and Landa-Silva D (2016). Term frequency with average term occurrences for textual information retrieval, Soft Computing - A Fusion of Foundations, Methodologies and Applications, 20:8, (3045-3061), Online publication date: 1-Aug-2016.
- Baralis E, Cagliero L and Cerquitelli T Supporting stock trading in multiple foreign markets Proceedings of the Second International Workshop on Data Science for Macro-Modeling, (1-6)
- Candela L, Castelli D, Coro G, Pagano P and Sinibaldi F (2016). Species distribution modeling in the cloud, Concurrency and Computation: Practice & Experience, 28:4, (1056-1079), Online publication date: 25-Mar-2016.
- Palle A and Kulkarni R Classification of Medical MRI Brain Images based on Hadoop Proceedings of the Second International Conference on Information and Communication Technology for Competitive Strategies, (1-4)
- Piasecki M and Gut A Similarity Measure for Polish Short Texts Based on Wordnet-Enhanced Bag-of-words Representation Human Language Technology. Challenges for Computer Science and Linguistics, (169-182)
- Baralis E, Cagliero L, Fiori A and Garza P (2015). MWI-Sum, ACM Transactions on Information Systems, 34:1, (1-35), Online publication date: 1-Oct-2015.
- Guy I, Levin R, Daniel T and Bolshinsky E Islands in the Stream Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, (665-674)
- Bouadjenek M, Sanner S and Ferraro G A study of query reformulation for patent prior art search with partial patent applications Proceedings of the 15th International Conference on Artificial Intelligence and Law, (23-32)
- Gil R, Piveta E, Saccol D and de Faveri C A Tool for Searching in Unstructured Code AspectJ Proceedings of the annual conference on Brazilian Symposium on Information Systems: Information Systems: A Computer Socio-Technical Perspective - Volume 1, (39-46)
- Cho J, Sondhi P, Zhai C and Schatz B Resolving healthcare forum posts via similar thread retrieval Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics, (33-42)
- Ronen I, Guy I, Kravi E and Barnea M Recommending social media content to community owners Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval, (243-252)
- Bouadjenek M, Hacid H and Bouzeghoub M LAICOS Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, (1446-1449)
- Guy I, Avraham U, Carmel D, Ur S, Jacovi M and Ronen I Mining expertise and interests from social media Proceedings of the 22nd international conference on World Wide Web, (515-526)
- Guy I, Steier T, Barnea M, Ronen I and Daniel T Swimming against the streamz Proceedings of the 21st ACM international conference on Information and knowledge management, (1587-1591)
- Martínez-Rodríguez J, Sosa-Sosa V and López-Arévalo I Automatic discovery of web content related to IT in the mexican internet based on supervised classifiers Proceedings of the 11th Mexican international conference on Advances in Artificial Intelligence - Volume Part I, (107-118)
- Hermann C Improving document retrieval using special characteristics of lecture recording documents Proceedings of the 3rd Symposium on Information and Communication Technology, (250-259)
- Demidova E, Zhou X and Nejdl W FreeQ Proceedings of the 21st International Conference on World Wide Web, (325-328)
- Lux M Content based image retrieval with LIRe Proceedings of the 19th ACM international conference on Multimedia, (735-738)
- Rosa R and Lucena V Smart composition of reusable software components in mobile application product lines Proceedings of the 2nd International Workshop on Product Line Approaches in Software Engineering, (45-49)
- Polok L and Smrz P Implementing random indexing on GPU Proceedings of the 19th High Performance Computing Symposia, (134-142)
- Ajmera J, Joshi A, Mukherjea S, Rajput N, Sahay S, Shrivastava M and Srivastava K Two-stream indexing for spoken web search Proceedings of the 20th international conference companion on World wide web, (503-512)
- Diao M, Mukherjea S, Rajput N and Srivastava K Faceted search and browsing of audio content on spoken web Proceedings of the 19th ACM international conference on Information and knowledge management, (1029-1038)
- Mattmann C and Sharan M An Automatic Approach for Discovering and Geocoding Locations in Domain-Specific Web Data (Application Paper) 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI), (87-93)