Posts

Showing posts with the label research

The same full-text search engine for different products. Apache Luke.

Image
  Hi, awesome community!   In this article, you can read info about the tool which helps me with search index investigation.  At the moment, I do an investigation with the Russian language stemming and reuse the morphology analysis by reusing existing libraries.  So, as I understand, today we will speak about Apache Lucene index and small awesome utility Apache Luke. Because this search engine library is used for full-text search in the Apache Lucene, Solr, Elasticsearch as well.  It means Jira, Confluence, Bamboo on-premises solution used Lucene, Bitbucket used the Elasticsearch. About Cloud, I imagine the Atlassian team used Elasticsearch as it scales easier even Apache Lucene local index. e.g. for Lucene, you need to use for the replication (lucene-replicator -  https://lucene.apache.org/core/7_4_0/replicator/org/apache/lucene/replicator/Replicator.html ) or just use Elasticsearch.   Let’s use Apache Luke for the Confluence search indexes: I just co...

Houston, we have a problem! What an APM tool we will use to find a bottlenecks?

Image
  Hi!  I would like to share a series of articles related to the APM( Application performance monitoring) tool. It based on my investigation of different problems related to the performance.  Let's start from most popular in industry tool "New Relic".  1. We have a business case: it is the angry end of users :)  2. Requirements: 2.1. ASAP detect the bottleneck 2.2. Continuously monitoring the problem 2.3. Possibility to review (on browser side, on backend side and RDMS side as well) 2.4. Price and Functionality   Let's start from scratch.  1. We need to register and install ab agent in our situation is java.  You will see below panel. I have spent on test environment around 5 min to install.  2. For the realistic situation, I have installed an app with the leak. Where immediately NewRelic detects the problem:  3. On the tab of the transaction, you can find the slowest average response time:    4. After we that we can inves...

How to decrease disk usage or find interesting things during investigation on filesystem?

Image
  Hi, awesome community!  I hope you are doing well.    In this article, I'd like to share my usage a small util  fdupes .  Home page of that project located in that  GitHub .  Let's determine to exist use case:  1. We have a huge directory {jira_home}/data/attachments/ or {confluence_home}/attachments. (for bamboo and bitbucket it will not work properly) In my use case is ~750GB and ~180GB All those instances are on-premises. We need to analyze exist disk usage for duplicates and if it is possible to replace by symlinks.   So, without any more stalling, here we go. 1. Install fdupes if it is not in your system. on RHEL/CentOS-based and Fedora based system yum install fdupes dnf install fdupes [On Fedora 22 onwards] Debian based: sudo apt-get install fdupes or  sudo aptitude install fdupes macOS based: brew install fdupes   2. Next step is to change to an attachments directory fdupes --recurse --size --summarize ./attachment...