Stata/MP data analysis machine

A customer involved in bioinformatics-related research asked us about a machine for data analysis and simulation. The customer plans to run data analysis and simulations using Stata/MP (12 cores), R, Python, and similar tools, and intends to add machine learning in the future.
The customer requested a proposal within a budget of around 1 million yen.

All data and analysis outputs are stored on external media, so large-capacity storage is not required. The machine also does not normally need to run continuously for more than one day; such long runs do occur, but we understand they are infrequent.
The customer would prefer a consumer-grade configuration if this workload can be handled without workstation-class specifications.

The customer also asked how many CPU cores are appropriate for running Stata with a 12-core license.

Other desired conditions are as follows.

・CPU: 12 cores or more
・Memory: 128GB
・Storage: 1TB SATA SSD x2
・OS: Windows 11 Pro
・Software used: Stata/MP (12 cores), R, Python, etc.
・Budget: About 1 million yen

Based on the information we received, we proposed the following configuration.

・CPU: AMD Ryzen 9 7950X (4.50GHz, 16 cores)
・Memory: 128GB
・Storage 1: 1TB SATA SSD
・Storage 2: 1TB SATA SSD
・Video: NVIDIA RTX A5000 24GB
・Network: Onboard (2.5G x1, 10/100/1000Base-T x1), Wi-Fi x1
・Case + power supply: Tower case + 850W
・OS: Microsoft Windows 11 Pro 64-bit

We propose a consumer-grade configuration based on your planned usage.
A consumer-grade configuration will not fail immediately when run continuously for a day, but if you frequently perform long calculations, a workstation that supports ECC memory can improve reliability.
On the other hand, if continuous operation is limited to about 24 hours at most, a non-workstation configuration can be a cost-effective option.

The CPU selected is the Ryzen 9 7950X (16 cores) from the Ryzen 7000 series.
If Stata runs on all 12 cores, the maximum the license allows, on a CPU with only 12 cores, it will consume all of the CPU's resources, which may interfere with other tasks running in parallel.
Therefore, we chose a 16-core CPU to leave headroom, but you can change to a 12-core model if nothing else needs to run while Stata is working.
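
The core-count reasoning above is simple arithmetic, and can be sketched in Python as follows. This is only an illustration: the function name `core_headroom` is hypothetical, and it assumes Stata/MP occupies every core its license allows while a job runs.

```python
def core_headroom(total_cores: int, stata_license_cores: int) -> int:
    """Cores left for other work while Stata/MP uses all the cores it can.

    Assumption (for illustration only): Stata/MP fully occupies
    min(total_cores, stata_license_cores) cores during a run.
    """
    # Stata/MP cannot use more cores than the license allows or the CPU has.
    stata_cores = min(total_cores, stata_license_cores)
    return total_cores - stata_cores


if __name__ == "__main__":
    # Compare a 12-core CPU and the proposed 16-core CPU under a 12-core license.
    for cores in (12, 16):
        print(f"{cores}-core CPU: {core_headroom(cores, 12)} core(s) free")
```

Under this assumption, a 12-core CPU leaves no cores free, while the 16-core model leaves four for the OS, R, Python, or other work running alongside Stata.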

In addition, since machine learning is planned, we selected the NVIDIA RTX A5000, a high-end workstation graphics card, on the assumption that CUDA will be used. The video card can be changed according to your wishes, so please feel free to let us know.
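
Before installing a CUDA-based machine learning framework, it can help to confirm that the operating system sees the video card. The following is a minimal sketch using only the Python standard library. It assumes the NVIDIA driver is installed and that its `nvidia-smi` command-line tool is on the PATH; the function name `list_nvidia_gpus` is hypothetical.

```python
import shutil
import subprocess


def list_nvidia_gpus() -> list[str]:
    """Return one "name, total memory" string per GPU reported by nvidia-smi.

    Returns an empty list when nvidia-smi is not found or the query fails,
    for example on a machine without an NVIDIA driver.
    """
    if shutil.which("nvidia-smi") is None:
        return []
    try:
        result = subprocess.run(
            ["nvidia-smi", "--query-gpu=name,memory.total", "--format=csv,noheader"],
            capture_output=True,
            text=True,
            check=True,
            timeout=30,
        )
    except (subprocess.SubprocessError, OSError):
        return []
    # One non-empty line of output per detected GPU.
    return [line.strip() for line in result.stdout.splitlines() if line.strip()]


if __name__ == "__main__":
    gpus = list_nvidia_gpus()
    print(gpus if gpus else "No NVIDIA GPU detected via nvidia-smi")
```

This only checks that the driver can see the card. Whether a particular framework can use it also depends on that framework's own CUDA requirements.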

The configuration in this case study is based on the conditions given by the customer.
We can flexibly propose machines to suit your requirements, so please feel free to contact us even if your conditions differ from those listed here.

■FAQ

・What is Stata?
Stata is a comprehensive statistical software package with features such as data analysis, data management and chart generation.
In addition to mouse-driven GUI operation, it can be run through a powerful and intuitive command syntax, making it easy to use, fast, and accurate.

Reference: Stata *Jumps to our Unipos product page

 

・What is R?
R is a free, open-source programming language and execution environment for statistical analysis. It is used for statistical computation and graphing.
Because many libraries exist, even complex techniques can be handled simply by calling a library.

Reference: The R Project for Statistical Computing *Jumps to an external site

 

・What is Python?
Python is an object-oriented programming language copyrighted by the Python Software Foundation (PSF). Its syntax is simple and highly readable, and it offers a wide variety of components, such as libraries and frameworks, suited to different purposes. It is popular with everyone from programming beginners to advanced users.

Reference: Python *Jumps to an external site

Reference: [Special article] Why is the programming language Python so popular? - Tools to accelerate your Python programming *Jumps to our owned media "TEGAKARI"