因为生产环境的某业务大量写操做,致使出现了以下的错误:linux
Making DataFactory:com.raqsoft.report.dataset.SQLDataSetFactory failure (dataset named):ds1 Caused:ORA-01034: ORACLE not available ORA-27123: unable to attach to shared memory segment Linux-x86_64 Error: 22: Invalid argument Additional information: 25 Additional information: 1605653 Additional information: -1073741824
环境 | 版本信息 |
---|---|
数据库的版本 | Release 11.2.0.4.0 Production |
操做系统版本 | 2.6.32-696.el6.x86_64 |
查看环境的命令:shell
[root@newarpdb01 ~]# uname -r 2.6.32-696.el6.x86_64 [root@newarpdb01 ~]# grep memlock /etc/security/limits.conf # - memlock - max locked-in-memory address space (KB) [root@newarpdb01 ~]# grep HugePsges /proc/meminfo [root@newarpdb01 ~]# grep HugePages /proc/meminfo AnonHugePages: 22540288 kB HugePages_Total: 0 HugePages_Free: 0 HugePages_Rsvd: 0 HugePages_Surp: 0 [root@newarpdb01 ~]# grep Hugepages /proc/meminfo Hugepagesize: 2048 kB [root@newarpdb01 ~]# sysctl -a |grep shm kernel.shmmax = 536870912000 kernel.shmall = 131072000 kernel.shmmni = 4096 kernel.shm_rmid_forced = 0 vm.hugetlb_shm_group = 0 [root@newarpdb01 ~]# uname -r 2.6.32-696.el6.x86_64
基本定位在HugePages
上面数据库
系统进程是经过虚拟地址访问内存,可是CPU必须把它转换程物理内存地址才能真正访问内存。为了提升这个转换效率,CPU会缓存最近的虚拟内存地址和物理内存地址的映射关系,并保存在一个由CPU维护的映射表中。为了尽可能提升内存的访问速度,须要在映射表中保存尽可能多的映射关系。缓存
而在Linux中,内存都是以页的形式划分的,默认状况下每页是4K,这就意味着若是物理内存很大,则映射表的条目将会很是多,会影响CPU的检索效率。由于内存大小是固定的,为了减小映射表的条目,可采起的办法只有增长页的尺寸。bash
page table
是操做系统上的虚拟内存系统的数据结构模型,用于存储虚拟地址与物理地址的对应关系。当咱们访问内存时,首先访问page table,而后Linux在经过page table的mapping来访问真实物理内存(ram+swap)服务器
如何计算nr_hugepages,可使用以下的shell,hugepages_settings.sh
数据结构
#!/bin/bash # # hugepages_settings.sh # # Linux bash script to compute values for the # recommended HugePages/HugeTLB configuration # on Oracle Linux # # Note: This script does calculation for all shared memory # segments available when the script is run, no matter it # is an Oracle RDBMS shared memory segment or not. # # This script is provided by Doc ID 401749.1 from My Oracle Support # http://support.oracle.com # Welcome text echo " This script is provided by Doc ID 401749.1 from My Oracle Support (http://support.oracle.com) where it is intended to compute values for the recommended HugePages/HugeTLB configuration for the current shared memory segments on Oracle Linux. Before proceeding with the execution please note following: * For ASM instance, it needs to configure ASMM instead of AMM. * The 'pga_aggregate_target' is outside the SGA and you should accommodate this while calculating the overall size. * In case you changes the DB SGA size, as the new SGA will not fit in the previous HugePages configuration, it had better disable the whole HugePages, start the DB with new SGA size and run the script again. And make sure that: * Oracle Database instance(s) are up and running * Oracle Database 11g Automatic Memory Management (AMM) is not setup (See Doc ID 749851.1) * The shared memory segments can be listed by command: # ipcs -m Press Enter to proceed..." read # Check for the kernel version KERN=`uname -r | awk -F. '{ printf("%d.%d\n",$1,$2); }'` # Find out the HugePage size HPG_SZ=`grep Hugepagesize /proc/meminfo | awk '{print $2}'` if [ -z "$HPG_SZ" ];then echo "The hugepages may not be supported in the system where the script is being executed." exit 1 fi # Initialize the counter NUM_PG=0 # Cumulative number of pages required to handle the running shared memory segments for SEG_BYTES in `ipcs -m | cut -c44-300 | awk '{print $1}' | grep "[0-9][0-9]*"` do MIN_PG=`echo "$SEG_BYTES/($HPG_SZ*1024)" | bc -q` if [ $MIN_PG -gt 0 ]; then NUM_PG=`echo "$NUM_PG+$MIN_PG+1" | bc -q` fi done RES_BYTES=`echo "$NUM_PG * $HPG_SZ * 1024" | bc -q` # An SGA less than 100MB does not make sense # Bail out if that is the case if [ $RES_BYTES -lt 100000000 ]; then echo "***********" echo "** ERROR **" echo "***********" echo "Sorry! There are not enough total of shared memory segments allocated for HugePages configuration. HugePages can only be used for shared memory segments that you can list by command: # ipcs -m of a size that can match an Oracle Database SGA. Please make sure that: * Oracle Database instance is up and running * Oracle Database 11g Automatic Memory Management (AMM) is not configured" exit 1 fi # Finish with results case $KERN in '2.4') HUGETLB_POOL=`echo "$NUM_PG*$HPG_SZ/1024" | bc -q`; echo "Recommended setting: vm.hugetlb_pool = $HUGETLB_POOL" ;; '2.6') echo "Recommended setting: vm.nr_hugepages = $NUM_PG" ;; '3.8') echo "Recommended setting: vm.nr_hugepages = $NUM_PG" ;; '3.10') echo "Recommended setting: vm.nr_hugepages = $NUM_PG" ;; '4.1') echo "Recommended setting: vm.nr_hugepages = $NUM_PG" ;; '4.14') echo "Recommended setting: vm.nr_hugepages = $NUM_PG" ;; *) echo "Kernel version $KERN is not supported by this script (yet). Exiting." ;; esac # End
使用以下,可得:oracle
[root@newarpdb01 u01]# ./hugepages_settings.sh This script is provided by Doc ID 401749.1 from My Oracle Support (http://support.oracle.com) where it is intended to compute values for the recommended HugePages/HugeTLB configuration for the current shared memory segments on Oracle Linux. Before proceeding with the execution please note following: * For ASM instance, it needs to configure ASMM instead of AMM. * The 'pga_aggregate_target' is outside the SGA and you should accommodate this while calculating the overall size. * In case you changes the DB SGA size, as the new SGA will not fit in the previous HugePages configuration, it had better disable the whole HugePages, start the DB with new SGA size and run the script again. And make sure that: * Oracle Database instance(s) are up and running * Oracle Database 11g Automatic Memory Management (AMM) is not setup (See Doc ID 749851.1) * The shared memory segments can be listed by command: # ipcs -m Press Enter to proceed... Recommended setting: vm.nr_hugepages = 117001
修改vm.nr_hugepages参数app
# vi /etc/sysctl.conf 修改vm.nr_hugepages参数 vm.nr_hugepages = 117001 执行sysctl –p使配置生效: # sysctl -p
备注: Hugepages是在分配后就会预留出来的,其大小必定要比服务器上全部实例的SGA总和要大less
关于memlock
, 修改内核参数memlock,单位是KB,若是内存是16G,memlock的大小要稍微小于物理内存。计划lock 12GB的内存大小。参数设置为大于SGA是没有坏处的。
而后重启各个实例的RAC的数据库。
查看shared memory
[root@newarpdb01 u01]# su - oracle [oracle@newarpdb01 ~]$ $ORACLE_HOME/bin/sysresv ------ Shared Memory Segments -------- key shmid owner perms bytes nattch status 0x00000000 1441809 grid 640 4096 0 0x00000000 1474578 grid 640 4096 0 0x6d9f45cc 1507347 grid 640 4096 0 0x00000000 1572884 oracle 640 1610612736 102 0x00000000 1605653 oracle 640 243739394048 101 0xb0b15d50 1638422 oracle 640 2097152 102 定位了1605653。占用了 Linux: % ipcrm shm 1605653
而后 重启某实例的RAC数据库
srvctl stop instance -d newarpdb -i newarpdb2 srvctl start instance -d newarpdb -i newarpdb2
文献参考:
Semaphores and Shared Memory - An Overview (Doc ID 153961.1)
Startup Error ORA-27123: Unable To Attach To Shared Memory Segment Linux-x86_64 Error: 22: Invalid argument (Doc ID 1607545.1)