CHAPTER 2

System Management Services 1.2 Release Notes

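This chapter contains the release notes for System Management Services (SMS) 1.2 on Sun Fire 15K servers and covers the following topics:

Known Limitations

General Issues

SMS Documentation Notes

Fixed SMS 1.2 Bugs

SMS 1.2 Software Bugs/RFEs

Fixed Other Bugs

Other Bugs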


Known Limitations

This section contains known limitations that involve SMS on the Sun Fire 15K system.


General Issues

This section contains general issues that involve SMS on Sun Fire 15K systems.

System Controller External Network Configuration

Each system controller (SC) must be configured for the TCP/IP network to which it is attached. Refer to the System Administration Guide: Resource Management and Network Services of the Solaris 9 System Administrator Collection for details on planning and configuring a TCP/IP-based network. SMS supports both IPv4 and IPv6 configurations.

In this release, the SC supports network connections through the RJ45 jacks on the faceplate of each SC. These correspond to the network interfaces hme0 and eri1 under Solaris software for each SC. You must configure hme0 or eri1 on each SC with appropriate information for your TCP/IP network. Using this configuration, each SC is known to external network applications by a separate IP hostname and address.

Each SC operates in one of two mutually exclusive modes: main or spare. The SC that is in main mode is the SC that controls the machine. The SC that is in spare mode acts as a spare that automatically takes over if the main SC fails. It is important to know which system controller is the main SC and which is the spare SC. To determine the SC role, log in to the SC and use the following command:

sc0:sms-user:> showfailover -r
MAIN

External network-based applications such as Sun Management Center, telnet, and others must be given the IP hostname of the main system controller. After an SC failover, these applications must be restarted with the IP address of the new main SC.
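Because external applications must be pointed at whichever SC is currently main, a script can make that decision from the output of showfailover -r. The following is a minimal sketch and not part of SMS: the function name is_main_sc is hypothetical, and it assumes only that the command prints MAIN on the main SC, as in the example above, and something else on the spare.

```shell
# Hypothetical helper, not part of SMS.
# Succeeds (exit status 0) when the role string passed in is MAIN.
# The role string is expected to be the output of "showfailover -r".
is_main_sc() {
    case "$1" in
        MAIN) return 0 ;;
        *)    return 1 ;;
    esac
}

# Example use on an SC (commented out because it needs SMS installed):
# if is_main_sc "$(showfailover -r)"; then
#     echo "This SC is the main SC"
# fi
```

Run the check on each SC after a failover before restarting external applications against the new main SC.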



Note - Any changes made to the network configuration on one SC using smsconfig -m must be made to the other SC as well. Network configuration is not automatically propagated.



IPSec Configuration

Disks intended to be used on a Sun Fire 15K system must be installed using a Sun Fire 15K machine. Policy placed in /etc/inet/inetd.conf must be added manually to /etc/inet/ipsecinit.conf as well.

Whenever policy is taken out of /etc/inet/inetd.conf it must be removed manually from /etc/inet/ipsecinit.conf also.

Refer to Bug Id: 4449848.

Hardware

When a board breaker is turned off and the board is ready to be taken out of the system, I2C timeout errors are seen. These messages are notifications and do not indicate that an error has occurred. They can be ignored.


SMS Documentation Notes

Location

Software documentation for this release is provided, in PDF form, at the following location:

/cdrom/cdrom0/System_Management_Services_1.2/Docs

These PDF files are named by part number. For your convenience, here are the associated document titles:

816-3267-10.pdf - System Management Services (SMS) 1.2 Administration Guide

816-3268-10.pdf - System Management Services (SMS) 1.2 Reference Manual

816-3269-10.pdf - System Management Services (SMS) 1.2 Installation Guide and Release Notes

816-3285-10.pdf - Sun Fire 15K Software Overview Guide

816-4279-10.pdf - System Management Services (SMS) 1.2 Dynamic Reconfiguration User Guide

Documentation Errata

Because of late changes to the software, you may see slight differences between the screen snapshots shown in the Installation Guide and what appears on your screen during installation.

The System Management Services (SMS) 1.2 Reference Manual contains corrected text for each of the following man pages, but the man pages themselves do not.

Man Pages

console List of Valid Escape Characters

The list of valid console escape characters for the -e option is wrong in the man page. The characters listed there are the ones you cannot use; any other character is valid.

See BugID 4592909.

disablecomponent and enablecomponent Missing Paroli Link Operand

The enablecomponent and disablecomponent man pages do not document support for Paroli modules on wPCI boards.

The following operand is supported:

board_loc/paroli_link

The following paroli_link forms are valid:

PAR(0|1)

For example:

sc0:sms-user:> disablecomponent IO7/PAR0
sc0:sms-user:> showcomponent
Component PARS at IO7/PAR0 is disabled <no reason given>

See BugID 4546763.
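Since the man pages omit the operand, a quick format check can catch typing errors before the command is run. The sketch below is an illustration only: is_paroli_operand is a hypothetical name, it assumes a POSIX shell such as ksh, and it checks just the paroli_link part (PAR0 or PAR1), because the valid board_loc values are not listed here.

```shell
# Hypothetical helper, not part of SMS.
# Succeeds when the operand has the form board_loc/paroli_link and the
# paroli_link part is PAR0 or PAR1. The board_loc part is only checked
# for being non-empty, because its valid values are not listed here.
is_paroli_operand() {
    case "$1" in
        */*) ;;                 # must contain a slash
        *)   return 1 ;;
    esac
    board_loc=${1%/*}           # text before the last slash
    paroli_link=${1##*/}        # text after the last slash
    [ -n "$board_loc" ] || return 1
    case "$paroli_link" in
        PAR0|PAR1) return 0 ;;
        *)         return 1 ;;
    esac
}
```

For example, is_paroli_operand IO7/PAR0 succeeds, while is_paroli_operand IO7/PAR2 fails.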

showplatform Missing Domain Down

Domain Down is missing from the list of domain statuses. Domain Down indicates that the domain is down and setkeyswitch is set to ON, DIAG or SECURE. To restore the domain use:

sc0:sms-user:> setkeyswitch off
sc0:sms-user:> setkeyswitch on

For more information on showplatform, refer to Chapter 7 in the System Management Services (SMS) 1.2 Administration Guide.

See BugID 4620748.
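The off-then-on keyswitch sequence for a domain in the Domain Down state can be wrapped so that the second step never runs if the first one fails. This is a sketch under stated assumptions: restore_domain is a hypothetical name, and selecting the domain with -d domain_id is an assumption, since the example above omits the domain argument.

```shell
# Hypothetical wrapper, not part of SMS.
# Cycles the virtual keyswitch of one domain: off, then on.
# Assumption: the domain is selected with "-d domain_id"; the example
# in the text omits the domain argument.
restore_domain() {
    domain_id=$1
    if [ -z "$domain_id" ]; then
        echo "usage: restore_domain domain_id" >&2
        return 2
    fi
    # Stop if the first step fails, so "on" is never sent after a failed "off".
    setkeyswitch -d "$domain_id" off || return 1
    setkeyswitch -d "$domain_id" on
}
```

The wrapper only sequences the two commands; any confirmation prompts from setkeyswitch still appear as usual.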

smsconfig Options for Adding and Removing Users Incomplete

The smsconfig man page command synopsis does not list options for adding domain users or removing platform users. The -a and -r options need to be added to each list:

smsconfig -a|-r -u username -G admn|oper|svc platform
smsconfig -a|-r -u username -G admn|rcfg domain_id

See BugID 4626077.
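The corrected synopsis allows different group names for the platform and for a domain. A small check such as the following can guard a script against passing the wrong group. It is a sketch only: valid_sms_group is a hypothetical name, and any scope other than the literal word platform is treated as a domain_id.

```shell
# Hypothetical helper, not part of SMS.
# Succeeds when the group name is valid for the given scope, according to
# the corrected synopsis: admn, oper, or svc for "platform";
# admn or rcfg for a domain.
valid_sms_group() {
    scope=$1
    group=$2
    case "$scope" in
        platform)
            case "$group" in
                admn|oper|svc) return 0 ;;
            esac
            ;;
        "")
            return 1 ;;
        *)  # anything else is treated as a domain_id
            case "$group" in
                admn|rcfg) return 0 ;;
            esac
            ;;
    esac
    return 1
}
```

For example, valid_sms_group platform oper succeeds and valid_sms_group platform rcfg fails.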


Fixed SMS 1.2 Bugs

This section contains bugs fixed since SMS 1.1.

setkeyswitch Returns Success on Failure (BugId 4430866)

If setkeyswitch is already running for a domain, and you try to run it again, an error message is printed, but the return code is 0. A non-zero result would indicate failure.

Board Test Status Gets Lost After Failover (BugId 4431636)

When failover occurs, pcd receives poweron events from esmd. pcd clears the test status field of those boards being reported as powered on by esmd (even though in reality they are not being powered on).

Console Session Would Not Open (BugId 4447218)

A console session does not connect if dxs/dca are not running.

dsmd May Leave Domain Off (BugId 4448476)

If a domain does not perform an environmental shutdown quickly enough, dsmd may leave it off. esmd is not sending a recover event to dsmd.

Strange Character in frad Messages (BugId 4466020)

frad messages in the message log files sometimes contain a bad string in place of the FRUID. This does not crash the daemon and nothing needs to be done.

SMS CLIs Must Revoke Privileges for Platsvc (BugId 4477169)

The following commands should not be executable by platsvc:

disablecomponent, enablecomponent, flashupdate, poweron, poweroff, resetsc, setbus, setfailover.

showdate Does Not Conform to the Privileges Table and man pages (BugId 4477357)

showdate privileges are incorrect and allow all users access to both the platform and the domains. showdate should be executable as follows:

Platform administrator, operator and service can only run showdate for the platform. Domain administrator and configurator can only run showdate on the domain for which they have privileges.

Usage Message for showcomponent Needs to Be Updated (BugId 4477464)

The usage message for the showcomponent command does not match the man page. It needs to be updated to follow the accepted nomenclature.

showkeyswitch Failed When Run with Platform Permissions (BugId 4477473)

Only the platform admin can run showkeyswitch for a domain.

Workaround : The platoper or platsvc will need to run showplatform -d domain_id to see the keyswitch state of domains.

esmd Gives Incorrect Power Information at Startup Time (BugId 4479317)

esmd calculates available power based on how many power supplies it has probed. At startup time, esmd registers a power supply failure because it has not yet probed all the power supplies. It then logs an incorrect message about available power.

SMS Needs to Handle Lock Timeouts More Cleanly (BugId 4484180)

When running SMS operations (setkeyswitch, for example) on machines with many domains (more than 10), you see failures due to "lock acquisition failures".

smsconnectsc Should Not Support "-q" Option (BugId 4484857)

Currently, smsconnectsc supports the "-q" command line option, which suppresses all messages to stdout, including prompts, so you do not get the tip console.

setkeyswitch on Should Provide Notification When Waiting to Start POST (BugId 4485413)

If POST is already running on several domains, setkeyswitch may appear to hang before starting POST. It can take 50 minutes or more to finish.

esmd Does Not Do a Graceful Domain Shutdown (BugId 4487091)

When both CSBs overheat simultaneously, esmd does not gracefully shut down the domain.

Duplicate Container Nodes in PICL FRU Tree After esmd Dies (BugId 4488053)

Occurs when esmd restarts.

Domain Log Fills Up with Mailbox Messages (BugId 4488179)

The internal network fails when Sun Management Center starts up. The domain can be reached by the external network but not by the internal network.

xir Outputs Lock Error Messages (BugId 4488549)

A library routine is trying to get status. This does not affect the operation, only the return code.

Error Messages Before SC Has Become Main (BugId 4489856)

After starting SMS but before the SC has become main, hwad and fomd error messages are printed in the platform log. These error messages vanish once the SC has become main.

Sometimes the SC Cannot Become Main (BugId 4489958)

After running setfailover force, the desired new main sometimes has problems becoming main. pcd repeatedly fails to start up. The SC eventually gives up and remains in an UNKNOWN state until it is either reset or SMS is cycled. The old main comes back up, does not detect interrupts, and then assumes the main role.

Relevant /etc Files Should Be Backed up with smsbackup (BugId 4490943)

Files under the /etc directory are not backed up by the smsbackup command. These include but are not limited to: /etc/hosts, /etc/nsswitch.conf, /etc/group, and /etc/hostname.*. Consequently, an smsrestore does not completely restore a system to its previous working state.

dsmd Loops Trying to Clear Recordstop (BugId 4492052)

dsmd attempts to clear recordstops after the hardware state dump is taken. The recordstop may not be cleared if the lowest numbered expander board is unconfigured. This causes dsmd to continue taking recordstop dumps indefinitely.

Periodic File Propagation Not Working (BugId 4496790)

After starting SMS on main and spare, the platform message file does not get copied to spare. Other files in /var/opt/SUNWSMS/adm/A...R get copied once when starting failover, but never again. pcd files get propagated but other files don't.

ssd Should Log When SMS is Ready (BugId 4529989)

It is not clear when SMS is loaded and ready for use.

Workaround : Use the showfailover command. When it completes, SMS is ready.

Failover Bugs

addboard Hangs After Interrupted by Failover (BugId 4459812)

The pcd database and checkpoint files failed to propagate to the other SC before the failover occurs.

Domain Dstop During SMS Start on Previous Spare SC (BugId 4469482)

Both SCs' clocks are phase locked when SMS is running. This creates a failover without the benefit of having SMS phase locking the system clocks. This, naturally, leads to a DStop.

Management Network Bugs

smsconfig Does Not Work Properly with IPv6 Addresses (BugId 4411113)

You cannot specify IPv6 addresses.

smsconfig Allows Mixed IPv4 and IPv6 (BugId 4411819)

smsconfig Should Set Appropriate OS Variables (BugId 4434696)

smsconfig should set the following IP ndd variables:

ip_forwarding

ip6_forwarding

ip_respond_to_echo_broadcast

ip6_respond_to_echo_multicast

to false using ndd. These settings should be configured to persist across reboots (add them to the appropriate rc script).
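For reference, the equivalent manual settings might look like the commands printed by the sketch below. This is an illustration, not what smsconfig actually runs: print_ndd_commands is a hypothetical name, 0 is used for false, and reaching all four variables through the /dev/ip driver is an assumption. The function only prints the commands so they can be reviewed before being added to an rc script.

```shell
# Hypothetical helper, not part of SMS.
# Prints, without running them, ndd commands that set each variable
# listed above to 0 (false).
# Assumption: all four variables are reached through the /dev/ip driver.
print_ndd_commands() {
    for var in ip_forwarding ip6_forwarding \
               ip_respond_to_echo_broadcast ip6_respond_to_echo_multicast
    do
        echo "ndd -set /dev/ip $var 0"
    done
}
```

Printing rather than executing keeps the sketch safe to run on any host and leaves the choice of rc script to the administrator.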

mand Does Not Work With IPv6 Addresses (BugId 4486879)


SMS 1.2 Software Bugs/RFEs

The following are known SMS 1.2 software bugs.

setkeyswitch Hangs After Control-c (BugId 4349640)

setkeyswitch may hang after you send a control-c (SIGINT) signal.

Workaround : If control-c does not work, you can regain the prompt by killing the process using kill -9.

esmd Warnings and Errors Do Not Appear in Associated Domain Logs (BugId 4382784)

esmd logs all environmental events that affect one or more domains to the platform log but not the domain log.

Workaround : None. Refer to the platform log where the messages are logged.

kmd Does Not Delete Security Associations on the Domain (BugId 4403149)

After a failover, kmd does not delete security associations on the domain. The security associations (SAs) are associated with socket connections between DCA to and from DCS or DXS to and from CVCD. The SAs for the SC which failed over are the ones which should have been deleted.

The SAs would be useful only for a client on the failed over SC with sockets bound to the ports in the SAs.

Workaround : Use the Solaris ipseckey(1M) command on the domain to delete SAs which have the IP address of the failed over SC.

Socket Error When Using the I2 Net (BugId 4472333)

After a failover/takeover, the following errors are sometimes seen when failover is activated and file propagation begins:

" /var/opt/SUNWSMS/data/.failover/chkpt/chkpt.list " failed - "rcmd: socket: Cannot assign requested address."

This prevents file propagation from working.

Workaround : None. File propagation will take place eventually.

dsmd Should Checkpoint ASR Steps (BugId 4477381)

If a failover occurs while dsmd is performing a domain recovery, dsmd may not complete the domain recovery.

Workaround : Complete the recovery manually using setkeyswitch off, setkeyswitch on, and, if necessary, booting the domain.

rcfgadm Fails After a Failover That Occurred During DR (BugId 4478467)

If a failover occurs in the middle of an rcfgadm operation, the operation fails when restarted after the failover.

Workaround : None.

esmd Shouldn't Log hPCI Cassette Removal and Insertion Messages During Power On and Off (BugId 4483155)

Whenever you turn an hPCI board on and off, esmd logs messages indicating that its cassettes were inserted/removed.

Workaround : Ignore the messages.

mld Leaks 8K When 8 Domains Brought Up in Parallel to OS (BugId 4497251)

Booting 8 domains in parallel to the OS level could result in a failover when the SC runs out of memory.

Workaround : Do not boot 8 domains in parallel.

dsmd Should Increase POST Level for Next Run If ASR Boot Fails in OBP (BugId 4513721)

dsmd distinguishes two types of domain reboot. A domain reboot to recover from software failures such as domain panic or heartbeat stop is performed with minimal POST. A reboot to recover from hardware failures such as domain stop, or from repeated software failure, is performed with regular POST. Currently the dsmd-invoked POSTs always use the hpost level specified in the .postrc file, and this hpost level does not change between ASR retries. dsmd should handle such boot failures by retrying the ASR reboot with POST invoked at a higher hpost level.

Workaround : None.

dsmd Is Slow to Pick Up Domain Nodename on SMS Startup (BugId 4514742)

When the system brings up a large number of domains, commands like showplatform will not display all domain nodenames at once. It can take several iterations to complete the display.

Workaround : Wait until dsmd finishes.

The SC Rack ID is Not Displayed (BugId 4515072)

The SC should set the frame name which is displayed on the LCD of the Frame Manager.

Workaround : None.

Frame Manager LEDs are Not Being Used to Signal Problems (BugId 4515074)

The SC should signal faults with itself, the other SC, or the system it is monitoring on the Frame Manager's amber LEDs.

Workaround : None.

hwad Errors Appear During setkeyswitch off (BugId 4524488)

The following messages have been seen in the platform log:

hwad[22334]: [1128 52700900610948 ERR JtagSerengeti.cc 224] 
Reading bus failed in address 40200050, ecode=1217
hwad[22334]: [0 52700901823489 ERR post_logif.c 60]
jt_read_chain(m=98,c=0,i=0x41,nbits=1112,,0) error 1217
hwad[22334]: [1158 52700902546590 ERR InterruptHandler.cc 2562] 
libxcpost call xcp_read_axq_scint_status() failed, ecode=-1

Workaround : Ignore the messages.

rcfgadm Exits With an Error When -o unassign is Used and the SB is Not in the Available Component List (BugId 4530028)

If -o unassign -c disconnect is used, the unassign is passed as an option to a domain function. In this case, the unassign is performed with the domain administrator privileges even if the user on the SC has platform administrator privileges.

Workaround : For the following example there are two possible workarounds.

sc0:sms-svc:>rcfgadm -da -v -c disconnect -o unassign SB0
This fails because SB0 is not in Domain A's available component list.

Add SB0 to the available component list of domain A.

sc0:sms-svc:>setupplatform -da -a SB0

Alternatively, with both domain and platform administrator privileges, run rcfgadm twice: first to disconnect SB0 (using domain privileges), then to unassign it (using platform privileges).

sc0:sms-svc:> rcfgadm -da -v -c disconnect SB0
sc0:sms-svc:> rcfgadm -da -x unassign SB0

Error Message About Max Number of Connected Consoles (BugId 4530492)

The following message has been seen in the domain log:

dxs[8753]-C(): [4911 12439774264309 ERR ConsoleService.cc 506] 
DXS - maximum number of connected consoles reached

This indicates that the maximum number of console processes has been reached.

Workaround : Close some open consoles. If that does not work, kill (1) the console process.

Power Gives the Wrong Status for SC0 (spare) (BugId 4533114)

Whenever any of the power converters on the SC is powered off, the SMS poweron command shows that board as off, even though it is up and running. Sometimes showboards -v shows that the spare SC is off when the SC is on and failover is active.

Workaround : Make sure all power converters are on. Then use poweroff and poweron on the spare SC.

dsmd Recovery State Sometimes Lost After Core Dump (BugId 4533133)

When some of the domains are trying to recover from a failure and dsmd core dumps, dsmd can lose the recovery state.

Workaround : Reboot the domain using setkeyswitch off, setkeyswitch on.

smsconnectsc Does Not Connect to Other SC If Powered Off (BugId 4533385)

smsconnectsc asks the user if they want to power on the other SC, then it does the poweron and exits without printing any further instructions or information. It should automatically connect to the SC after it has powered it on and not prompt.

Workaround : None.

esmd Uses the Wrong Thresholds for WCI Temps, So it Will Not Shut Them Off in Time (BugId 4533897)

If a wPCI ASIC overheats, you may lose the ASIC.

Workaround : None.

Comment in kmd_policy.cf is Misleading (BugId 4545879)

The comment in the kmd_policy.cf file is inaccurate. It states that specific domains should be identified using an integer in 0 - 17. It should state that a letter in A - R identifies a domain.

Workaround : Use domain letters in the kmd_policy.cf file rather than numbers to identify specific domains.
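If existing notes or scripts refer to domains by number, a conversion such as the following may help when editing the file. It is a sketch under an assumption that is not stated in these notes: that 0 corresponds to A, 1 to B, and so on up to 17 for R. The name domain_letter is hypothetical.

```shell
# Hypothetical helper, not part of SMS.
# Converts a domain number in 0-17 to a domain letter in A-R.
# Assumption: 0 corresponds to A, 1 to B, and so on up to 17 for R.
domain_letter() {
    case "$1" in
        ''|*[!0-9]*) return 1 ;;   # reject empty or non-numeric input
    esac
    [ "$1" -le 17 ] || return 1
    position=$(expr "$1" + 1)      # cut counts characters from 1
    echo "ABCDEFGHIJKLMNOPQR" | cut -c "$position"
}
```

The function fails with a non-zero status, and prints nothing, for any value outside 0 - 17.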

disablecomponent and enablecomponent Man Pages Do Not Contain wPCI Support for Paroli Modules (BugId 4546763)

SMS 1.2 software supports disabling and enabling Paroli modules on wPCI boards. The man pages do not list paroli_link as a valid form.

Workaround : See disablecomponent and enablecomponent Missing Paroli Link Operand for examples on blacklisting Paroli modules. Refer to the System Management Services (SMS) 1.2 Reference Manual for the corrected text.

esmd Fails to Power Off a Paroli When One of its Voltages is Out of Range (BugId 4548844)

esmd detects a voltage condition but fails to turn the Paroli module off.

Workaround : None.

SMS1.2 Shouldn't Restore to 1.1 When MAN.cf file Contains Incompatibilities (BugId 4554667)

smsrestore for 1.2 restores an incompatible version of the MAN.cf to SMS 1.1. Switching back from 1.2 to 1.1 once the new MAN features have been enabled is not supported.

Workaround : After installing SMS 1.2 and switching to it with smsversion, rerun smsconfig.

Clocks Are Not Set Properly When the Main SC is Powered off During failover (BugId 4554753)

You can lose clock source, causing domains to DSTOP.

Workaround : None.

crontab Entries Contain the Wrong Command Path (BugId 4586065)

The absolute paths in the listed crontab entries are incorrect.

Workaround : These entries are not implemented in this release. Remove the following crontab entries:

10 4 1 * * /var/opt/SUNWSMS/bin/codlogrotate          # SUNWSMSop
0 10 * * 1 /var/opt/SUNWSMS/bin/audithotspares        # SUNWSMSop
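One way to remove the two entries without hand-editing is to filter a saved copy of the crontab. The sketch below is an illustration only: strip_sms_cron_entries is a hypothetical name, and it matches the entries by the command paths shown above.

```shell
# Hypothetical filter, not part of SMS.
# Reads crontab entries on standard input and writes them to standard
# output without the codlogrotate and audithotspares entries.
strip_sms_cron_entries() {
    grep -v -e '/var/opt/SUNWSMS/bin/codlogrotate' \
            -e '/var/opt/SUNWSMS/bin/audithotspares'
}

# Example use (file names are placeholders):
# crontab -l > /tmp/cron.orig
# strip_sms_cron_entries < /tmp/cron.orig > /tmp/cron.new
# crontab /tmp/cron.new      # install only after reviewing cron.new
```

Writing to a separate file first makes it possible to compare the old and new crontab before installing the change.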

Rebooting a Domain Can Result in a Panic (BugId 4587418)

Issuing the reboot command on a domain, issuing the boot command after shutdown on a domain and some dsmd ASR reboots will cause a domain to panic.

Workaround : Install the patch associated with this BugId. The patch is available at: http://sunsolve.sun.com. Until the patch can be installed, you can use setkeyswitch standby, then setkeyswitch on, to reboot the domain.

dsmd Holds the tmd Lock of a Thread That Doesn't Exist Anymore (BugId 4587746)

This can cause setkeyswitch to hang.

Workaround : Restart tmd and dsmd.

console(1M) Man Page List of -e Escape Characters Is Wrong (BugId 4592909)

The list of valid escape characters in the man page is wrong. The only characters you cannot use are: # @ ^ & ? * = . |

Workaround : Use any character other than those listed. Refer to the System Management Services (SMS) 1.2 Reference Manual for the corrected text.
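A candidate escape character can be checked against the excluded characters before it is passed to console -e. The following is a minimal sketch: valid_escape_char is a hypothetical name, a POSIX shell is assumed, and the excluded set is exactly the nine characters listed above.

```shell
# Hypothetical helper, not part of SMS.
# Succeeds when its argument is a single character that is not one of the
# excluded characters: # @ ^ & ? * = . |
valid_escape_char() {
    [ "${#1}" -eq 1 ] || return 1
    case "$1" in
        '#'|'@'|'^'|'&'|'?'|'*'|'='|'.'|'|') return 1 ;;
        *) return 0 ;;
    esac
}
```

Each pattern is quoted so that ? and * are compared literally instead of acting as wildcards.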

hwad Core Dumps After a Lock Timeout (BugId 4593197)

This will only occasionally happen.

Workaround : None.

pcd is Not Being Propagated to the Spare SC (BugId 4614577)

During complex operations, for example, setkeyswitch(1M), the pcd on the Spare SC can get out of sync with the pcd on the Main SC. If a failover occurs while they are out of sync, the new Main SC may not recognize a given domain. This leaves the domain unmonitored, disabling console access and domain logging from the SC.

Workaround : Execute a command such as addtag(1M) after the setkeyswitch completes. This updates the pcd and thus propagates it. The other option is to use the backup option of setdatasync(1M) to propagate it. However, the platform message logs on the Spare SC will be overwritten by the Main SC's. This is tracked as the following bug:

4619939 setdatasync backup overwrites platform message logs on SPARE SC

If You Degrade CP to CSB Associated With Spare and Failover, DARB Interrupt Will Break (BugId 4616931)

Depending on which CP is degraded, DStops may not be handled.

Workaround : None.

Domain Failed to Get to OK Prompt, Due to dxs (BugId 4617560)

Once in a while the domain console hangs.

Workaround : Restart dxs.

showplatform Returns the Wrong Code When Successful (BugId 4618917)

A 1 is returned instead of a 0 when showplatform(1M) completes successfully.

Workaround : None.

Domains Fail POST Several Times When dsmd Recovers From Platform Power Failure (BugId 4619655)

When dsmd recovers a domain after a platform power failure, POST may fail on the domain one or more times, but dsmd retries POST until it is able to restart the domain.

Workaround : None.

showplatform(1M) Man Page Missing "Domain Down" in Status List (BugId 4620748)

"Domain Down" is missing from the domain status list in the showplatform man page.

Workaround : See showplatform Missing Domain Down for an explanation of "Domain Down." Refer to the System Management Services (SMS) 1.2 Reference Manual for the corrected text.

esmd Can Decrease Fan Speeds Even Though it Has Detected a Hot Sensor (BugId 4620872)

If esmd detects a hot sensor within a minute after it starts up, it may decrease fan speeds in spite of the sensor.

Workaround : None.

smsconfig Man Page Options for Adding and Removing Users are Incomplete (BugId 4626077)

The syntax for smsconfig is incorrect. The -a option only shows platform users and the -r option only shows domain users. Both options need their complement added.

Workaround : See smsconfig Options for Adding and Removing Users Incomplete for an example of the correct syntax. Refer to the System Management Services (SMS) 1.2 Reference Manual for the corrected text.

cmdsync Commands Do Not Work When Failover Disabled (BugId 4626440)

Normally, if failover happens in the middle of a cmdsync command execution, the new main continues and completes the commands before it disables failover. Sometimes, however, failover is disabled before the commands have finished running and they do not complete.

Workaround : Rerun the commands manually.

addboard Returns EACCES Error When -c assign Invoked as Platadmn (BugId 4627926)

The platform administrator does not have access to the /etc/opt/SUNWSMS/config/domain_id/ directories. Domain-specific blacklist and postrc entries are not visible to an administrator who has only platadmn privileges.

Workaround : None for a platadmn, but obtaining domain administrator privileges allows you to view the domain-specific files.

fomd (main) Can Use 90% of CPU Time (BugId 4628484)

A thread in fomd can get caught in a loop and use a lot of CPU cycles.

Workaround : Stop and restart SMS:

sc0: # /etc/init.d/sms stop
sc0: # /etc/init.d/sms start

osdTimeDeltas Missing From fomd_sys_datasync.cf (BugId 4628978)

osdTimeDeltas does not get propagated to the Spare SC. This may throw off the time-of-day for all domains.

Workaround : Use setdatasync(1M) to propagate the file.

dsmd Can Hold a Lock During 18 Domain Boot (BugId 4629474)

The failure could happen when 18 domains are trying to boot.

Workaround : Stop and restart SMS, and boot fewer than eight domains at a time:

sc0: # /etc/init.d/sms stop
sc0: # /etc/init.d/sms start

"Attach Ready" State of System Boards Must Be Cleared When No Domains are Active (BugId 4629480)

Boards that are not powered off after all domains are brought down may cause a DStop when the domains are brought back up. Certain conditions must be met for this to occur.

Workaround : Power off all boards after all domains have been brought down.

SMS Software Will Not Start After Upgrade From SMS1.1 (BugId 4632095)

Some of the contents of an smsbackup from SMS1.1 are not compatible with SMS1.2. If an smsrestore is performed in SMS1.2 using an SMS1.1 backup file, SMS will not start up.

Workaround : Before restoring a backup file created by SMS1.1, manually back up the following files:

/etc/opt/SUNWSMS/SMS/config/esmd_tuning.txt

/etc/opt/SUNWSMS/SMS/config/fomd.cf

/etc/opt/SUNWSMS/SMS/startup/ssd_start

/etc/opt/SUNWSMS/SMS/startup/sms_env.sh

After running smsrestore, replace the restored files with the ones saved above.
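The save and replace steps of this workaround can be sketched as a single copy helper used in both directions. This is an illustration, not a supported procedure: copy_sms_files and the holding directory in the usage example are hypothetical, and only the four file paths come from the text above.

```shell
# The four files named in the workaround above.
SMS_KEEP_FILES="/etc/opt/SUNWSMS/SMS/config/esmd_tuning.txt
/etc/opt/SUNWSMS/SMS/config/fomd.cf
/etc/opt/SUNWSMS/SMS/startup/ssd_start
/etc/opt/SUNWSMS/SMS/startup/sms_env.sh"

# Hypothetical helper: copy_sms_files from_root to_root
# Copies each listed file from under from_root to the same path under
# to_root, creating directories as needed. Use "" for the real root.
copy_sms_files() {
    from_root=$1
    to_root=$2
    for f in $SMS_KEEP_FILES; do
        mkdir -p "$(dirname "$to_root$f")" || return 1
        cp -p "$from_root$f" "$to_root$f" || return 1
    done
}

# Example use (the holding directory is a placeholder):
# copy_sms_files "" /var/tmp/sms12-keep     # before smsrestore
# copy_sms_files /var/tmp/sms12-keep ""     # after smsrestore
```

Keeping the full path under the holding directory means the same function restores the files to their original locations.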

smsbackup Will Only Accept Absolute Path Names (BugId 4633179)

If a relative path name is passed to the command, the backup fails. In two cases, . and ./, the command prints the following:

/opt/SUNWSMS/bin/smsbackup.
smsbackup: Backup to tape succeeded: ./sms_backup.1.2.cpio
SMS backup complete.

This is incorrect. No file is generated in the current directory.

Workaround : Use an absolute path name.
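A wrapper can refuse relative path names so that the misleading success message never appears. The sketch below is an illustration only: safe_smsbackup is a hypothetical name, and passing the target directory as the single argument to smsbackup is an assumption about the command's usage.

```shell
# Hypothetical wrapper, not part of SMS.
# Runs smsbackup only when the target is an absolute path name.
# Assumption: smsbackup takes the target directory as its only argument.
safe_smsbackup() {
    case "$1" in
        /*) smsbackup "$1" ;;
        *)  echo "smsbackup needs an absolute path name: $1" >&2
            return 1 ;;
    esac
}
```

With this wrapper, safe_smsbackup . and safe_smsbackup ./ fail immediately instead of reporting a backup that was not written.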

Management Network (MAN) Bugs

This section contains the synopses and Sun BugID numbers of the more important bugs that have been discovered regarding MAN. This list does not include all bugs.

MAN Driver Error Message During Net Install (BugId 4368815)

While net booting a domain using the SC as the install server, and going over the MAN, the following error is displayed while the Solaris software is coming up:

ifconfig: setifflags: SIOCSLIFFLAGS: eri1: Cannot assign requested address

Workaround : Ignore it.

MAN Driver Configuration Function Should Live in sysidtool Framework (BugId 4469050)

If sys-unconfig is run on a domain preconfigured with Solaris software, the /etc/hostname.dman0 files are lost. They are not recreated on a reconfiguration boot, and the MAN network between the SC and the domain does not come up.

Workaround : Refer to Unconfigured Domains.

Domain MAN Configuration Problem When Booted with Another Domain's Boot Disk (BugId 4482112)

If a boot disk that was installed on another domain is used to boot a domain, the dman0 interface on the domain is configured with the wrong IP address.

Workaround : Refer to Unconfigured Domains.

MAN I1 Network IP Address of an Installed Domain Using smsconfig -m Does Not Reflect Changes on the Domain (BugId 4484851)

If there are already installed domains and you have changed the MAN I1 network configuration using smsconfig -m then you will need to configure the MAN network information on the already installed domains by hand.

Workaround : Refer to Unconfigured Domains.

Delay in I1 Networking Coming Up After Failover (BugId 4627984)

For certain cases there may be a delay in the start up of the I1 network.

Workaround : Run ifconfig (1).

You must be logged in as superuser on the SC.

sc0:#ifconfig scman0 down
sc0:#ifconfig scman0 up


Fixed Other Bugs

This section contains bugs fixed since SMS 1.2.

Volume Manager (BugId 4355643)

Volume Manager cannot cope with some CD-ROM formats.

Memory Bug (BugId 4457384)

Holding proc_t->p_lock while allocating memory leads to hung clock() & heartbeat.

NTP On the Domain Does Not Sync with the SC (BugId 4467470)

xntpd on the domain should gradually adjust the clock to sync with the SC clock. Instead, a message appears about a half hour after starting xntpd:

sun15-b xntpd[1324]: [ID 774427] time reset (slew) -54.206802 s

The amount printed is the difference between the SC and domain clocks, but the clocks are never in sync.

IPMP Fail eri1 Path Group (BugId 4469112)

When the external network is configured so that there are two communities, with hme0 in one community and eri1 in the other, IPMP fails the path group with eri1 in it.


Other Bugs

This section contains the synopses and Sun BugID number of the more important bugs that have been discovered regarding the Sun Fire 15K system. This list does not include all bugs.

IP_RPUT_DLPI Error (BugId 4419505)

ip_rput_dlpi(fcip0): DL_ERROR_ACK error message on boot from cd image

The following error messages are seen when doing an installation to set the SC as the install server.

Configuring /dev and /devices
ip_rput_dlpi(fcip0): DL_ERROR_ACK for DL_ATTACH_REQ(11), errno 8, unix 0
ip_rput_dlpi(fcip0): DL_ERROR_ACK for DL_BIND_REQ(1), errno 3, unix 0
ip_rput_dlpi(fcip0): DL_ERROR_ACK for DL_PHYS_ADDR_REQ(49), errno 3, unix 0
ip_rput_dlpi(fcip0): DL_ERROR_ACK for DL_UNBIND_REQ(2), errno 3, unix 0
ip_rput_dlpi(fcip0): DL_ERROR_ACK for DL_DETACH_REQ(12), errno 3, unix 0
Using RPC Bootparams for network configuration information.

This indicates that the IP over Fibre Channel device instance 0 does not exist.

Workaround : Ignore it.

ohci Driver Does Not Receive SOF Interrupts (BugId 4485012)

The Sun Fire 15K server does not currently support USB devices. Due to interaction with the corresponding software device driver in Solaris, users may experience significant delays while booting SCs and domains. In addition, messages similar to the following might be seen in the console output during boot or in system log files:

WARNING: <device_tree_path> (ohci0): No SOF interrupts (refer to ohci(7D))

Since USB devices are not yet supported on the Sun Fire 15K, there is no workaround that will enable them. However, adding the following line to the /etc/system file on the SC and on each domain will eliminate the unnecessary boot delays and warning messages:

exclude: drv/ohci
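Because the line has to be added on the SC and on every domain, a small idempotent helper avoids duplicate entries when the step is repeated. This is a minimal sketch: add_ohci_exclude is a hypothetical name, and the optional file argument exists only so the function can be tried on a copy before touching /etc/system.

```shell
# Hypothetical helper, not part of SMS or Solaris.
# Usage: add_ohci_exclude [file]
# Appends "exclude: drv/ohci" to the file (default /etc/system) unless a
# line starting with that text is already present.
add_ohci_exclude() {
    system_file=${1:-/etc/system}
    if grep '^exclude: drv/ohci' "$system_file" >/dev/null 2>&1; then
        return 0
    fi
    echo "exclude: drv/ohci" >> "$system_file"
}
```

Back up /etc/system before changing it; the new line takes effect at the next boot.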