From: 卢兴敬 (xingjinglu_at_gmail_dot_com)
Date: Wed Oct 29 2008 - 15:51:39 PST
Paul,

I ran "vstat" on the nodes in my UPC_NODES, and it shows the following:

1 HCA found:
    hca_id=InfiniHost0
    pci_location={BUS=0x04,DEV/FUNC=0x00}
    vendor_id=0x02C9
    vendor_part_id=0x5A44
    hw_ver=0xA1
    fw_ver=3.5.0
    PSID=MT_0030000001
    num_phys_ports=2
        port=1
        port_state=PORT_ACTIVE
        sm_lid=0x0009
        port_lid=0x0041
        port_lmc=0x00
        max_mtu=2048

        port=2
        port_state=PORT_DOWN
        sm_lid=0x0000
        port_lid=0x0042
        port_lmc=0x00
        max_mtu=2048

I am not familiar with the network, but I think only port 1 is active. So I tried "ssh 12.11.11.7 -D 1", and the result below suggests that I do not have permission to do so.

-----------------------------------------------------
autopar@gnode8:~> ssh 12.11.11.7 -D 1
Privileged ports can only be forwarded by root.
------------------------------------------------

I am now looking for more information about InfiniBand. Do you think the lack of a properly active port is causing the problem?

Thank you, I look forward to your reply!

-----Original Message-----
From: Paul H. Hargrove [mailto:hargrove_at_hpcrd_dot_lbl_dot_gov]
Sent: Thursday, October 30, 2008 5:44 AM
To: 卢兴敬
Subject: a problem of "unable to open any HCA ports" when upcrun a program

Email to [email protected] failed, so I am resending to your gmail address:

This error indicates that one or more nodes failed to locate an InfiniBand adapter. It is possible that the InfiniBand hardware and/or software has not been set up on one or more nodes. You should see the message you quote once for each failing attempt. If you see fewer than 32 instances, then it is possible that at least one node does have a working InfiniBand configuration.

On each of the machines in your UPC_NODES, try running the "vstat" utility to see if it reports at least one "PORT_ACTIVE" line on each. It is also possible that the InfiniBand libraries are present but there is no hardware at all.

-Paul
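
The per-node check suggested above (run "vstat" on each machine in UPC_NODES and look for at least one "PORT_ACTIVE" line) could be scripted roughly as follows. This is only a minimal sketch in Python: it assumes vstat prints each port as "port=N" followed by "port_state=STATE", as in the output quoted in this thread, and the names active_ports and SAMPLE are hypothetical helpers, not part of vstat or Berkeley UPC.

```python
import re

# Sample text copied from the vstat output quoted in this thread.
SAMPLE = """1 HCA found:
    hca_id=InfiniHost0
    num_phys_ports=2
        port=1
        port_state=PORT_ACTIVE
        sm_lid=0x0009
        port_lid=0x0041

        port=2
        port_state=PORT_DOWN
        sm_lid=0x0000
        port_lid=0x0042
"""


def active_ports(vstat_output):
    """Return the port numbers whose port_state is PORT_ACTIVE.

    Assumption: every port is reported as "port=N" immediately followed
    (after whitespace) by "port_state=STATE", as in the output above.
    """
    pairs = re.findall(r"\bport=(\d+)\s+port_state=(\w+)", vstat_output)
    return [int(num) for num, state in pairs if state == "PORT_ACTIVE"]


if __name__ == "__main__":
    ports = active_ports(SAMPLE)
    if ports:
        print("active ports:", ports)
    else:
        print("no PORT_ACTIVE line found on this node")
```

In practice the text passed to active_ports would be the captured output of running vstat on each node, and any node for which the returned list is empty would be the one to investigate.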