RTX2080Ti过程

RTX2080Ti过程

AI想搞得好,计算能力要好,最近搞了两个RTX2080Ti和一个3.8T的硬pan.然后重新搞了一下,出了许多的bug,在此记录一下。

系统的问题

做系统pan的时候,里面的东西一定要先删除光,不然的话是不行的,记得在制作启动pan的时候,已经显示会格式化了,但是在安装的时候还是报了错,

The 'grub-efi-amd64....'

查了一下,有的说是因为系统pan不干净导致的,然后重新制作了一次,发现好了, 还有是在安装的时候,有一步提醒要不要继续安装,我当时点了back,因为第一次点的是continue结果没有安装成功。

安装的时候一定要看清晰,感觉这次安装是最花时的,之前都是一步到位,没有出过那么多的bug.

显示的问题

安装好,打开后,发现极度不好看,然后在系统里找display但是分辨率却调不了,查了一下,大致是因为驱动没有安,然后我就直接安装nvidia的了,方法是

sudo add-apt-repository ppa:xorg-edgers/ppa
sudo add-apt-repository ppa:graphics-drivers/ppa
sudo apt-get update

然后在search里面找additional drivers 找到更新的显卡驱动,然后安装,等安装完了重启,就会发现一切恢复了正常。

然后 nvidia-smi时已经可以用了

显示如下

avator

这是全部弄好测试时的图。

mount 时的错

mount: unknown filesystem type 'LVM2_member'

刚开始就报这个错, mount不上去, 在网上查了一下,需要先格式化

mkfs -t ext4 -c /dev/sdb1

这个格式化非常地慢,前后持续了5个小时。

然后操作是下面的

$ sudo fdisk -l

Disk /dev/sda: 111.8 GiB, 120034123776 bytes, 234441648 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disklabel type: dos
Disk identifier: 0xd9c8fe89

Device     Boot     Start       End   Sectors   Size Id Type
/dev/sda1  *         2048 232441855 232439808 110.9G 83 Linux
/dev/sda2       232443902 234440703   1996802   975M  5 Extended
/dev/sda5       232443904 234440703   1996800   975M 82 Linux swap / Solaris


Disk /dev/sdb: 3.7 TiB, 4000787030016 bytes, 7814037168 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes
Disklabel type: gpt
Disk identifier: E882D06C-8A20-4947-9BD2-1B3C4D503744

Device     Start        End    Sectors  Size Type
/dev/sdb1   2048 5859375103 5859373056  2.7T Linux filesystem
qizhi@qizhi-System-Product-Name:/$ sudo fdisk /dev/sdb

Welcome to fdisk (util-linux 2.27.1).
Changes will remain in memory only, until you decide to write them.
Be careful before using the write command.


Command (m for help): m

Help:

  Generic
   d   delete a partition
   F   list free unpartitioned space
   l   list known partition types
   n   add a new partition
   p   print the partition table
   t   change a partition type
   v   verify the partition table
   i   print information about a partition

  Misc
   m   print this menu
   x   extra functionality (experts only)

  Script
   I   load disk layout from sfdisk script file
   O   dump disk layout to sfdisk script file

  Save & Exit
   w   write table to disk and exit
   q   quit without saving changes

  Create a new label
   g   create a new empty GPT partition table
   G   create a new empty SGI (IRIX) partition table
   o   create a new empty DOS partition table
   s   create a new empty Sun partition table


Command (m for help): d
Selected partition 1
Partition 1 has been deleted.

Command (m for help): p
Disk /dev/sdb: 3.7 TiB, 4000787030016 bytes, 7814037168 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes
Disklabel type: gpt
Disk identifier: E882D06C-8A20-4947-9BD2-1B3C4D503744

Command (m for help): n
Partition number (1-128, default 1): 1
First sector (34-7814037134, default 2048): 34
Last sector, +sectors or +size{K,M,G,T,P} (34-7814037134, default 7814037134): 3907018567

Created a new partition 1 of type 'Linux filesystem' and of size 1.8 TiB.

Command (m for help): n
Partition number (2-128, default 2): 2
First sector (3907018568-7814037134, default 3907018752): 
Last sector, +sectors or +size{K,M,G,T,P} (3907018752-7814037134, default 7814037134): 

Created a new partition 2 of type 'Linux filesystem' and of size 1.8 TiB.

Command (m for help): p
Disk /dev/sdb: 3.7 TiB, 4000787030016 bytes, 7814037168 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes
Disklabel type: gpt
Disk identifier: E882D06C-8A20-4947-9BD2-1B3C4D503744

Device          Start        End    Sectors  Size Type
/dev/sdb1          34 3907018567 3907018534  1.8T Linux filesystem
/dev/sdb2  3907018752 7814037134 3907018383  1.8T Linux filesystem

Partition 1 does not start on physical sector boundary.

Command (m for help): d
Partition number (1,2, default 2): 1

Partition 1 has been deleted.

Command (m for help): n
Partition number (1,3-128, default 1): 1
First sector (34-3907018751, default 2048): 
Last sector, +sectors or +size{K,M,G,T,P} (2048-3907018751, default 3907018751): 

Created a new partition 1 of type 'Linux filesystem' and of size 1.8 TiB.

Command (m for help): p
Disk /dev/sdb: 3.7 TiB, 4000787030016 bytes, 7814037168 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes
Disklabel type: gpt
Disk identifier: E882D06C-8A20-4947-9BD2-1B3C4D503744

Device          Start        End    Sectors  Size Type
/dev/sdb1        2048 3907018751 3907016704  1.8T Linux filesystem
/dev/sdb2  3907018752 7814037134 3907018383  1.8T Linux filesystem

Command (m for help): w

其中有一步让弄的时候,我选了一个34,因为我想着这样可能会大一些,结果报了一个错“物理边界没达到”, 估计这些边界值应该是1024的倍数,之后就又删除掉按着2048来的,就好了,最后别忘了w保存。

然后就是这样的了

然后在mount之前别忘了

sudo mkfs.ext4 /dev/sdb1

两个都mount完之后 是

Filesystem      Size  Used Avail Use% Mounted on
udev             16G     0   16G   0% /dev
tmpfs           3.2G   34M  3.2G   2% /run
/dev/sda1       109G   26G   78G  25% /
tmpfs            16G   23M   16G   1% /dev/shm
tmpfs           5.0M  4.0K  5.0M   1% /run/lock
tmpfs            16G     0   16G   0% /sys/fs/cgroup
tmpfs           3.2G  4.0K  3.2G   1% /run/user/1000
/dev/sdb1       1.8T   68M  1.7T   1% /mnt/data1
/dev/sdb2       1.8T   68M  1.7T   1% /mnt/data2

然后为了电脑重启还要重新mount就加了一个启动自动的。

sudo vim /etc/rc.local

在里面的exit 0之前写上

sudo mount /dev/sdb1 /mnt/data1
sudo mount /dev/sdb2 /mnt/data2

就可以了。

cuda 和cudnn的安装

RTX2080Ti 安cuda的时候遇到了一些坑,先试的cuda90,安装没有问题,有问题的是cudnn安装完成后测试不通过,在网上查了一下说是可能cuda90不能够用,然后还是安装了cuda100, 和对应的cudnn,比较顺利没有出bug.

打赏,谢谢~~

取消

感谢您的支持,我会继续努力的!

扫码支持
扫码打赏,多谢支持~

打开微信扫一扫,即可进行扫码打赏哦