GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:45:36

Sequence gi568815590r:33492016_33697515 : 205500 bp : 45.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3527   3601   75  1  0   97   93    97 0.999  10.81
 1.02 Intr +   4610   4726  117  1  0   55  100   244 0.982  22.96
 1.03 Intr +   5217   5282   66  2  0   72   83    54 0.809   2.40
 1.04 Term +   6417   6614  198  2  0  107   34   344 0.994  28.30
 1.05 PlyA +   7599   7604    6                               1.05

 2.08 PlyA -   7659   7654    6                              -0.45
 2.07 Term -   7738   7718   21  1  0   99   49    16 0.792  -2.89
 2.06 Intr -   8475   8313  163  2  1   35  116   100 0.593   7.48
 2.05 Intr -  11557  11414  144  0  0   99   90    31 0.931   3.80
 2.04 Intr -  11920  11733  188  1  2  107   94   178 0.999  18.79
 2.03 Intr -  15306  15214   93  0  0   78   90    38 0.855   3.16
 2.02 Intr -  17917  17731  187  0  1   60   91    -3 0.717  -3.11
 2.01 Init -  20598  19952  647  0  2   66   78   254 0.753  17.00
 2.00 Prom -  34509  34470   40                              -3.96

 3.00 Prom +  41756  41795   40                              -2.16
 3.01 Sngl +  48041  48382  342  1  0   60   54   252 0.705  15.27
 3.02 PlyA +  48843  48848    6                               1.05

 4.07 PlyA -  49437  49432    6                               1.05
 4.06 Term -  56852  56738  115  1  1   64   48   116 0.629   3.24
 4.05 Intr -  57477  57395   83  0  2  117   70    18 0.457   1.34
 4.04 Intr -  59070  59029   42  0  0  126   98    29 0.744   6.14
 4.03 Intr -  59374  59329   46  0  1  101   88    72 0.998   6.91
 4.02 Intr -  66756  66600  157  1  1  148   96   134 0.996  19.27
 4.01 Init -  82247  82082  166  2  1   31   60   192 0.588   8.93
 4.00 Prom -  96910  96871   40                              -2.46

 5.11 PlyA -  99335  99330    6                               1.05
 5.10 Term - 100197  99998  200  1  2  112   55   281 0.986  24.76
 5.09 Intr - 101732 101518  215  1  2   81   75   373 0.361  33.66
 5.08 Intr - 105758 105280  479  2  2   84   47   272 0.067  14.55
 5.07 Intr - 107750 107650  101  0  2   65   65    51 0.392   0.33
 5.06 Intr - 125663 125531  133  2  1   74   76    58 0.041   3.42
 5.05 Intr - 139896 139752  145  2  1   69    6   112 0.012   1.38
 5.04 Intr - 144319 144183  137  1  2   69   73    54 0.085   1.37
 5.03 Intr - 148154 148080   75  2  0  104   89    30 0.363   4.41
 5.02 Intr - 155291 155220   72  2  0  104  105   -18 0.248   1.10
 5.01 Init - 163614 163537   78  0  0   81   88    43 0.427   4.66
 5.00 Prom - 201241 201202   40                              -1.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  40510  40397  114  1  0   83   38    97 0.953   2.77
S.002 Init -  41398  41336   63  1  0   45   82    81 0.832   2.35
S.003 Term - 105758 105249  510  2  0   84   43   278 0.808  17.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:33492016_33697515|GENSCAN_predicted_peptide_1|151_aa
EKALIAAQLDNAIEKELLERLKQDTYGDIYNFPIHAFDKALEQQEAESDSSDTEEKDDDD
DDEEDVGKREFVEDGEVDESDISDFEDMDKLDASSDEDQDGKSSSEEEEEKALSAKHKGK
MPLRGPLQRKRAYVEIEYEQETEPVAKAKTT
>gi568815590r:33492016_33697515|GENSCAN_predicted_CDS_1|456_bp
gaaaaggcattaatagctgctcagctggacaatgccattgagaaggaattactggagaga
ctgaaacaagatacgtatggcgacatctacaacttccccattcatgccttcgacaaagcc
ctggaacaacaggaggcagagagtgactcttcagatactgaggaaaaagatgatgatgat
gatgatgaggaagatgtggggaaaagagaatttgtcgaagatggtgaggtagatgagagt
gacataagtgattttgaggatatggataaactggatgccagcagtgatgaagatcaggat
ggtaaatcctccagtgaggaggaggaagaaaaggcccttagtgcgaaacacaaaggcaaa
atgcccttgagaggaccactgcagagaaaacgagcctatgtggaaatagaatacgagcag
gagacagagcccgtggccaaagccaaaaccacgtga
>gi568815590r:33492016_33697515|GENSCAN_predicted_peptide_2|480_aa
MELDSALEAPSQEDSNLSEELSHSAFGQAFSKILHCLARPEARRGNVKDAVLKDLGDLIE
ATEFDRLFEGTGARLRGMPETLGQVAKALEKYAAPSKEEEGGGDGHSEAAEKAAQVGLLF
LKLLGKVETAKNSLVGPAWQTGLHHLAGPVYIFAITHSLEQPWTTPRSREVAREVLTSLL
QVTECGSVAGFLHGENEDEKGRLSVILGLLKPDLYKESWKNNPAIKHVFSWTLQQVTRPW
LSQHLERVLPASLVISDDYQTENKILGVHCLHHIVLNVPAADLLQYNRAQVLYHAISNHL
YTPEHHLIQAVLLCLLDLFPILEKTLHWKGDGARPTTHCDEVLRLILTHMEPEHRLLLRR
TYARNLPAFVNRLGILTVRHLKRLERVIIGYLEVYDGPEEEARLKILETLKLLMQHTWPR
VSCRLVVLLKALLKLICDVARDPNLTPESVKSALLQEATDCLILLDRCSQGRVKKSLVED
>gi568815590r:33492016_33697515|GENSCAN_predicted_CDS_2|1443_bp
atggagcttgacagcgctctggaagccccatcgcaggaagactctaatttgtccgaggag
ttgtctcactccgcctttggacaggccttctccaagattttacactgtcttgcccgcccg
gaggcacgacgaggcaatgtaaaagatgcagttcttaaagacctcggtgatctaatagaa
gccacagaatttgataggttatttgaggggactggtgcacggctccgcggaatgccggag
acactggggcaggtagcaaaagccctggagaagtatgcagccccctccaaggaggaggaa
ggtggaggtgatgggcactccgaagcggccgagaaagcagcccaagttgggttactgttt
cttaaactgttagggaaagttgagactgctaagaattccctggtcggccctgcatggcag
acgggcctgcatcacttggcaggacccgtttatatttttgccatcacacacagcttggag
caaccatggaccactccgagatctcgggaagttgctagggaggtgctcacctcactgctt
caagttactgaatgcggttctgtggcaggattcctacatggagaaaatgaagatgagaaa
gggagactttcggtgatactagggcttctcaaacccgacttgtataaggaatcctggaag
aataaccctgccatcaaacatgttttctcatggactctgcaacaggtcactcggccctgg
ctgagccagcatctggaaagggtacttcccgcatcattggtcatttcagatgactatcag
actgagaacaaaatcctgggtgtacactgtctccatcacattgtgcttaatgtgccagct
gctgatttgctccagtataacagagcccaggtcctataccatgccatttccaaccacctg
tacacaccagagcaccacctcattcaggctgtgctcctgtgtctgctggatttattcccc
atcctggagaaaaccctgcactggaaaggagatggagctcgacccaccacccattgtgat
gaggtcctgcggctgatcctgacccacatggagccagagcaccgccttcttttacgcagg
acctacgcaagaaacctgccggctttcgtgaacaggttggggatcctaactgtccggcac
ttaaagaggctggagagagtcatcattggttatctggaggtttatgatggacctgaggag
gaagctagactgaagatattggaaaccctaaaacttctcatgcaacatacttggcccaga
gtttcctgcagacttgtggtcttactgaaggccctcttgaaactgatttgtgatgtagca
agggatccaaaccttacacctgagtctgttaagagcgccctgctacaggaggccacagac
tgcctgattctcctggaccgctgttctcaaggacgggtaaagaaatctttagttgaagat
tga
>gi568815590r:33492016_33697515|GENSCAN_predicted_peptide_3|113_aa
MAANSSFLGGVHGLFLVWVALRVLGDRPFKCTFMSLTLHYPRCRLETGIQGAFGKPQGTV
ARVHIGQVKSICTKLQNKEHVIEAPCRAKFKFPGHQKIHISKKWGFTKFNVDE
>gi568815590r:33492016_33697515|GENSCAN_predicted_CDS_3|342_bp
atggctgcaaacagcagctttcttggtggtgtacatggcctgtttcttgtatgggttgct
ctaagggtccttggagacaggcctttcaaatgtacgttcatgtctctgaccttgcactac
ccccgatgtaggctcgaaacaggcattcaaggtgcctttggaaagccccagggcactgtg
gccagggttcacattggccaagttaagtccatctgcaccaagctgcagaacaaggagcat
gtgattgaggcgccatgcagggccaagttcaagttccctggccaccagaagatccacatc
tccaagaagtggggctttaccaagttcaatgtggatgaatga
>gi568815590r:33492016_33697515|GENSCAN_predicted_peptide_4|202_aa
MGLRAVEQRVALVGEARAAQEPMEWVGGSGMAGCRSRALPCGKAAKARREIERSAGCFCG
LGLVSTNKSCSMPPISFQDLPLNIYMVIFGTGIFVFMLSLIFCCYFISKLRNQAQSERYG
YKEVVLKGDAKKLQLYGQTCAVCLEDFKGKDELGVLPCQHAFHRKCLVKWLEVRCVCPMC
NKPIASPSEATQNIGILLDELV
>gi568815590r:33492016_33697515|GENSCAN_predicted_CDS_4|609_bp
atgggactgcgcgccgtggagcagagggtggcgctcgtcggggaagctcgggccgcacag
gagcccatggagtgggtgggaggctcaggcatggcgggctgcaggtcccgagccctgccc
tgtgggaaggcagccaaggcccggcgagaaatcgagcgcagcgccgggtgtttctgtggc
ctgggactggttagcaccaacaagtcctgctcgatgccacccatcagtttccaggacctt
ccgctcaacatctatatggtcatcttcggcacaggcatctttgtcttcatgctcagcctt
atcttctgctgctattttatcagcaaactgcggaaccaggcacagagtgagcgatacgga
tataaggaggtggtgcttaaaggtgatgccaagaagttacaattatatgggcagacctgc
gcagtctgtctggaagacttcaaggggaaggatgagttaggcgtgctcccgtgccaacac
gcctttcaccgcaagtgtctggtgaaatggctggaagttcgctgtgtctgccccatgtgt
aacaagcccattgctagtccctcagaggccacgcagaacattgggattctattggatgag
ctggtgtga
>gi568815590r:33492016_33697515|GENSCAN_predicted_peptide_5|544_aa
MEYYTAIKKNEILSFATTWMELEVIFGSPILWPQTSTSPWPVRNWAAQQEASAFLLSGIT
AAELEVEIVPGPSSSTLSQKSFPLENFPCQLHAAKTELTTGPSGYPGTNYPSGGITQALK
MEAAKAQGEIECSTGGPAQVEDPAHPPQLLAWVLSPLEPAASGAGRLLRAQTPETFLWTW
TSSLSQCQASPHRLRLKAHFGAKSASVDSQKASETAAATAAGASWPWRCLRRRSAFRTEA
QRSACKESQRAEQKELLQLEEKATPEGSCQSQAYLLSLENLLPKDQPREGKQLLWTHSSV
SHTPSSSSSSYLATEPIITTATPAAATVSATSKMCPGNWLWASMTFMARFSRSSSRSPVR
TRGTLEEMPTVQHPFLNVFELERLLYTGKTACNHADEVWPGLYLGDQDMANNRRELRRLG
ITHVLNASHSRWRGTPEAYEGLGIRYLGVEAHDSPAFDMSIHFQTAADFIHRALSQPGGK
ILVHCAVGVSRSATLVLAYLMLYHHLTLVEAIKKVKDHRGIIPNRGFLRQLLALDRRLRQ
GLEA
>gi568815590r:33492016_33697515|GENSCAN_predicted_CDS_5|1635_bp
atggagtactatacagccattaaaaagaatgagattctgtcatttgccacaacatggatg
gaactggaagtcattttcgggtccccaatcctctggccacagaccagtaccagtccatgg
ccagttaggaactgggctgcacagcaggaggcttctgcctttttgctctcaggaattacc
gctgctgaactggaagtagagattgtccctggcccttcctcatcaacattatctcagaag
tccttccctcttgaaaactttccatgccagcttcatgctgccaagactgaattaactacc
ggaccttcaggataccctgggaccaactatccctctgggggcattacccaggctcttaaa
atggaggcagctaaggcccagggagaaattgagtgcagcaccggtgggccagcacaggtg
gaagacccggcacaccctccgcagctgctggcctgggtgctaagcccgttagagcccgcg
gccagcggggctggtcggctgctccgagctcagactccagaaacattcttatggacttgg
acatcaagcctatcccagtgccaggccagccctcacagactaaggctcaaggcccacttt
ggtgccaagtcagcttctgtggactcacagaaggcttcagaaacagcggcggcgacagca
gcaggagcgtcatggccgtggcgctgtctgcgccggcgatccgcctttcggactgaggcc
cagcgcagcgcttgcaaagagagccaaagagctgagcaaaaggagctgctgcagcttgaa
gaaaaagccacccctgaaggctcctgccagagccaggcttaccttctctccctcgagaat
cttctcccgaaggaccagcccagggaagggaagcagctgctctggactcactcaagtgtg
tctcacaccccttcctcttccagcagcagctacctggcaactgaacccatcatcaccaca
gccactcctgcagctgccacggtttctgccacctctaagatgtgccctggtaactggctt
tgggcttctatgacttttatggcccgcttctcccggagtagctcaaggtctcctgttcga
actcgagggaccctggaggagatgccaaccgttcaacatcctttcctcaatgtcttcgag
ttggagcggctcctctacacaggcaagacagcctgtaaccatgccgacgaggtctggcca
ggcctctatctcggagaccaggacatggctaacaaccgccgggagcttcgccgcctgggc
atcacgcacgtcctcaatgcctcacacagccggtggcgaggcacgcccgaggcctatgag
gggctgggcatccgctacctgggtgttgaggcccacgactcgccagcctttgacatgagc
atccacttccagacggctgccgacttcatccaccgggcgctgagccagccaggagggaag
atcctggtgcattgtgctgtgggcgtgagccgatccgccaccctggtactggcctacctc
atgctgtaccaccaccttaccctcgtggaggccatcaagaaagtcaaagaccaccgaggc
atcatccccaaccggggcttcctgaggcagctcctggccctggaccgcaggctgcggcag
ggtctggaagcatga