GENSCAN 1.0    Date run: 6-Nov-116    Time: 16:16:43

Sequence gi568815591r:129734940_130052555 : 317616 bp : 46.05% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   9217   9290   74  1  2   97   94    41 0.251   4.65
 1.02 Term +  20079  20242  164  1  2  119   44   245 0.928  21.30
 1.03 PlyA +  22114  22119    6                               1.05

 2.00 Prom +  22761  22800   40                              -6.56
 2.01 Sngl +  27461  27820  360  1  0   79   53   215 0.339  11.18
 2.02 PlyA +  28331  28336    6                               1.05

 3.08 PlyA -  28604  28599    6                               1.05
 3.07 Term -  34846  34685  162  1  0   45   48   102 0.373  -0.16
 3.06 Intr -  35584  35370  215  2  2  118   36    61 0.238   2.33
 3.05 Intr -  37209  37097  113  2  2  106   31    65 0.324   2.62
 3.04 Intr -  39661  39586   76  2  1   97   32    59 0.211  -0.23
 3.03 Intr -  43926  43787  140  2  2  101   55    55 0.685   3.61
 3.02 Intr -  44666  44150  517  0  1   95   65   107 0.242   1.12
 3.01 Init -  44953  44851  103  1  1   56  109    90 0.620   6.35
 3.00 Prom -  47000  46961   40                              -4.06

 4.00 Prom +  48663  48702   40                              -6.76
 4.01 Init +  50769  50869  101  2  2   79   69   143 0.192   9.24
 4.02 Intr +  56071  56176  106  1  1   33   49    84 0.035  -0.78
 4.03 Term +  69781  69891  111  0  0  102   49    74 0.678   3.46
 4.04 PlyA +  72452  72457    6                               1.05

 5.00 Prom +  84062  84101   40                              -1.86
 5.01 Init +  89887  90011  125  0  2   36   75   142 0.408   5.45
 5.02 Intr +  90066  90169  104  0  2   65   68    35 0.105  -0.98
 5.03 Term +  99254  99369  116  0  2   84   44   131 0.933   7.03
 5.04 PlyA +  99639  99644    6                               1.05

 6.08 PlyA -  99658  99653    6                              -0.45
 6.07 Term - 100122  99998  125  1  2   76   39   221 0.960  14.55
 6.06 Intr - 104396 104268  129  0  0  101  116    62 0.673  10.97
 6.05 Intr - 131354 131182  173  1  2   89   38    66 0.052   1.29
 6.04 Intr - 141294 141180  115  1  1   77   29    66 0.053  -0.89
 6.03 Intr - 144703 144629   75  2  0   68   77    38 0.305   0.19
 6.02 Intr - 146032 145956   77  0  2   62  100    37 0.362   1.46
 6.01 Init - 150658 150594   65  1  2   60  113    12 0.536   1.52
 6.00 Prom - 151505 151466   40                              -3.06

 7.00 Prom + 213987 214026   40                              -0.96
 7.01 Init + 217229 217410  182  1  2   68   35   150 0.470   6.36
 7.02 Intr + 223593 223614   22  0  1  142   96    -5 0.496   3.25
 7.03 Intr + 228743 228843  101  1  2   37  103    65 0.250   1.81
 7.04 Intr + 259452 259476   25  0  1   71   96    53 0.017   2.43
 7.05 Term + 266576 266677  102  1  0   96   48    23 0.024  -2.62
 7.06 PlyA + 269024 269029    6                               1.05

 8.08 PlyA - 269041 269036    6                               1.05
 8.07 Term - 274001 273708  294  2  0    6   35   172 0.067  -0.99
 8.06 Intr - 287586 287380  207  0  0  119   81    98 0.993  11.47
 8.05 Intr - 288784 288572  213  1  0   93  109   128 0.991  14.31
 8.04 Intr - 294129 293963  167  1  2   81  101    20 0.192   2.28
 8.03 Intr - 306146 306012  135  0  0   41   95   109 0.406   7.44
 8.02 Intr - 313045 313011   35  0  2   44   44    33 0.066  -7.63
 8.01 Init - 316427 316282  146  2  2  104   77   105 0.332  10.60

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 266496 266544   49  2  1   87   94    51 0.822   6.72

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:129734940_130052555|GENSCAN_predicted_peptide_1|79_aa
XSVLLSGLFMADRAGRKWILTDKATGLVQIPVSMYQTVVTSLAQGNGPVQVAMAPVTTRI
SDSAVTMDGQAVEVVTLEQ
>gi568815591r:129734940_130052555|GENSCAN_predicted_CDS_1|240_bp
nnttctgtgttattgtcaggcctctttatggcagatcgtgcaggtcgcaagtggatcctg
actgacaaagccacaggcctggtccagatccctgtgagcatgtaccagactgtggtgacc
agcctcgcccagggcaacggaccagtgcaggtggccatggcccctgtgaccaccaggata
tcagacagcgcagtcaccatggacggccaagctgtggaggtggtgacattggaacagtga
>gi568815591r:129734940_130052555|GENSCAN_predicted_peptide_2|119_aa
MKTSWALLRSLGAGHSCGAVGHGRAEGPWPRGLPSMPTPTPPVGPSQCGPCLLAQPERHD
AGKRLRPAGAQHLPDCERGSPQMAWAKAARVRETRRGVTQSPSASRTQSRRRGRTRLAG
>gi568815591r:129734940_130052555|GENSCAN_predicted_CDS_2|360_bp
atgaagacttcgtgggccctgctgaggtccttgggagcagggcattcgtgcggggctgtg
gggcacggccgagctgagggtccctggcccaggggcctcccgtccatgccgacgcccact
cctcccgtaggacccagccagtgcgggccctgtcttctagcccagcctgaacgccacgat
gcagggaagcgcctgaggcctgccggggctcaacacctgcccgactgtgagcgcggaagc
ccgcagatggcctgggcgaaggcggcccgggtgcgggagacgcgacgcggggtcactcag
agcccgagcgcatcccgaacccagagccgccgtcgggggcgcacccgattggccggctga
>gi568815591r:129734940_130052555|GENSCAN_predicted_peptide_3|441_aa
MRLFPNRGRFCLASLVWTPRGAERRLCAQPGAGAGGGAHGAPGSRVRQWRLPSAQRCSRA
GAGAGAAAVRVRPVLARCLPARSLAGVGLRFAALVGSGGSARLGGGCGERRGPGAGPSSV
LSGLRGWFEAASVARSRPAQSLGESGSQTSRVPPLRGHSSREPRRTRMRHRSPPSTAPFL
LIALVAPRFPALPQDRRLDGLPLAPHRQGAAAPPRFAPVFAKLLRAKILRARWRILGPGH
SALRVRITLYLSPTHLAQLLCEGPALVGSQIALHCLGPRPLSQPLPSLRRVEMRSGTIDR
LQPQAADGARLVRPQLAQTEASPAPGGSCLPPPVFGNGRTHTGEVTGSGGSRLANYGART
QPAPCAQPAREGPAMLDLLFSARKEGTQVAHGLDLYASSIPLHGFVPPDASEPETKPPLT
FPELVFAVGAPLSRSRATGSE
>gi568815591r:129734940_130052555|GENSCAN_predicted_CDS_3|1326_bp
atgcggctctttccaaaccggggccgtttttgcctggcttctctggtgtggactccgcgc
ggagctgagaggcggctgtgcgcccagcctggagccggcgcaggcggaggcgcccacggg
gcgcctgggtcccgggtccgccagtggcgtctgcccagcgcccagcgctgttcccgggcg
ggggcgggggcgggagcggctgcggtccgagtgcggccggtgcttgcgaggtgcctaccc
gcgaggagtttggctggggtcgggctgcgtttcgccgccctggtgggctcaggcggaagc
gccaggctgggagggggctgcggagagcgccggggccccggagctggcccctccagcgtc
ttgtcagggctgcgcggctggttcgaggctgcttccgtggcccggagccgccccgcccag
tctctgggggaaagcggctctcagacctcccgtgtgcctccacttcgcggccactcatcc
cgggaaccccggcgcacgcgaatgagacaccgttccccgccctcaactgccccattcctg
ttaatagcgctagtggcacccaggttcccagcccttccccaagatcggcggctggacggc
ttgcctctggcgcctcaccggcaaggtgcggcggcgcctcccaggtttgcgcccgtcttc
gcaaagctcctgagggcgaaaatcctgcgagccaggtggcggattctcgggccgggtcac
tcagctctacgggtgcgcatcaccctctacctgtcacccacccatctggctcagctgctg
tgtgagggcccagcgctggtgggcagccagatcgccttacactgcctggggccacggcca
ttatcccagcctctgcccagcctcaggcgggtcgaaatgcgatccggcaccattgatcgc
ctgcagccccaggctgccgatggagcccggctggtgcggccacagctggcacagaccgag
gcctccccagctcctggggggagctgcttgcctccccccgtttttggcaatggtagaact
cacactggtgaggtaacaggatccggtggttctagacttgccaactatggggcgaggact
cagccggcaccctgtgcacagccagcgagggaagggccggccatgctggacctgctgttc
tccgcgaggaaggaggggactcaggtggcccatgggctggacctctatgcctccagcatt
ccacttcatggatttgttcctcctgacgcctcagaacctgagacaaagcctccactcacc
ttcccagagcttgtgtttgcagtgggagctcccctgagccggagccgggccacagggtca
gagtga
>gi568815591r:129734940_130052555|GENSCAN_predicted_peptide_4|105_aa
MRSVQDLGVPFSLHPRLTAGSSPDAYRLFAPSLSAFMDIISSDSQDNPILRVRKLRLGEI
EWLAQAIQLLCIEMKRNTNQRCLLNVLGVSCSHHHFIERFLDGKR
>gi568815591r:129734940_130052555|GENSCAN_predicted_CDS_4|318_bp
atgcgctctgtccaggacctaggagtccctttctccctccaccctcggctcacagcgggc
agctcccccgacgcgtaccgcctctttgcccccagcctgagtgctttcatggacatcatc
tcatctgattctcaggataaccccattttacgagtgagaaaactgagacttggagagatt
gagtggcttgctcaagccatccagctactctgcatcgaaatgaagagaaacacaaaccag
cgatgtctcctcaacgtcctgggcgttagctgttcccaccatcacttcatagagaggttt
ctggatgggaagcgttag
>gi568815591r:129734940_130052555|GENSCAN_predicted_peptide_5|114_aa
MRPAAAILSCTGSGHPQPAHALVVAKPQLLLAVGSGLRFPARASDRRQEEGARESPEGGR
RSLHSINLRRERPAGPGLDFTLYPKKKNWFRKHFPKFLQQNTVAAGPGSVKGSN
>gi568815591r:129734940_130052555|GENSCAN_predicted_CDS_5|345_bp
atgaggcctgccgcagccattctcagctgcaccgggtccgggcatccacagccggctcac
gcgctggtggtcgccaagcctcagctcttgctggctgttggttcagggcttcggttccca
gccagagcaagtgataggagacaggaagagggagctagagagagccctgaaggtggacgt
cgcagtcttcactcaatcaatctccgcagggaacgccctgcaggcccaggtctggatttc
acgctctaccccaagaaaaagaactggttcagaaaacacttccccaagtttctccaacag
aacactgtggctgcagggcctggtagtgtgaagggcagcaactga
>gi568815591r:129734940_130052555|GENSCAN_predicted_peptide_6|252_aa
MWAITGDHGKEKAWRIFFIKYTIESKHEVTILGGLNEFVVKFYGPQGTPYEGGVWKVRVD
LPDKYPFKSPSIENEILELNRFCMFDMTSVLKLGVPGILLDMTKATEDLYCPFLRLYITR
EKGRNRKEDPLLSLDLLNTHRISSEVPMTQGIESELLAWFLTQEETGPDLTNIFESFLPQ
LLAYPNPIDPLNGDAAAMYLHRPEEYKQKIKEYIQKYATEEALKEQEEGTGDSSSESSMS
DFSEDEAQDMEL
>gi568815591r:129734940_130052555|GENSCAN_predicted_CDS_6|759_bp
atgtgggctattactggggatcatgggaaagagaaggcttggaggatcttctttataaaa
tacaccatcgagagtaaacatgaggttacgatcctgggaggacttaatgaatttgtagtg
aagttttatggaccacaaggaacaccatatgaaggcggagtatggaaagttagagtggac
ctacctgataaataccctttcaaatctccatctatagaaaatgagatactagaattaaac
agattctgcatgtttgatatgaccagtgtattaaaactcggggtacctggcatcctgctg
gacatgacgaaggctacagaagatctttactgcccattcctcagactctatattaccaga
gagaaaggaagaaacagaaaagaggaccccttgttgagtttagatctcctgaacacacac
aggatcagttcagaagttcccatgacacagggcattgagagtgagctccttgcctggttc
ctcacccaagaagagacaggcccagatcttaccaatatatttgagtccttcctgcctcag
ttattggcctatcctaaccccatagatcctctcaatggtgacgctgcagccatgtacctc
caccgaccagaagaatacaagcagaaaattaaagagtacatccagaaatacgccacggag
gaggcgctgaaagaacaggaagagggtaccggggacagctcatcggagagctctatgtct
gacttttccgaagatgaggcccaggatatggagttgtag
>gi568815591r:129734940_130052555|GENSCAN_predicted_peptide_7|143_aa
MGSVVESEATGRAKRKKAVSGVQTSRRKVTGSGDTGRASSNAYFGVLRRKGELKRLSAPG
RAEPTLDQALYQGALLTTNQPGRTDVILLVAQMKNLTFKEVMLYALYVYQGKYSRYKYHI
DRYDAVCPLPANAGECVNKPTPG
>gi568815591r:129734940_130052555|GENSCAN_predicted_CDS_7|432_bp
atgggctcggtggtagagagtgaggcgactggtcgtgcaaaaagaaaaaaggccgtctcg
ggagtccaaacgtcgcggaggaaagtcacaggctctggggatacaggaagagcgagcagc
aacgcgtatttcggggtgctgcggagaaaaggcgagctgaaaaggctctcggcccctggg
agagctgagcccacccttgaccaggcactgtaccaaggtgcattacttacaaccaaccaa
cctggcaggacagatgttattctgcttgttgcacagatgaagaatcttacgttcaaagaa
gtcatgttatatgccctctatgtgtaccagggtaaatacagtagatataagtatcatatc
gatagatatgatgctgtatgtcccttacctgccaatgctggggagtgtgtaaacaagcca
acccctggctag
>gi568815591r:129734940_130052555|GENSCAN_predicted_peptide_8|398_aa
MAAPCEGQAFAVGVEKNWGAVVRSPEGTPQKIRQLIDEGIAPEEGGVDAKVHNLAADFHQ
SKPFELSPLVCAKYGWVTVECDMLKCSSCQAFLCASLQPAFDFDRFYTNIIFQFTFSTDR
FGMLPLDEPAILVSEFLDRFQSLCHLDLQLPSLRPEDLKTMAEKSPGPIVSRTRSWDSSS
PVDRPEPEAASPTTRTRPVTRSMGTGDTPGLEVPSSPLRKAKRARLCSSSSSDTSSRSFF
DPTSQHRDWCPWVNITLGKESRENGGTEPDASAPAEPGWKAVLTILLAHKQSSQPAETDS
MIWLQTVKSLVGPDNSSETQSLLCGKQNVNRGLRVFGKSAPRIRDGPGSSFSLSSIPCYT
RSHSLCPTGPQGALGSLAKSQHCPGPCSRSTVTGLNSH
>gi568815591r:129734940_130052555|GENSCAN_predicted_CDS_8|1197_bp
atggcggcgccctgtgagggacaagcgtttgccgtaggggttgaaaagaattggggtgca
gtagttcgctccccagaagggaccccccagaaaatccggcagctgatagatgaggggatt
gccccggaagagggaggcgtggacgcgaaagttcacaacttggcagctgacttccatcag
agtaagccctttgagctgtctccactcgtctgtgcaaaatatggctgggtcacagtggaa
tgtgatatgctcaagtgctctagctgtcaagcttttctctgtgccagtttacaaccagct
tttgactttgacagattttacaccaatataatcttccaatttactttttccacagaccga
tttgggatgttgcccctggatgagcctgctattcttgttagtgaattcctagatcgtttt
caaagcctttgtcacttggacctccagcttccttccctaaggccggaggacttgaaaact
atggctgaaaagagccctggtcccattgtctctcgaactcggagctgggactcttccagt
cctgttgaccgtcctgagccagaggctgctagccccaccaccagaactcgcccagtgacc
cgaagcatgggaacaggagacacccctggcctggaggtaccatctagccctctgcggaaa
gccaagcgagctcgcctctgctcctccagcagttcggacacatcttcccgaagcttcttt
gatcccacctctcagcatagagactggtgcccttgggtgaatatcacacttggcaaagaa
agcagggagaatggtggaactgaaccagatgccagcgccccagcagagccaggctggaaa
gcagtgctgaccatcctcttggcgcacaaacagtctagccagccagctgaaacggactcc
atgatctggctccaaacagtgaagagcctggtggggcctgacaacagttcagagacgcag
agcctcctctgtgggaaacagaatgtgaaccgaggccttagagtgtttggaaaatcagct
ccacgaatcagagatggaccgggcagcagcttctccctctcctccatcccatgctacacc
cgtagccactctctgtgccccactggtcctcagggggccctgggctctcttgccaagtct
cagcactgcccaggcccatgcagccggtcaacagtgactgggctcaactcccactaa