GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:32:57 Sequence gi568815588r:101727356_101939840 : 212485 bp : 45.78% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5629 5755 127 2 1 109 38 88 0.094 5.64 1.02 Term + 22035 22086 52 0 1 115 39 36 0.009 -1.70 1.03 PlyA + 24620 24625 6 1.05 2.05 PlyA - 26004 25999 6 1.05 2.04 Term - 32205 32078 128 1 2 100 53 127 0.987 8.84 2.03 Intr - 32606 32392 215 1 2 59 77 115 0.865 5.86 2.02 Intr - 39347 39223 125 2 2 106 90 80 0.625 9.38 2.01 Init - 39648 39640 9 0 0 69 65 4 0.313 -3.43 2.00 Prom - 39714 39675 40 -9.26 3.06 PlyA - 41120 41115 6 -1.75 3.05 Term - 43264 42974 291 1 0 90 43 720 0.999 63.04 3.04 Intr - 44214 44108 107 1 2 58 100 167 0.857 14.83 3.03 Intr - 47524 47377 148 1 1 55 99 333 0.814 30.91 3.02 Intr - 48757 48514 244 0 1 38 76 171 0.356 8.40 3.01 Init - 50149 50133 17 1 2 94 81 -3 0.539 -0.52 3.00 Prom - 51986 51947 40 -10.84 4.23 PlyA - 53994 53989 6 1.05 4.22 Term - 54499 54381 119 2 2 73 41 155 0.992 8.00 4.21 Intr - 54996 54903 94 0 1 87 77 103 0.999 8.64 4.20 Intr - 55242 55123 120 0 0 58 101 169 0.994 15.89 4.19 Intr - 55569 55484 86 1 2 59 77 146 0.972 10.14 4.18 Intr - 56131 56026 106 1 1 101 41 54 0.494 1.79 4.17 Intr - 63683 63541 143 0 2 76 75 76 0.451 5.17 4.16 Intr - 70799 70625 175 2 1 49 82 104 0.881 5.41 4.15 Intr - 72100 71487 614 2 2 96 87 539 0.916 46.70 4.14 Intr - 73045 72887 159 2 0 81 73 128 0.867 10.56 4.13 Intr - 76664 76380 285 0 0 103 80 180 0.981 16.01 4.12 Intr - 78788 78690 99 0 0 87 111 -19 0.565 0.38 4.11 Intr - 80546 80375 172 2 1 130 72 -3 0.991 1.82 4.10 Intr - 82959 82829 131 1 2 42 93 100 0.967 6.31 4.09 Intr - 90946 90469 478 1 1 65 105 120 0.005 4.02 4.08 Intr - 100396 100334 63 1 0 71 87 53 0.727 2.41 4.07 Intr - 100638 100534 105 0 0 63 80 125 0.993 9.61 4.06 Intr - 100903 100796 108 1 0 92 46 154 0.999 12.08 4.05 Intr - 101104 101034 71 2 2 58 83 99 0.738 5.30 4.04 Intr - 101341 101272 70 1 1 123 78 67 0.932 7.95 4.03 Intr - 101844 101720 125 1 2 98 92 221 0.999 23.80 4.02 Intr - 103812 103672 141 1 0 110 28 65 0.792 3.02 4.01 Init - 107014 106912 103 1 1 100 84 153 0.966 14.70 4.00 Prom - 111855 111816 40 -5.26 5.03 PlyA - 112843 112838 6 1.05 5.02 Term - 113691 113516 176 1 2 63 48 69 0.427 -1.68 5.01 Init - 116213 116141 73 2 1 93 100 134 0.995 16.33 5.00 Prom - 120714 120675 40 -9.16 6.04 PlyA - 122115 122110 6 1.05 6.03 Term - 122271 122134 138 0 0 53 37 151 0.424 4.46 6.02 Intr - 122537 122421 117 2 0 110 105 250 0.235 29.56 6.01 Init - 127914 127912 3 0 0 98 53 0 0.101 -2.50 6.00 Prom - 128568 128529 40 -4.06 7.00 Prom + 137066 137105 40 -2.46 7.01 Init + 140001 140201 201 2 0 95 46 155 0.785 10.88 7.02 Term + 141241 141300 60 0 0 124 39 86 0.996 5.10 7.03 PlyA + 142023 142028 6 1.05 8.06 PlyA - 144065 144060 6 1.05 8.05 Term - 153531 153368 164 1 2 48 43 94 0.133 -1.00 8.04 Intr - 155755 155707 49 1 1 74 75 33 0.157 -1.15 8.03 Intr - 162188 162057 132 2 0 58 115 59 0.141 6.44 8.02 Intr - 200124 200038 87 0 0 17 96 119 0.789 5.77 8.01 Init - 202633 202580 54 1 0 54 119 14 0.848 2.38 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:101727356_101939840|GENSCAN_predicted_peptide_1|59_aa XCVFRLRKKQIPPSSMKMQEAAVACSKHNVIVTGPHGPWCMERFSLTLMFFPLLHNKPF >gi568815588r:101727356_101939840|GENSCAN_predicted_CDS_1|180_bp ngctgtgtcttcaggcttaggaaaaagcagatacctcccagtagcatgaagatgcaagaa gctgcagttgcctgtagcaaacacaacgtgattgtaacaggtccccatggcccttggtgc atggaaaggttttccttgactctgatgtttttccccctgctgcacaacaagcccttctga >gi568815588r:101727356_101939840|GENSCAN_predicted_peptide_2|158_aa MARTLTSSHLNIPEKRMSVKGGGDFKQGDQEDLVDNFGSEHTPSGDYPNQKLPFGTKLPG MKPELPLRGSQANWEVAGKGRENLQSTGSLREAVKPEDKRPASGLVLRSCHRLLQKAAST YSNGLLEQSVMGPPAAMGACTRSAKGGQRRGIAMVTMY >gi568815588r:101727356_101939840|GENSCAN_predicted_CDS_2|477_bp atggctaggactctgacatccagccatctcaatattccagagaagaggatgagtgtgaag ggtggtggcgattttaaacaaggtgatcaggaagaccttgtggataacttcgggtctgag cacactccatcaggggattatcccaaccagaaacttccatttggaacaaagctcccagga atgaaacccgagctgccactaagaggctcccaggctaactgggaagtggctgggaaaggg agggaaaacttgcaaagcactgggagcctcagggaggctgtgaaacctgaagacaaacga ccagccagcggactggtgctgcggagctgccaccgccttctgcagaaagctgcttccact tactccaatgggctgctggagcagtcagtcatggggccgccagcagcaatgggggcctgc acccgctcagccaagggagggcagaggagaggaatagccatggtgaccatgtactga >gi568815588r:101727356_101939840|GENSCAN_predicted_peptide_3|268_aa MGHRESQRPPAPAAAHSDSVRRGEHDVPRDPRSRVVIAAGLPHPHPLRSRPAQRVLPRRP AGRRDPPGSRCPGAARHGQPPLRAELPHVREQSLVTDQLSRRLIRTYQLYSRTSGKHVQV LANKRINAMAEDGDPFAKLIVETDTFGSRVRVRGAETGLYICMNKKGKLIAKSNGKGKDC VFTEIVLENNYTALQNAKYEGWYMAFTRKGRPRKGSKTRQHQREVHFMKRLPRGHHTTEQ SLRFEFLNYPPFTRSLRGSQRTWAPEPR >gi568815588r:101727356_101939840|GENSCAN_predicted_CDS_3|807_bp atggggcacagggaaagccagcggccaccggctccggcagcggcgcacagcgattcggtg cggcgcggcgagcacgacgttccacgggacccgcggagccgcgtcgtgatcgccgccggc ctcccgcacccgcaccctctccgctcgcgccctgctcagcgcgtcctcccgcggcggccc gcgggacggcgtgacccgccgggctctcggtgccccggggccgcgcgccatgggcagccc ccgctccgcgctgagctgcctcatgtgagggagcagagcctggtgacggatcagctcagc cgccgcctcatccggacctaccaactctacagccgcaccagcgggaagcacgtgcaggtc ctggccaacaagcgcatcaacgccatggcagaggacggcgaccccttcgcaaagctcatc gtggagacggacacctttggaagcagagttcgagtccgaggagccgagacgggcctctac atctgcatgaacaagaaggggaagctgatcgccaagagcaacggcaaaggcaaggactgc gtcttcacggagattgtgctggagaacaactacacagcgctgcagaatgccaagtacgag ggctggtacatggccttcacccgcaagggccggccccgcaagggctccaagacgcggcag caccagcgtgaggtccacttcatgaagcggctgccccggggccaccacaccaccgagcag agcctgcgcttcgagttcctcaactacccgcccttcacgcgcagcctgcgcggcagccag aggacttgggcccccgagccccgatag >gi568815588r:101727356_101939840|GENSCAN_predicted_peptide_4|1188_aa MNLEGLEMVAVLVVLALFVKVLEQFGLFEPVSLEGHPPGPTKKALKQRFLKLLPCCGPQA LPSVSESKCLSCASGGGARCVHSVDDEFELSTVCHRPEGLEQLQEQTKFTRKELQVLYRG FKNECPSGIVNEENFKQIYSQFFPQGDSSTYATFLFNAFDTNHDGSVSFEDFVAGLSVIL RGTVDDRLNWAFNLYDLNKDGCITKEEMLDIMKSIYDMMGKYTYPALREEAPREHVESFF QKMDRNKDGVVTIEEFIESCQKGTRGHSLALRLGFWRPSLKPRLRLTKGSRRCRRPSHPG IRVPAEREGEGQRGRGRSRRGAHLELKPSPGLRAGAPTDRGRGGPAEVAAAGGRRMVQKE SQATLEERESELSSNPAASAGASLEPPAAPAPGEDNPAGAGGAAVAGAAGGARRFLCGVV EEQLMTLISAAREYEIEFIYAISPGLDITFSNPKEVSTLKRKLDQVSQFGCRSFALLFDD IDHNMCAADKEVFSSFAHAQVSITNEIYQYLGEPETFLFCPTEYCGTFCYPNVSQSPYLR TVGEKLLPGIEVLWTGPKVVSKEIPVESIEEVSKIIKRAPVIWDNIHANDYDQKRLFLGP YKGRSTELIPRLKGVLTNPNCEFEANYVAIHTLATWYKSNMNGVRKDVVMTDSEDSTVSI QIKLENEGSDEDIETDVLYSPQMALKLALTEWLQEFGVPHQYSSRQVAHSGAKASVVDGT PLVAAPSLNATTVVTTVYQEPIMSQGAALSGEPTTLTKEEEKKQPDEEPMDMVVEKQEET DHKNDNQILSEIVEAKMAEELKPMDTDKESIAESKSPEMSMQEDCISDIAPMQTDEQTNK EQFVPGPNEKPLYTAEPVTLEDLQLLADLFYLPYEHGPKGAQMLREFQWLRANSSVVSVN CKGKDSEKIEEWRSRAAKFEEMCGLVMGMFTRLSNCANRTILYDMYSYVWDIKSIMSMVK SFVQWLEDEDGICGYALGTVDVTPFIKKCKISWIPFMQEKYTKPNGDKELSEAEDCASPL MGGAGTGRGSPALCVSGAEFEEGSYSMAAGCELSGHTRSFTFKVEEEDDAEHVLALTMLC LTEGAKDECNVVEVVARNHDHQEIAVPVANLKLSCQPMLSLDDFQLQPPVTFRLKSGSGP VRITGRHQIVTMSNDVSEEESEEEEEDSDEEEVELCPILPAKKQGGRP >gi568815588r:101727356_101939840|GENSCAN_predicted_CDS_4|3567_bp atgaacctggaagggctggagatggttgctgtgctcgtggtcctcgctctgtttgtcaag gtcctggagcagtttggcctctttgagcctgtctccttggaaggccaccctccagggccc actaaaaaagcgctgaagcagcgattcctcaagctgctgccgtgctgcgggccccaagcc ctgccctcagtcagtgaaagcaagtgcctctcatgtgcttccgggggcggggctcgatgt gtgcacagcgtggacgatgaatttgaattgtccaccgtgtgtcaccggcctgagggtctg gagcagctgcaggagcaaaccaaattcacgcgcaaggagttgcaggtcctgtaccggggc ttcaagaacgaatgtcccagcggaattgtcaatgaggagaacttcaagcagatttactcc cagttctttcctcaaggagactccagcacctatgccacttttctcttcaatgcctttgac accaaccatgatggctcggtcagttttgaggactttgtggctggtttgtccgtgattctt cggggaactgtagatgacaggcttaattgggccttcaacctgtatgaccttaacaaggac ggctgcatcaccaaggaggaaatgcttgacatcatgaagtccatctatgacatgatgggc aagtacacgtaccctgcactccgggaggaggccccaagggaacacgtggagagcttcttc cagaagatggacagaaacaaggatggtgtggtgaccattgaggaattcattgagtcttgt caaaagggaacgagaggtcacagcctcgctctccgcttaggcttctggcgccccagctta aagccgaggctgcggctgacaaagggctcgcgccggtgccgccgcccttctcatccgggc attcgggtccctgcggagagggagggggaagggcagagggggaggggaaggagccggagg ggcgcacacttggagctgaagccctctccagggctccgggccggtgccccaacggacaga ggtcgaggaggacccgcagaggtggcagcggccgggggcaggaggatggtgcagaaggag agtcaagcgacgttggaggagcgggagagcgagctcagctccaaccctgccgcctctgcg ggggcatcgctggagccgccggcagctccggcacccggagaagacaaccccgccggggct gggggagcggcggtggccggggctgcaggaggggctcggcggttcctctgcggtgtggtg gaagagcaacttatgactctcatctctgctgcacgagaatatgagatagagttcatctat gcgatctcacctggattggatatcactttttctaaccccaaggaagtatccacattgaaa cgtaaattggaccaggtttctcagtttgggtgcagatcatttgctttgctttttgatgat atagaccataatatgtgtgcagcagacaaagaggtattcagttcttttgctcatgcccaa gtctccatcacaaatgaaatctatcagtacctaggagagccagaaactttcctcttctgt cccacagaatactgtggcactttctgttatccaaatgtgtctcagtctccatatttaagg actgtgggtgaaaagcttctacctggaattgaagtgctttggacaggtcccaaagttgtt tctaaagaaattccagtagagtccatcgaagaggtttctaagattattaagagagctcca gtaatctgggataacattcatgctaatgattatgatcagaagagactgtttctgggcccg tacaaaggaagatccacagaactcatcccacggttaaaaggagtcctcactaatccaaat tgtgaatttgaagccaactacgttgctatccacacccttgccacctggtacaaatcaaac atgaatggagtgagaaaagatgtagtgatgactgacagtgaagatagtactgtgtccatc cagataaaattagaaaatgaaggcagtgatgaagatattgaaactgatgtactctatagt ccacagatggctctaaagctagcattaacagaatggttgcaagagtttggtgtgcctcat caatacagcagtaggcaagttgcacacagtggagctaaagcaagtgtagttgatgggact cctttagttgcagcaccctctttaaatgccacaaccgtagtaacaacagtttatcaggag cccattatgagccagggagcagccttgagtggtgagcctactactctgaccaaggaagaa gaaaagaaacagcctgatgaagaacccatggacatggtggtggaaaaacaagaagaaacg gaccacaagaatgacaatcaaatactgagtgaaattgttgaagcgaaaatggcagaggaa ttgaaaccaatggacactgataaagagagcatagctgaatcaaaatccccagagatgtcc atgcaagaagattgtattagtgacattgcccccatgcaaactgatgaacagacaaacaag gagcagtttgtgccaggtccaaatgaaaagcctttgtacactgcggaaccagtgaccctg gaggatttgcagttacttgctgatctattctaccttccttacgagcatggacccaaagga gcacagatgttacgggaatttcaatggcttcgagcaaatagtagtgttgtcagtgtcaat tgcaaaggaaaagactctgaaaaaattgaagaatggcggtcacgagcagccaagtttgaa gagatgtgtggactagtgatgggaatgttcactcggctctccaattgtgccaacaggaca attctttatgacatgtactcctatgtttgggatatcaagagtataatgtctatggtgaag tcttttgtacagtggttagaagatgaagatggcatatgtggttatgccttgggcactgta gatgtgaccccctttattaaaaaatgtaaaatttcctggatccccttcatgcaggagaag tataccaagccaaatggtgacaaggaactctctgaggctgaggactgtgcgtctccttta atgggcggggccggaactggccgagggtccccggctttgtgcgtgtcgggggcggagttt gaagaaggctcttacagcatggccgccggctgtgagctctccggccacacccgctccttc acctttaaggtagaggaagaggatgatgcggagcacgtgctggcactaaccatgctctgc ctcaccgagggagccaaagacgagtgtaatgtggtagaagttgtggcccggaaccatgac catcaggagatcgcagtccctgtggccaacctcaagctgtcctgccaacccatgctcagt ctggatgacttccagctccaaccacctgtaaccttccgcctgaagtcgggctctggccct gtgcggatcactgggcggcaccagattgttacgatgagcaatgatgtttctgaggaggag agcgaggaagaggaagaggacagtgatgaggaagaagttgagctgtgccccatccttcct gccaaaaagcaggggggcaggccctag >gi568815588r:101727356_101939840|GENSCAN_predicted_peptide_5|82_aa MRGQGRKESLSDSRDLDGSYDQLTARRGRWAVRFRGERPRLAAALSERKRERPREDAERV GAWLASPELGPPTGISVESLRP >gi568815588r:101727356_101939840|GENSCAN_predicted_CDS_5|249_bp atgcggggccagggccgcaaggagagtttgtccgattcccgagacctggacggctcctac gaccagctcacggcccggcggggccgctgggcagtgcggttccgaggggaaaggccgcgt ttagcagcggccctgtctgagcggaagagagaaaggccaagagaagatgcggagagggtt ggagcctggctggctagccctgagctcggtcccccgactgggatctcagtggagtccctc cggccctga >gi568815588r:101727356_101939840|GENSCAN_predicted_peptide_6|85_aa MVLEVVRANYDTLTLKLQDGLDQYERYSEQHKEAAFFKELAVREDAIRGSIAYFSFAAME TALAQLQLGLGFDVCSQQQIWLIIP >gi568815588r:101727356_101939840|GENSCAN_predicted_CDS_6|258_bp atggtgctggaggtggtgagagccaactatgacacgctcacgctgaagctgcaggatggc ctggaccagtatgagcgctactcagagcagcacaaggaagctgccttcttcaaagagctg gcagtaagagaagatgctattcgaggtagtattgcctacttctcctttgcagccatggag acagcccttgcacagctccagctgggcctggggtttgacgtgtgttcccagcagcagatc tggctcattattccttaa >gi568815588r:101727356_101939840|GENSCAN_predicted_peptide_7|86_aa MGAPKMSLIQPPNQRVIKTFKAHYIWYSMQRTVNAVGENPIGENIMKIWKDYNTEDAITV TEKAMGEPTQHEDDEEDLYDPLPLNE >gi568815588r:101727356_101939840|GENSCAN_predicted_CDS_7|261_bp atgggtgccccaaagatgtctctaattcagcctccaaatcagagggtcataaagaccttt aaggctcattacatatggtactctatgcaaaggactgtcaatgctgtgggagaaaaccca attggagagaatatcatgaaaatctggaaggattacaacactgaagatgccatcactgtt acagaaaaagccatgggcgagcctactcaacatgaagatgatgaggaagatctttatgac ccacttccacttaatgaatag >gi568815588r:101727356_101939840|GENSCAN_predicted_peptide_8|161_aa MLIGTICVSDFWIRDAQLTVTDEIAKNIGEEMDDWQFAAEYYSSWSQMYPPLLSGTILIR ILCFRAIINHFNPKIESYAAVNHISQLSEEQEKDFDIISCIILEMHPQRIEALIRGEPQF VGNNIVRGPGWHPLRENIQEKNRKILGKGPENTKTYEENGA >gi568815588r:101727356_101939840|GENSCAN_predicted_CDS_8|486_bp atgctcattggaacaatttgtgtttcagatttctggattagggatgctcaactgacagtt acagatgaaattgccaagaacattggagaagaaatggatgattggcagtttgctgcagaa tattatagctcttggtcccagatgtatccacctcttctatctggcaccatattaatacga attctttgtttcagagccatcatcaaccactttaaccccaaaattgagtcctacgctgct gtgaatcacatatcccaactgtcagaggagcaggaaaaagattttgatattatcagctgc atcatattagaaatgcatccacaaaggatcgaagcactcataagaggagaaccccagttt gtggggaacaacattgttaggggccctgggtggcatccattacgtgagaacatccaggag aagaacagaaagatcttggggaaagggccagagaacacaaagacttatgaggaaaatgga gcctga