GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:39:22 Sequence gi568815588r:101686454_101918022 : 231569 bp : 45.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 3021 3112 92 1 2 84 42 91 0.568 1.98 1.02 PlyA + 3312 3317 6 1.05 2.02 PlyA - 3910 3905 6 1.05 2.01 Sngl - 8730 7924 807 0 0 94 47 641 0.937 54.41 2.00 Prom - 16392 16353 40 -3.66 3.00 Prom + 21057 21096 40 -2.96 3.01 Init + 40718 40769 52 1 1 79 71 64 0.821 5.14 3.02 Intr + 46531 46657 127 2 1 109 38 88 0.179 5.64 3.03 Term + 62937 62988 52 0 1 115 39 36 0.010 -1.70 3.04 PlyA + 65522 65527 6 1.05 4.05 PlyA - 66906 66901 6 1.05 4.04 Term - 73107 72980 128 1 2 100 53 127 0.987 8.84 4.03 Intr - 73508 73294 215 1 2 59 77 115 0.865 5.86 4.02 Intr - 80249 80125 125 2 2 106 90 80 0.625 9.38 4.01 Init - 80550 80542 9 0 0 69 65 4 0.313 -3.43 4.00 Prom - 80616 80577 40 -9.26 5.06 PlyA - 82022 82017 6 -1.75 5.05 Term - 84166 83876 291 1 0 90 43 720 0.999 63.04 5.04 Intr - 85116 85010 107 1 2 58 100 167 0.857 14.83 5.03 Intr - 88426 88279 148 1 1 55 99 333 0.814 30.91 5.02 Intr - 89659 89416 244 0 1 38 76 171 0.356 8.40 5.01 Init - 91051 91035 17 1 2 94 81 -3 0.539 -0.52 5.00 Prom - 92888 92849 40 -10.84 6.23 PlyA - 94896 94891 6 1.05 6.22 Term - 95401 95283 119 2 2 73 41 155 0.992 8.00 6.21 Intr - 95898 95805 94 0 1 87 77 103 0.999 8.64 6.20 Intr - 96144 96025 120 0 0 58 101 169 0.994 15.89 6.19 Intr - 96471 96386 86 1 2 59 77 146 0.972 10.14 6.18 Intr - 97033 96928 106 1 1 101 41 54 0.494 1.79 6.17 Intr - 104585 104443 143 0 2 76 75 76 0.451 5.17 6.16 Intr - 111701 111527 175 2 1 49 82 104 0.881 5.41 6.15 Intr - 113002 112389 614 2 2 96 87 539 0.916 46.70 6.14 Intr - 113947 113789 159 2 0 81 73 128 0.867 10.56 6.13 Intr - 117566 117282 285 0 0 103 80 180 0.981 16.01 6.12 Intr - 119690 119592 99 0 0 87 111 -19 0.565 0.38 6.11 Intr - 121448 121277 172 2 1 130 72 -3 0.991 1.82 6.10 Intr - 123861 123731 131 1 2 42 93 100 0.967 6.31 6.09 Intr - 131848 131371 478 1 1 65 105 120 0.005 4.02 6.08 Intr - 141298 141236 63 1 0 71 87 53 0.727 2.41 6.07 Intr - 141540 141436 105 0 0 63 80 125 0.993 9.61 6.06 Intr - 141805 141698 108 1 0 92 46 154 0.999 12.08 6.05 Intr - 142006 141936 71 2 2 58 83 99 0.738 5.30 6.04 Intr - 142243 142174 70 1 1 123 78 67 0.932 7.95 6.03 Intr - 142746 142622 125 1 2 98 92 221 0.999 23.80 6.02 Intr - 144714 144574 141 1 0 110 28 65 0.792 3.02 6.01 Init - 147916 147814 103 1 1 100 84 153 0.966 14.70 6.00 Prom - 152757 152718 40 -5.26 7.03 PlyA - 153745 153740 6 1.05 7.02 Term - 154593 154418 176 1 2 63 48 69 0.427 -1.68 7.01 Init - 157115 157043 73 2 1 93 100 134 0.995 16.33 7.00 Prom - 161616 161577 40 -9.16 8.04 PlyA - 163017 163012 6 1.05 8.03 Term - 163173 163036 138 0 0 53 37 151 0.424 4.46 8.02 Intr - 163439 163323 117 2 0 110 105 250 0.235 29.56 8.01 Init - 168816 168814 3 0 0 98 53 0 0.101 -2.50 8.00 Prom - 169470 169431 40 -4.06 9.00 Prom + 177968 178007 40 -2.46 9.01 Init + 180903 181103 201 2 0 95 46 155 0.785 10.88 9.02 Term + 182143 182202 60 0 0 124 39 86 0.996 5.10 9.03 PlyA + 182925 182930 6 1.05 10.04 PlyA - 184967 184962 6 1.05 10.03 Term - 194433 194270 164 1 2 48 43 94 0.134 -1.00 10.02 Intr - 196657 196609 49 1 1 74 75 33 0.159 -1.15 10.01 Intr - 203037 202959 79 2 1 54 115 90 0.481 7.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_1|30_aa XTIKNKAFPNAEDNKYIHIRNHSVNYRTNK >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_1|93_bp naaaccattaaaaataaggcctttccaaacgccgaagacaataaatacatccatattcga aatcattccgtgaattacagaacaaataagtga >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_2|268_aa MVSPPPGAAAVAARAGLRVGPAPRPLMGSQGRSGPPGNGGPGEGEGGEARKLQEGRVARG KRRKGKGKGKARAGQGGRGSGAEGKPGPQTAKEAAGPGADAGARACPREEAEGGRSVEEG ARGIVKGVEGSAGAGKEAQGREYGKKEEWRVRARRREGARPGRAQGRGGQAWADIAGTGV AMAAAAGEEEEEEEAARESAARPAAGPALWRLPEELLLLICSYLDMRALGRLAQVCRWLR RFTSCDLLWRRIARASLNSGFTRLGTDL >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_2|807_bp atggtgtcgcctcctcccggggcggcggcggtggcggctcgggctgggctccgcgtcggg ccggccccgcggccgctcatgggcagccagggccgctcggggccccccgggaacggcggg cccggcgagggcgagggcggagaggcgaggaagctgcaggaagggagggtggcgaggggg aagcgaaggaaggggaaggggaagggaaaagcgagagcggggcaaggcggaagaggaagc ggggcggaagggaagcccgggccgcagacggcgaaggaggcagccgggccgggggctgac gcgggagcgagggcatgcccaagggaggaagcagagggaggcagaagcgtggaggaaggg gcgagaggcatcgtcaagggagtcgaggggagcgcaggggccgggaaggaggcacaagga agagagtatgggaagaaggaggaatggagggtcagggctaggcggcgggagggcgccagg ccgggaagagcacagggacgagggggtcaggcttgggccgacatcgcggggacaggggtg gccatggcggcggcggccggggaggaggaggaggaggaggaggcggctcgggagtcggct gcccgcccggccgcggggcctgcgctctggcgcctgccggaggagctgctgctgctcatc tgctcctacctggacatgcgggccctcggccgcctggcccaggtgtgccgctggctgcgg cgcttcaccagctgcgatctgctctggcgccggatagcccgggcctcgctcaactccggc ttcacgcggctcggcaccgacctgtaa >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_3|76_aa MGIAFLRESNAATDLMGGCVFRLRKKQIPPSSMKMQEAAVACSKHNVIVTGPHGPWCMER FSLTLMFFPLLHNKPF >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_3|231_bp atggggatcgcattcctacgagaatctaatgccgccactgatctgatgggaggctgtgtc ttcaggcttaggaaaaagcagatacctcccagtagcatgaagatgcaagaagctgcagtt gcctgtagcaaacacaacgtgattgtaacaggtccccatggcccttggtgcatggaaagg ttttccttgactctgatgtttttccccctgctgcacaacaagcccttctga >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_4|158_aa MARTLTSSHLNIPEKRMSVKGGGDFKQGDQEDLVDNFGSEHTPSGDYPNQKLPFGTKLPG MKPELPLRGSQANWEVAGKGRENLQSTGSLREAVKPEDKRPASGLVLRSCHRLLQKAAST YSNGLLEQSVMGPPAAMGACTRSAKGGQRRGIAMVTMY >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_4|477_bp atggctaggactctgacatccagccatctcaatattccagagaagaggatgagtgtgaag ggtggtggcgattttaaacaaggtgatcaggaagaccttgtggataacttcgggtctgag cacactccatcaggggattatcccaaccagaaacttccatttggaacaaagctcccagga atgaaacccgagctgccactaagaggctcccaggctaactgggaagtggctgggaaaggg agggaaaacttgcaaagcactgggagcctcagggaggctgtgaaacctgaagacaaacga ccagccagcggactggtgctgcggagctgccaccgccttctgcagaaagctgcttccact tactccaatgggctgctggagcagtcagtcatggggccgccagcagcaatgggggcctgc acccgctcagccaagggagggcagaggagaggaatagccatggtgaccatgtactga >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_5|268_aa MGHRESQRPPAPAAAHSDSVRRGEHDVPRDPRSRVVIAAGLPHPHPLRSRPAQRVLPRRP AGRRDPPGSRCPGAARHGQPPLRAELPHVREQSLVTDQLSRRLIRTYQLYSRTSGKHVQV LANKRINAMAEDGDPFAKLIVETDTFGSRVRVRGAETGLYICMNKKGKLIAKSNGKGKDC VFTEIVLENNYTALQNAKYEGWYMAFTRKGRPRKGSKTRQHQREVHFMKRLPRGHHTTEQ SLRFEFLNYPPFTRSLRGSQRTWAPEPR >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_5|807_bp atggggcacagggaaagccagcggccaccggctccggcagcggcgcacagcgattcggtg cggcgcggcgagcacgacgttccacgggacccgcggagccgcgtcgtgatcgccgccggc ctcccgcacccgcaccctctccgctcgcgccctgctcagcgcgtcctcccgcggcggccc gcgggacggcgtgacccgccgggctctcggtgccccggggccgcgcgccatgggcagccc ccgctccgcgctgagctgcctcatgtgagggagcagagcctggtgacggatcagctcagc cgccgcctcatccggacctaccaactctacagccgcaccagcgggaagcacgtgcaggtc ctggccaacaagcgcatcaacgccatggcagaggacggcgaccccttcgcaaagctcatc gtggagacggacacctttggaagcagagttcgagtccgaggagccgagacgggcctctac atctgcatgaacaagaaggggaagctgatcgccaagagcaacggcaaaggcaaggactgc gtcttcacggagattgtgctggagaacaactacacagcgctgcagaatgccaagtacgag ggctggtacatggccttcacccgcaagggccggccccgcaagggctccaagacgcggcag caccagcgtgaggtccacttcatgaagcggctgccccggggccaccacaccaccgagcag agcctgcgcttcgagttcctcaactacccgcccttcacgcgcagcctgcgcggcagccag aggacttgggcccccgagccccgatag >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_6|1188_aa MNLEGLEMVAVLVVLALFVKVLEQFGLFEPVSLEGHPPGPTKKALKQRFLKLLPCCGPQA LPSVSESKCLSCASGGGARCVHSVDDEFELSTVCHRPEGLEQLQEQTKFTRKELQVLYRG FKNECPSGIVNEENFKQIYSQFFPQGDSSTYATFLFNAFDTNHDGSVSFEDFVAGLSVIL RGTVDDRLNWAFNLYDLNKDGCITKEEMLDIMKSIYDMMGKYTYPALREEAPREHVESFF QKMDRNKDGVVTIEEFIESCQKGTRGHSLALRLGFWRPSLKPRLRLTKGSRRCRRPSHPG IRVPAEREGEGQRGRGRSRRGAHLELKPSPGLRAGAPTDRGRGGPAEVAAAGGRRMVQKE SQATLEERESELSSNPAASAGASLEPPAAPAPGEDNPAGAGGAAVAGAAGGARRFLCGVV EEQLMTLISAAREYEIEFIYAISPGLDITFSNPKEVSTLKRKLDQVSQFGCRSFALLFDD IDHNMCAADKEVFSSFAHAQVSITNEIYQYLGEPETFLFCPTEYCGTFCYPNVSQSPYLR TVGEKLLPGIEVLWTGPKVVSKEIPVESIEEVSKIIKRAPVIWDNIHANDYDQKRLFLGP YKGRSTELIPRLKGVLTNPNCEFEANYVAIHTLATWYKSNMNGVRKDVVMTDSEDSTVSI QIKLENEGSDEDIETDVLYSPQMALKLALTEWLQEFGVPHQYSSRQVAHSGAKASVVDGT PLVAAPSLNATTVVTTVYQEPIMSQGAALSGEPTTLTKEEEKKQPDEEPMDMVVEKQEET DHKNDNQILSEIVEAKMAEELKPMDTDKESIAESKSPEMSMQEDCISDIAPMQTDEQTNK EQFVPGPNEKPLYTAEPVTLEDLQLLADLFYLPYEHGPKGAQMLREFQWLRANSSVVSVN CKGKDSEKIEEWRSRAAKFEEMCGLVMGMFTRLSNCANRTILYDMYSYVWDIKSIMSMVK SFVQWLEDEDGICGYALGTVDVTPFIKKCKISWIPFMQEKYTKPNGDKELSEAEDCASPL MGGAGTGRGSPALCVSGAEFEEGSYSMAAGCELSGHTRSFTFKVEEEDDAEHVLALTMLC LTEGAKDECNVVEVVARNHDHQEIAVPVANLKLSCQPMLSLDDFQLQPPVTFRLKSGSGP VRITGRHQIVTMSNDVSEEESEEEEEDSDEEEVELCPILPAKKQGGRP >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_6|3567_bp atgaacctggaagggctggagatggttgctgtgctcgtggtcctcgctctgtttgtcaag gtcctggagcagtttggcctctttgagcctgtctccttggaaggccaccctccagggccc actaaaaaagcgctgaagcagcgattcctcaagctgctgccgtgctgcgggccccaagcc ctgccctcagtcagtgaaagcaagtgcctctcatgtgcttccgggggcggggctcgatgt gtgcacagcgtggacgatgaatttgaattgtccaccgtgtgtcaccggcctgagggtctg gagcagctgcaggagcaaaccaaattcacgcgcaaggagttgcaggtcctgtaccggggc ttcaagaacgaatgtcccagcggaattgtcaatgaggagaacttcaagcagatttactcc cagttctttcctcaaggagactccagcacctatgccacttttctcttcaatgcctttgac accaaccatgatggctcggtcagttttgaggactttgtggctggtttgtccgtgattctt cggggaactgtagatgacaggcttaattgggccttcaacctgtatgaccttaacaaggac ggctgcatcaccaaggaggaaatgcttgacatcatgaagtccatctatgacatgatgggc aagtacacgtaccctgcactccgggaggaggccccaagggaacacgtggagagcttcttc cagaagatggacagaaacaaggatggtgtggtgaccattgaggaattcattgagtcttgt caaaagggaacgagaggtcacagcctcgctctccgcttaggcttctggcgccccagctta aagccgaggctgcggctgacaaagggctcgcgccggtgccgccgcccttctcatccgggc attcgggtccctgcggagagggagggggaagggcagagggggaggggaaggagccggagg ggcgcacacttggagctgaagccctctccagggctccgggccggtgccccaacggacaga ggtcgaggaggacccgcagaggtggcagcggccgggggcaggaggatggtgcagaaggag agtcaagcgacgttggaggagcgggagagcgagctcagctccaaccctgccgcctctgcg ggggcatcgctggagccgccggcagctccggcacccggagaagacaaccccgccggggct gggggagcggcggtggccggggctgcaggaggggctcggcggttcctctgcggtgtggtg gaagagcaacttatgactctcatctctgctgcacgagaatatgagatagagttcatctat gcgatctcacctggattggatatcactttttctaaccccaaggaagtatccacattgaaa cgtaaattggaccaggtttctcagtttgggtgcagatcatttgctttgctttttgatgat atagaccataatatgtgtgcagcagacaaagaggtattcagttcttttgctcatgcccaa gtctccatcacaaatgaaatctatcagtacctaggagagccagaaactttcctcttctgt cccacagaatactgtggcactttctgttatccaaatgtgtctcagtctccatatttaagg actgtgggtgaaaagcttctacctggaattgaagtgctttggacaggtcccaaagttgtt tctaaagaaattccagtagagtccatcgaagaggtttctaagattattaagagagctcca gtaatctgggataacattcatgctaatgattatgatcagaagagactgtttctgggcccg tacaaaggaagatccacagaactcatcccacggttaaaaggagtcctcactaatccaaat tgtgaatttgaagccaactacgttgctatccacacccttgccacctggtacaaatcaaac atgaatggagtgagaaaagatgtagtgatgactgacagtgaagatagtactgtgtccatc cagataaaattagaaaatgaaggcagtgatgaagatattgaaactgatgtactctatagt ccacagatggctctaaagctagcattaacagaatggttgcaagagtttggtgtgcctcat caatacagcagtaggcaagttgcacacagtggagctaaagcaagtgtagttgatgggact cctttagttgcagcaccctctttaaatgccacaaccgtagtaacaacagtttatcaggag cccattatgagccagggagcagccttgagtggtgagcctactactctgaccaaggaagaa gaaaagaaacagcctgatgaagaacccatggacatggtggtggaaaaacaagaagaaacg gaccacaagaatgacaatcaaatactgagtgaaattgttgaagcgaaaatggcagaggaa ttgaaaccaatggacactgataaagagagcatagctgaatcaaaatccccagagatgtcc atgcaagaagattgtattagtgacattgcccccatgcaaactgatgaacagacaaacaag gagcagtttgtgccaggtccaaatgaaaagcctttgtacactgcggaaccagtgaccctg gaggatttgcagttacttgctgatctattctaccttccttacgagcatggacccaaagga gcacagatgttacgggaatttcaatggcttcgagcaaatagtagtgttgtcagtgtcaat tgcaaaggaaaagactctgaaaaaattgaagaatggcggtcacgagcagccaagtttgaa gagatgtgtggactagtgatgggaatgttcactcggctctccaattgtgccaacaggaca attctttatgacatgtactcctatgtttgggatatcaagagtataatgtctatggtgaag tcttttgtacagtggttagaagatgaagatggcatatgtggttatgccttgggcactgta gatgtgaccccctttattaaaaaatgtaaaatttcctggatccccttcatgcaggagaag tataccaagccaaatggtgacaaggaactctctgaggctgaggactgtgcgtctccttta atgggcggggccggaactggccgagggtccccggctttgtgcgtgtcgggggcggagttt gaagaaggctcttacagcatggccgccggctgtgagctctccggccacacccgctccttc acctttaaggtagaggaagaggatgatgcggagcacgtgctggcactaaccatgctctgc ctcaccgagggagccaaagacgagtgtaatgtggtagaagttgtggcccggaaccatgac catcaggagatcgcagtccctgtggccaacctcaagctgtcctgccaacccatgctcagt ctggatgacttccagctccaaccacctgtaaccttccgcctgaagtcgggctctggccct gtgcggatcactgggcggcaccagattgttacgatgagcaatgatgtttctgaggaggag agcgaggaagaggaagaggacagtgatgaggaagaagttgagctgtgccccatccttcct gccaaaaagcaggggggcaggccctag >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_7|82_aa MRGQGRKESLSDSRDLDGSYDQLTARRGRWAVRFRGERPRLAAALSERKRERPREDAERV GAWLASPELGPPTGISVESLRP >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_7|249_bp atgcggggccagggccgcaaggagagtttgtccgattcccgagacctggacggctcctac gaccagctcacggcccggcggggccgctgggcagtgcggttccgaggggaaaggccgcgt ttagcagcggccctgtctgagcggaagagagaaaggccaagagaagatgcggagagggtt ggagcctggctggctagccctgagctcggtcccccgactgggatctcagtggagtccctc cggccctga >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_8|85_aa MVLEVVRANYDTLTLKLQDGLDQYERYSEQHKEAAFFKELAVREDAIRGSIAYFSFAAME TALAQLQLGLGFDVCSQQQIWLIIP >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_8|258_bp atggtgctggaggtggtgagagccaactatgacacgctcacgctgaagctgcaggatggc ctggaccagtatgagcgctactcagagcagcacaaggaagctgccttcttcaaagagctg gcagtaagagaagatgctattcgaggtagtattgcctacttctcctttgcagccatggag acagcccttgcacagctccagctgggcctggggtttgacgtgtgttcccagcagcagatc tggctcattattccttaa >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_9|86_aa MGAPKMSLIQPPNQRVIKTFKAHYIWYSMQRTVNAVGENPIGENIMKIWKDYNTEDAITV TEKAMGEPTQHEDDEEDLYDPLPLNE >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_9|261_bp atgggtgccccaaagatgtctctaattcagcctccaaatcagagggtcataaagaccttt aaggctcattacatatggtactctatgcaaaggactgtcaatgctgtgggagaaaaccca attggagagaatatcatgaaaatctggaaggattacaacactgaagatgccatcactgtt acagaaaaagccatgggcgagcctactcaacatgaagatgatgaggaagatctttatgac ccacttccacttaatgaatag >gi568815588r:101686454_101918022|GENSCAN_predicted_peptide_10|97_aa XAIINHFNPKIESYAAVNHISQLSEEQEKDFDIISCIILEMHPQRIEALIRGEPQFVGNN IVRGPGWHPLRENIQEKNRKILGKGPENTKTYEENGA >gi568815588r:101686454_101918022|GENSCAN_predicted_CDS_10|294_bp nnagccatcatcaaccactttaaccccaaaattgagtcctacgctgctgtgaatcacata tcccaactgtcagaggagcaggaaaaagattttgatattatcagctgcatcatattagaa atgcatccacaaaggatcgaagcactcataagaggagaaccccagtttgtggggaacaac attgttaggggccctgggtggcatccattacgtgagaacatccaggagaagaacagaaag atcttggggaaagggccagagaacacaaagacttatgaggaaaatggagcctga