GENSCAN 1.0	Date run: 4-Nov-116	Time: 10:07:54

Sequence gi568815588f:28577898_28782398 : 204501 bp : 42.48% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   1348   1387   40                              -2.65
 1.01 Init +   3119   3221  103  1  1   54   97    31 0.621   1.05
 1.02 Intr +   5502   5608  107  1  2   89   92    51 0.797   4.61
 1.03 Intr +  17880  18144  265  2  1   68   91   237 0.767  18.16
 1.04 Intr +  24677  24793  117  0  0   67   82    35 0.543   0.32
 1.05 Intr +  30289  30534  246  2  0   60   94   205 0.995  14.81
 1.06 Intr +  33877  34025  149  2  2  107   95   102 0.999  11.83
 1.07 Intr +  36670  36788  119  0  2   37  100   144 0.996   8.84
 1.08 Intr +  38276  38465  190  2  1   57   91    90 0.827   4.87
 1.09 Intr +  39760  39887  128  0  2   78  115    74 0.911   7.66
 1.10 Intr +  41640  41703   64  0  1  101   46     1 0.347  -5.00
 1.11 Term +  43351  43467  117  0  0   76   47   107 0.409   2.96
 1.12 PlyA +  43905  43910    6                              -0.45

 2.03 PlyA -  44667  44662    6                               1.05
 2.02 Term -  46727  46494  234  2  0   79   49   166 0.911   7.14
 2.01 Init -  51045  50833  213  0  0   65   81   121 0.941   7.90
 2.00 Prom -  52048  52009   40                             -12.82

 3.00 Prom +  52449  52488   40                              -6.15
 3.01 Init +  52864  53058  195  0  0   88   99   120 0.988  12.08
 3.02 Intr +  58780  58863   84  0  0   97   53    75 0.059   4.00
 3.03 Intr +  80331  80456  126  2  0    0   62   179 0.058   6.46
 3.04 Intr +  86059  86295  237  0  0   44   68   168 0.522   7.29
 3.05 Term +  88479  88544   66  2  0   97   37    95 0.560   2.26
 3.06 PlyA +  90330  90335    6                               1.05

 4.00 Prom +  91597  91636   40                              -6.75
 4.01 Init + 100055 100471  417  1  0  101   65   245 0.225  18.04
 4.02 Intr + 103462 103648  187  0  1   85   96   119 0.232  10.74
 4.03 Intr + 104086 104375  290  2  2   98   77   226 0.048  18.54
 4.04 Intr + 123418 123516   99  0  0   39   92    53 0.011   0.19
 4.05 Intr + 127549 127740  192  0  0   57   23   178 0.561   6.97
 4.06 Intr + 131758 131841   84  0  0  101   73    58 0.710   4.70
 4.07 Intr + 132497 132658  162  1  0   62   84    54 0.505   1.65
 4.08 Intr + 132974 133079  106  1  1    2   99    99 0.391   1.27
 4.09 Intr + 135461 135530   70  0  1   79   86    20 0.396  -1.78
 4.10 Term + 138681 138759   79  0  1   35   44   178 0.695   4.36
 4.11 PlyA + 140941 140946    6                               1.05

 5.10 PlyA - 142822 142817    6                               1.05
 5.09 Term - 144639 144545   95  1  2   66   47   112 0.269   1.91
 5.08 Intr - 144883 144742  142  1  1   22   74   107 0.656   1.71
 5.07 Intr - 158250 158077  174  0  0   78   98    38 0.038   3.01
 5.06 Intr - 165687 165546  142  2  1   82   37   103 0.065   4.03
 5.05 Intr - 169023 168874  150  2  0   71   49    97 0.019   2.46
 5.04 Intr - 174405 174325   81  2  0   74   98    73 0.016   4.73
 5.03 Intr - 183542 183433  110  2  2   56  105    35 0.000   0.16
 5.02 Intr - 200274 200135  140  1  2   62   77    65 0.286   2.06
 5.01 Init - 203371 203191  181  1  1   45   57   128 0.162   4.79

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 104086 104504  419  2  2   98   49   284 0.912  19.95
S.002 Term + 174593 174778  186  1  0   78   42   189 0.843   9.81
S.003 Term + 186016 186303  288  0  0   41   45   195 0.817   4.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:28577898_28782398|GENSCAN_predicted_peptide_1|534_aa
MHLIPPATNCDTLVECCLPGKLTRDLVCTAFTGFGTSYSPQENSHNHSALHSSNSHSSNP
SNNPSKTSDANILSQTSRHNDRDYRLPRAETHSSSTPVQHPIKPVVHPTATPSTVPSSPF
TLQSDHQPKKSFDANGASTLSKLPTPTSSVPAQKTERKVLQQLSKHNRKVMIRSALITSI
ILDLLYIEKDLRYQKNIESTSGDKPVSHSCTTPSTSSASGLNPTSAPPTSASAVPVSPVP
QSPIPPLLQDPNLLRQLLPALQATLQLNNSNVDISKINEAQPSNQSPMSLTSDASSPRSY
VSPRISTPQTNTVPIKPLISTPPVSSQPKVSTPVVKQGPVSQSATQQPVTADKQQGHEPV
SPRSLQRSSSQRSPSPGPNHTSNSSNASNATVVPQNSSARSTCSLTPALAAHFSENLIKH
VQGWPADHAEKQASRLREEAHNMGTIHMSEICTELKNLRSLVRVCEIQATLREQRILFLR
QQIKELEKLKNQNSFMISASKTGSTKGLFFMLDINVNVTTFSDANINFEIALVL

>gi568815588f:28577898_28782398|GENSCAN_predicted_CDS_1|1605_bp
atgcacttaattcctccagcaacaaattgtgacacacttgtggaatgttgtctaccaggg
aagctcactagagatttagtgtgcacggcttttactggatttgggaccagttactctcca
caagaaaattcacacaaccacagtgctcttcatagttcaaattcacattcttctaatcca
agcaataacccaagcaaaacttcagatgcaaatattttgtctcaaacaagcagacacaat
gacagagactacagactgccaagagcagagactcacagtagttctacgccagtacagcac
cccatcaaaccagtggttcatccaactgctaccccaagcactgttccttctagtccattt
acgctacagtctgatcaccagccaaagaaatcatttgatgctaatggagcatctacttta
tcaaaactgcctacacccacatcttctgtccctgcacagaaaacagaaagaaaagtatta
cagcaactgtcaaaacacaacaggaaagtaatgattagaagtgcacttattaccagtatc
attttagatcttctgtatattgaaaaggatttgagataccagaaaaatatagaatctaca
tcaggagacaaacccgtatcacattcttgcacaactccttccacgtcttctgcctctgga
ctgaaccccacatctgcacctccaacatctgcttcagcggtccctgtttctcctgttcca
cagtcgccaatacctcccttacttcaggacccaaatcttcttagacaattgcttcctgct
ttgcaagccacgctgcagcttaataattctaatgtggacatatctaaaataaatgaagcc
cagccatctaatcagtctccgatgtctttaacatctgatgcgtcatccccaagatcatat
gtttctccaagaataagcacacctcaaactaacacagtccctatcaaacctttgatcagt
actcctcctgtttcatcacagccaaaggttagtactccagtagttaagcaaggaccagtg
tcacagtcagccacacagcagcctgtaactgctgacaagcagcaaggtcatgaacctgtc
tctcctcgaagtcttcagcgctcaagtagccagagaagtccatcacctggtcccaatcat
acttctaatagtagtaatgcatcaaatgcaacagttgtaccacagaattcttctgcccga
tccacgtgttcattaacgcctgcactagcagcacacttcagtgaaaatctcataaaacac
gttcaaggatggcctgcagatcatgcagagaagcaggcatcaagattacgcgaagaagcg
cataacatgggaactattcacatgtccgaaatttgtactgaattaaaaaatttaagatct
ttagtccgagtatgtgaaattcaagcaactttgcgagagcaaaggatactatttttgaga
caacaaattaaggaacttgaaaagctaaaaaatcagaattccttcatgataagtgcttcc
aaaactggcagcaccaagggcttattttttatgttagacatcaatgtcaatgttactaca
ttctcggatgctaacataaattttgaaattgctcttgtgctttaa

>gi568815588f:28577898_28782398|GENSCAN_predicted_peptide_2|148_aa
MDSGGQWITLLCLQPVTQSCSQHVGAVHLPDSAKEATVHPYPGSSANKVTRAEAEEVFER
TKPAVAQLWREKSEGGWLDVKASSKPQVKPVKPGDLVKPGQSPTPMPHPILALEAGSLPP
TSPSPQDCRIRSSISFLIGLFTPKTCSS

>gi568815588f:28577898_28782398|GENSCAN_predicted_CDS_2|447_bp
atggactctggaggccaatggataactctcctgtgcttgcagcctgtgacgcagtcctgc
tctcaacatgtgggtgctgtccacctccctgactctgcaaaagaagccactgtccaccca
tacccaggtagctctgcgaacaaggtgactagagcagaggcagaagaggttttcgagagg
acaaagccagcagtggctcagttgtggagagaaaagtcagaagggggctggttggatgtc
aaagcatcttctaaaccacaggtgaaacctgtgaaaccaggtgacctagtgaaaccaggc
caatctccaacccccatgcctcacccaatcctggctttagaagcagggtcacttcctcct
accagtccatctccccaggattgtcgcatcagaagcagcatctcatttctcattggctta
tttactcctaaaacgtgttcctcttag

>gi568815588f:28577898_28782398|GENSCAN_predicted_peptide_3|235_aa
MEAAGSICLPYSNHSEKPKALWLEGSTSHSLSSCKPTWLLYLHAMDRLRGEGHRAAAGGT
GLAVQACTDSGRLVLQVCMEQGFSIQACYWAVPNWRIPGASSSRPKVLKGWSPVGSISIT
WELTRPAGDAPETGQNDLASCINSKQQLLSSQLSAASLSPQEGEKDLRQDLSLPAGGWKR
KLWAPLRGHFILWKIKVIRDAKVPVLAEETVVTLLGIFGLEQLRKVRADTQVLTP

>gi568815588f:28577898_28782398|GENSCAN_predicted_CDS_3|708_bp
atggaagctgcaggaagcatctgcctgccctattctaaccactctgaaaaacccaaagcc
ttgtggctggagggatctacttcccattcactctcatcgtgtaagccgacttggcttctg
tacctccatgccatggaccggcttagaggagaaggacatagggctgctgcgggtggaaca
gggttagcagtgcaggcctgcacggactcagggcggcttgtgttgcaagtgtgcatggag
cagggtttctctatccaggcatgctactgggctgttccgaactggagaatacctggtgcc
tctagctctaggccaaaggttctcaaagggtggtccccagttggcagcatcagcatcacc
tgggaacttacaagacctgcaggagatgcaccagagactggacagaacgaccttgcttct
tgtatcaacagcaaacagcagctgctgtcttcccagctgagcgctgcttccctctcacca
caagagggcgagaaagacctgcgacaggacctctccctccctgcgggaggatggaagaga
aagctctgggctccgttaagagggcattttattctctggaaaataaaagtaataagagat
gccaaagttccagtgttggctgaagaaacagtggttaccctgttgggcatctttggtctt
gaacagctcaggaaagtccgagctgacactcaggttcttactccttag

>gi568815588f:28577898_28782398|GENSCAN_predicted_peptide_4|561_aa
MAVLLTKGGCRGRTGPGVPSSPRGFGAGCRGFVSGRRGSLLRRSRRSRQVAVFLEVAAVA
WRVDADSDLRAGADQRVLLALPAGSGAFAAQRGAGPESRPRRRRFRTLGARAAGRNRRSW
PVAAACPQRVCVSASGCISNSNSPLTHGCLDSLASTTDICQAKQARNHSGTTIPTLECCH
EDMCNYRGLHDVLSPPRGEASGQGNRYQHDGSRNLITKVQELTSSKELWFRAAVIAVPIA
GGLILVLLIMLALRMLRSENKRLQDQRQQMLSRLHYSFHGHHSKKGQVAKLDLECMVPAG
SETARGQQPQSWLIQAEQGLIEWNTNVLWMQFYNNQLGRKWQITAAVCTPFGPISLANQA
ARRPWACSSPGCVAHSGCGAVGDSEGASGLTPGGGDMMLEGTSGMIPGQNVDGSQPKRKI
QIIPGFLSSTSCHSESEQRSPCHYSGQGKLWIQIPAVRKVIRVLQGHSFAACHVKSRIQE
VQNENEKADLDGMLLEGKGRCSGQRWCPGPGTVQPLQNTILLLTSAATARQPVLEGFEGI
EGRWLFNDEDEEEPENAKEEE

>gi568815588f:28577898_28782398|GENSCAN_predicted_CDS_4|1686_bp
atggccgtgctgctcaccaaaggtgggtgccgggggcgcacggggcccggggtcccgtcc
tcgccgcggggctttggagccggctgcagaggctttgttagcggcaggcgaggctccctc
ctgcgccggagccgcaggtcgcgacaagttgctgtgttcttagaagtcgcggcggtggcc
tggcgcgtggatgcggactccgatctaagggcaggcgctgatcagagggtcctcctggct
ctgcctgcggggagcggcgcgtttgcagcccagcggggagcggggccggagtctcggcct
cgacgccgccgcttccgcaccctgggggctcgggcggccgggaggaatcgcagatcctgg
cctgtagctgccgcgtgccctcagcgtgtctgtgtgagtgcctcagggtgtatttctaac
tcaaattccccactcacccatggctgcctggactctcttgcaagcacgacagacatctgc
caagccaaacaggcccgaaaccactctggcaccaccatacccacattggaatgctgtcat
gaagacatgtgcaattacagagggctgcacgatgttctctctcctcccaggggtgaggcc
tcaggacaaggaaacaggtatcagcatgatggtagcagaaaccttatcaccaaggtgcag
gagctgacttcttccaaagagttgtggttccgggcagcggtcattgccgtgcccattgct
ggagggctgattttagtgttgcttattatgttggccctgaggatgcttcgaagtgaaaat
aagaggctgcaggatcagcggcaacagatgctctcccgtttgcactacagctttcacgga
caccattccaaaaaggggcaggttgcaaagttagacttggaatgcatggtgccggctggg
tcagagacagcgagaggccagcagccacagtcctggctgatccaggcagagcaaggactg
atagaatggaacacaaatgtgctgtggatgcagttttacaacaaccaactgggaaggaag
tggcagataacagcggcggtctgcaccccttttggccccatatccctggccaaccaggct
gctagacgaccgtgggcctgcagcagccctggctgtgtggcccactcaggttgtggggca
gttggagactctgagggagcatcaggactgactcctggtggcggggatatgatgctagaa
gggacctcagggatgattccaggccagaatgttgacggctcccaaccaaagagaaaaatt
cagattatacctggcttcttgtcctccaccagctgtcactcggaaagtgaacagagaagc
ccttgccactatagtggacaagggaaactgtggatacagattccagcagtgagaaaggtc
attagagttctccaagggcacagctttgcagcatgtcatgtcaaaagcaggatccaggaa
gttcagaatgaaaatgaaaaagcagatctagatggaatgcttctagaaggcaaaggaagg
tgcagtggacagaggtggtgtccaggccctggcactgtgcagcccctgcagaataccata
ttgctgctcacctctgcagccactgccagacagcctgtactggaaggctttgaaggcatt
gaaggaagatggctgtttaatgatgaagatgaagaagagcctgaaaatgctaaagaggaa
gaatga

>gi568815588f:28577898_28782398|GENSCAN_predicted_peptide_5|404_aa
MMMEAEVREQIFAAGFEDRGRNYEEAAFRSKKRLRNGCFPVMVNTECQLDWTEGCKVLFL
DPGPEAKFPCTSHSLATNVVCLSLVALRGKDEENSCSVQLIPSDDFKNPSYSSISQTSLF
CDRVLANEIQADTLKSTFRKSPKRRSQHLASKGTKLGENEFDELTEVGFRRNMDEVGNHH
SQQTNTGTENQTPHVLTHKCELTMRTHEHREGNITHWGLSRSLPDPQAAADCHHNRPLTS
TLMAKLPPANPSPPPLKPHLSQQLTSASCWYTDLLHIGQCTSYGHRHSWISYTLFPLGSE
LNCKHQESSVSPCCKFYLVQCLHECGAPPERGTSAEMNIDLTWLSLLQQGDAIPAAPRAT
CCWGAFTPGTGMGGISLPPEHPLSAAADGTRASRDQWTQSEVGT

>gi568815588f:28577898_28782398|GENSCAN_predicted_CDS_5|1215_bp
atgatgatggaagcagaagtcagagaacaaatctttgctgctggctttgaagacagagga
aggaactatgaggaggcagcctttagaagcaagaaaagactaagaaatggatgtttccct
gtgatggttaatactgagtgtcaacttgattggactgaaggatgcaaagtattgttcctg
gatccaggacccgaagccaagtttccctgcacatctcactccctggccacaaatgtggtc
tgtctcagccttgttgccttgagaggaaaagatgaggaaaattcctgttctgtccagctc
atcccctctgatgacttcaagaaccccagttattcaagcatttcacaaacatctttgttc
tgtgatcgagtgctggcaaatgaaatacaagcagatacgttgaaaagtaccttcaggaag
tctccaaaaagaagatcacaacacctcgccagcaagggaacaaaactgggggagaatgag
tttgacgaattgacagaagtaggcttcagaagaaacatggatgaagttggaaaccatcat
tctcagcaaactaacacaggaacagaaaaccaaacaccgcatgttctcactcataagtgt
gagttgacaatgagaacacatgaacacagggagggtaacatcacacactggggcctgtcg
aggtctctcccagatcctcaggcagctgctgactgccaccacaacagacccctcacctcc
actctgatggcaaaactgccccctgccaatccctctccaccccccttaaaacctcaccta
tcgcagcagctgacatcagccagttgttggtatactgacttgctgcacatcgggcaatgt
acttcttatggtcataggcattcttggatcagctacacccttttccctcttggaagtgaa
cttaactgcaagcaccaagagtcctctgtttctccatgctgtaagttttacctggtccag
tgcctacatgagtgtggggcccccccagagagaggcacctctgcggaaatgaacattgac
ctcacgtggctgagtcttctgcagcaaggggatgccatccctgcggctcctagagccacg
tgttgctggggcgcatttacacctggcaccgggatgggcgggatttcactcccgcccgag
caccctctttcggctgctgctgatgggacgcgtgcatcaagagatcagtggacccagagc
gaggtggggacgtga