GENSCAN 1.0	Date run: 3-Nov-116	Time: 00:42:51

Sequence gi568815595f:5087806_5315915 : 228110 bp : 42.82% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   8037   8416  380  1  2   40   43   308 0.604  15.47
 1.02 PlyA +   9270   9275    6                               1.05

 2.10 PlyA -  11007  11002    6                               1.05
 2.09 Term -  17713  17688   26  2  2  125   43    14 0.002  -1.99
 2.08 Intr -  34883  34381  503  1  2   34   56   285 0.011  11.60
 2.07 Intr -  54037  53781  257  1  2   12  -14   281 0.003   5.62
 2.06 Intr -  54859  54569  291  1  0    7   55   156 0.122   0.61
 2.05 Intr -  57170  56847  324  2  0   55   62   254 0.197  14.55
 2.04 Intr -  57952  57227  726  1  0    6   71   259 0.623   6.70
 2.03 Intr -  58575  58178  398  1  2   51   64   208 0.026   8.07
 2.02 Intr -  60792  60577  216  1  0   30   47   152 0.002   2.85
 2.01 Init -  90056  89996   61  2  1   60   94    79 0.048   7.16
 2.00 Prom -  91864  91825   40                              -3.05

 3.00 Prom +  97341  97380   40                              -7.55
 3.01 Init + 100001 100509  509  1  2   88   86   491 0.989  41.37
 3.02 Intr + 107404 107476   73  1  1   82   92    41 0.480   2.39
 3.03 Intr + 111787 111890  104  0  2   94   93   191 0.961  18.25
 3.04 Intr + 113978 114119  142  2  1   64   63   135 0.809   8.03
 3.05 Intr + 115161 115344  184  2  1  106   36   121 0.995   7.14
 3.06 Intr + 117262 117436  175  2  1  107   75   112 0.999   9.88
 3.07 Intr + 119348 119468  121  2  1   86   68   173 0.999  14.68
 3.08 Intr + 120288 120458  171  2  0  115   79   146 0.999  15.62
 3.09 Intr + 125514 125717  204  2  0   91   92   182 0.999  17.37
 3.10 Intr + 128024 128049   26  1  2  114   63    60 0.276   2.01
 3.11 Intr + 129485 129554   70  2  1   76   97    53 0.264   3.27
 3.12 Intr + 131625 131652   28  2  1  110   59    32 0.859  -0.73
 3.13 Intr + 133250 133375  126  0  0  100   22   135 0.688   7.83
 3.14 Intr + 140628 140771  144  1  0   71   60   105 0.717   5.33
 3.15 Term + 143274 143377  104  1  2   85   41   121 0.922   4.56
 3.16 PlyA + 146606 146611    6                               1.05

 4.00 Prom + 153836 153875   40                              -7.35
 4.01 Init + 155616 155769  154  2  1   97   -4   199 0.703  11.79
 4.02 Term + 156101 156744  644  0  2   -6   44   254 0.430   4.94
 4.03 PlyA + 157468 157473    6                               1.05

 5.00 Prom + 165614 165653   40                              -6.75
 5.01 Init + 166519 166601   83  0  2  111   44   118 0.951  10.19
 5.02 Intr + 167003 167079   77  2  2   50   98    38 0.054  -0.76
 5.03 Intr + 181359 181492  134  1  2   92   12   114 0.020   3.64
 5.04 Intr + 183239 183333   95  1  2   41   92    74 0.019   0.94
 5.05 Term + 186395 186632  238  2  1   65   52   152 0.155   3.96
 5.06 PlyA + 188580 188585    6                               1.05

 6.00 Prom + 192888 192927   40                              -6.15
 6.01 Init + 193607 193615    9  1  0  114  115     9 0.041   6.26
 6.02 Intr + 198252 198311   60  2  0  100   87    48 0.042   3.91
 6.03 Intr + 211474 211562   89  0  2   60   62   120 0.045   4.45
 6.04 Intr + 214787 214901  115  2  1   58   69    54 0.152   0.03
 6.05 Term + 220456 220575  120  0  0   43   42   129 0.069   1.29
 6.06 PlyA + 222008 222013    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  28033  27824  210  1  0   57   34   227 0.937   9.14
S.002 Init +  34661  34783  123  1  0   94   89   189 0.910  17.91
S.003 Init -  58783  58178  606  1  0   13   64   293 0.923  14.75
S.004 Term +  90859  90978  120  0  0  103   31   103 0.819   3.59
S.005 Term + 100744 100882  139  1  1   73   36   158 0.951   5.55

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815595f:5087806_5315915|GENSCAN_predicted_peptide_1|126_aa
XYFWPQSSVKNPMLSTAFYLDALNPFSPSTSAEQTACLGLVNAKGVLDGCVYPESFQPGF
ELDLATDENLGEHPQTTPLRVRALLPRPPAGRSPCSAPRDHQLEPKGGAQARSPRGAQAG
ARVSSA

>gi568815595f:5087806_5315915|GENSCAN_predicted_CDS_1|381_bp
ncttatttttggccccagagcagcgtgaagaacccgatgttatcaacagcgttttatttg
gacgctctgaatccgttttcccctagcacatcggcagagcagaccgcctgcctcggcctt
gtcaacgccaagggtgttttggacggatgtgtctatcctgaaagcttccagcctggcttt
gaacttgacttggcaacggatgaaaacctcggcgagcatccccagacaaccccgctgcgc
gtcagagcgctgctcccgcgtcctcctgccggcagaagcccctgcagtgcgccccgcgac
caccagctggaaccaaaaggcggcgcgcaggcgcggagcccacggggagcccaggctggg
gcgcgtgtcagctccgcctga

>gi568815595f:5087806_5315915|GENSCAN_predicted_peptide_2|933_aa
MGTTVHNTSKATLIGTEWEKEEEKEPNKKPSPSERPWNHLSRMPPPYISQNSGQENQGAA
GGLEEDRPGDHRRAKPTAPLNPYPNLRKELEQFKKPDGSYRLVQNLRTINQIVQTCHPVV
PNPYTLLSKIPYEHKWFSVVDLKDAFWACPLDLRSRDLFAFEWENPTTGRKQQYHCTVLQ
EGFTEAPNLFGQVLEKVLEEFQPSRETQLLQYVNDLLISGERRAKIKSYAQKTKIVYLKL
LEEEPDSLQWSPEEIQAVKELKQALITAPALVLPSLEKPFHLFVTVDQGMPLAVLTQTWG
GKRQHIAFVSKLLDPVSLGWPECVQAVAATALLVEESRKLSFGGALIVSTPHQVRNILNQ
KAGRWLLDSQILKYEAILLEKKDLVVTTNTCLNPASFLYKGENKEPSDHNCLDITEYKIK
AKPDLREAPQHDGIRLFVDGLSQVDRWQREIMVMLSLMEINTPYVRKALKLLEGQEGTIF
TNSKYAYGVVHTFGKIWTVQGLINSRGKELVHGELVKQVLESLQLPAEVAIVRINGHQKG
NTIEAVGNKLADKAAMQASLEEEIRLFSLIPDIPKVKIIEVKDLRQTLEIETGCRDVNAW
VKWIKFSVQAFNKSDCYACATGGPQAQVVPFPLGWDTDPEEMDCMLALYQDKDAWGNKTC
KESVTALSHIAEDTLKGMASQLDATSPMAWKNRLALDVIPEKGGVSVMLGGKRYTFIPNN
TAPDGTITKALQGLTTLANEQAENAGINNPFTGWLNGCPIAPRGAPTQALEKRRAGLLAA
EPRASEWAGAHRDDIDEGGLARVLQPHERQLHLFLPEERTEPVQQAGDERQHDGGRERER
TGGRTSGGVDDFHTSGPPPGPSNGSLVPRRDTGGHRRAAAARPRKPSGSYDSPYPPRPSR
RRRRHPRQERRGQSGQARPPLASPPAEIPMPSA

>gi568815595f:5087806_5315915|GENSCAN_predicted_CDS_2|2802_bp
atggggacaacagtgcacaatacgagcaaagccacactcattggaactgaatgggagaaa
gaagaagaaaaagaacctaataaaaagccctcacccagtgaaaggccctggaaccaccta
tcacgcatgccccctccatacatctcacaaaatagtggacaggaaaatcaaggggcagca
ggagggttagaggaagatagacctggagaccacaggagagccaaaccaactgctccttta
aatccttatccaaatttaagaaaagaattagaacagttcaaaaagcctgatgggtcgtat
agattggtgcaaaatctaaggactataaatcaaattgtccagacctgccaccctgtggtg
cccaacccctacaccctccttagtaagataccctatgaacataagtggttcagtgtggta
gatctaaaagatgcattctgggcatgtcccctagaccttaggagcagggacctctttgcc
tttgaatgggaaaatcctacaactgggagaaaacaacaataccactgtactgtgctgcaa
gaaggcttcacggaagccccaaacttatttggtcaagtcttagaaaaagtcctggaggaa
ttccaaccttccagggaaacccagttgttacaatatgtaaatgatcttttaatttctggg
gagaggagagccaagattaagtcatatgctcaaaagacaaagattgtgtatctgaaacta
ctagaggaggaacctgattccttgcaatggtccccagaagaaattcaggcagtgaaagaa
ctaaaacaggccctcattacagccccggccctagtcctcccatctttagagaaaccattc
catctgtttgtaacagtagaccagggcatgccccttgcggtgctcactcaaacctgggga
gggaagaggcaacatattgcttttgtctccaagcttcttgatcctgtctctctcgggtgg
cccgaatgtgtacaagcagtagctgccacagccctgctggtagaggagagtcgaaagcta
tcctttggtggggccctaatagtaagcaccccacaccaggtcaggaatatattaaatcaa
aaagccgggagatggttactggattctcagattctaaaatatgaagccatattactagaa
aaaaaagatttggtcgtaacaacaaatacttgcctgaatccagccagtttcctatacaaa
ggagagaacaaagagccatcagaccataactgcttagatatcactgaatacaaaatcaaa
gctaaacctgaccttagggaagctccacaacatgatgggataaggctgtttgtggatggg
ttgtcccaagtggatagatggcaaagagagataatggttatgctatcactgatggaaata
aacactccttatgtgagaaaggcgctaaagctccttgaaggccaagaaggcactatattc
actaattctaaatatgcctatggggtggtacacacttttggaaaaatctggacagtgcag
ggcctaataaatagcaggggaaaagaattggtacatggggaactggtcaaacaggtttta
gaaagcctccagcttccagcagaggttgccatagttcgcataaatggtcatcagaaaggt
aacactatagaagctgtaggaaacaagcttgcagataaagctgctatgcaagcctccctg
gaggaagaaattagactatttagcctgatcccagacatccctaaggtaaaaataattgag
gtaaaggatttaaggcaaaccttagaaattgagacagggtgcagggatgtaaacgcctgg
gtcaaatggatcaaattttcggtacaagccttcaacaagagtgactgctatgcgtgtgct
acgggaggacctcaggcacaggtggttccatttcccctaggatgggatactgatcctgaa
gaaatggattgcatgttggctctataccaggacaaggacgcatggggaaacaagacttgt
aaagagtctgtcactgctctttcccacattgcagaggacaccctcaaagggatggctagc
cagttagatgccaccagcccaatggcctggaaaaacaggcttgcactagacgtaatacca
gaaaaagggggtgtaagtgttatgctgggtgggaaacgttatactttcattcccaacaat
actgccccagatgggaccatcacaaaagctttacaaggactgacaactctagccaacgaa
caggcagaaaatgctggaattaacaacccatttacgggttggctaaatggttgtcccatc
gcgccgaggggggccccaactcaggccttggagaagcgccgggccggactcctggctgcg
gagccccgggcgagtgagtgggcgggcgctcaccgcgatgacattgacgaaggtggtctt
gcccgagtactgcagccccacgagcgtcagctccatctcttccttccagaagagcgaacg
gaaccagtccagcaggcgggagatgagcgccagcatgatggcggccgggagcgagaacgg
acgggaggacggacgagcggcggcgtcgacgacttccacacgagcgggcctccgcccgga
ccctccaacggctccttggtgccccggcgcgacacgggcggacaccggcgggcggcagca
gccagacccaggaagccgagcggatcatatgactcgccctaccccccacgcccgtcccgg
cgccgccgtcggcacccacggcaagagcggcgcggccaatccggccaggcccgcccgccg
ctggcatcgccacctgctgagattccaatgcctagtgcataa

>gi568815595f:5087806_5315915|GENSCAN_predicted_peptide_3|726_aa
MQWRALVLGLVLLRLGLHGVLWLVFGLGPSMGFYQRFPLSFGFQRLRSPDGPASPTSGPV
GRPGGVSGPSWLQPPGTGAAQSPRKAPRRPGPGMCGPANWGYVLGGRGRGPDEYEKRYSG
AFPPQLRAQMRDLARGMFVFGYDNYMAHAFPQDELNPIHCRGRGPDRGDPSNLNINDVLG
NYSLTLVDALDTLAIMGNSSEFQKAVKLVINTVSFDKDSTVQVFEATIRIITDSKQPFGD
MTIKDYDNELLYMAHDLAVRLLPAFENTKTGIPYPRVNLKTGVPPDTNNETCTAGAGSLL
VEFGILSRLLGDSTFEWVARRAVKALWNLRSNDTGLLGNVVNIQTGHWVGKQSGLGAGLD
SFYEYLLKSYILFGEKEDLEMFNAAYQSIQNYLRRGREACNEGEGDPPLYVNVNMFSGQL
MNTWIDSLQAFFPGLQVLIGDVEDAICLHAFYYAIWKRYGALPERYNWQLQAPDVLFYPL
RPELVESTYLLYQLFDEDNPVHKSGTRYMFTTEGHIVSVDEHLRELPWKEFFSEEGGQDQ
GGKSVHRPKPHELKVINSSSNCNRVPDERSAAGLTLSLASMKRHGWKRILVKKGRDPLGN
RGGCLSSGSTRPFAICGASGAQQSCMNAVVSVGWSMGTLGHTVLVLTALHRTLVSDLSGS
GTTLWSLGAVAGMIAKQLLDGLDRIYIELSPGGCQLPPLQEGLIKRMQERERTCEALSPW
PAVGVR

>gi568815595f:5087806_5315915|GENSCAN_predicted_CDS_3|2181_bp
atgcaatggcgagcgctcgtcctggggctggtgctcctccggcttggcctccatggagta
ttgtggctcgtcttcgggctggggcccagcatgggcttctaccagcgctttccgctcagc
ttcggcttccagcgtctgaggagccccgacggccccgcgtcgcccacctcggggcccgtg
ggccggcctgggggggtatccgggccgtcgtggctgcagccgccggggaccggggcagcg
cagagcccgcgcaaggctccgcggcgtcctgggccggggatgtgcggcccagccaactgg
ggctacgtgctgggcggccggggccgcggcccggacgagtacgagaagcgctacagcggc
gccttccctccgcagctgcgtgcccagatgcgcgacctggcacggggcatgttcgtcttt
ggctacgacaactacatggctcacgccttcccccaggacgagctcaaccccatccactgc
cgcggccgtgggcccgaccgcggggacccttcaaatctgaacatcaatgatgtactaggg
aactactcattgactcttgttgatgcattggatacacttgcaataatgggaaattcatcc
gagttccagaaagccgtcaagttagtgatcaacacagtttcatttgacaaagattccacc
gtccaagtctttgaggccacgataagaataataactgactccaagcagccctttggtgac
atgacaattaaggactatgataatgagttgttatacatggcccatgacctggcggtgcgg
ctcctccctgcttttgaaaacaccaagacagggattccatatcctcgggtgaatctaaag
acaggagttcctcctgacaccaataatgagacatgcacagcgggagccggttccctcctg
gtggaatttgggattctgagtcgactcctgggggactccacatttgagtgggtggccaga
cgagcagtgaaagccctttggaacctccggagcaatgatacaggattactaggcaatgtc
gtgaacattcagacgggccactgggttggaaagcagagtggcctgggtgccgggctggac
tccttctatgaatacctcttgaaatcttacattctctttggagaaaaagaagacctagaa
atgtttaatgctgcatatcagagtattcagaactacttaagaagagggcgggaagcctgc
aatgaaggagaaggagaccctccactctatgtcaacgtgaacatgttcagtgggcagctg
atgaacacctggattgactctctgcaggcctttttccctggactgcaggtgctgatagga
gatgtggaagatgccatctgccttcatgccttctactatgccatatggaaacgatatggt
gccctccctgagagatataactggcagctgcaggcccctgacgttctcttctacccactg
agaccagagttagtggaatccacatatctcctctaccagctgtttgatgaagacaatcca
gtacacaagtctggaaccagatacatgttcacaacagagggacacattgtatctgtggat
gagcatcttcgggaattgccatggaaggaattcttctctgaagagggagggcaggaccaa
gggggaaagtctgtgcacaggccgaaacctcatgagttaaaagtcatcaactccagctcc
aactgcaatcgtgtacctgatgagaggagtgcagcagggttgactcttagtctggcttcc
atgaagcgtcatgggtggaaacgcattctagtaaaaaagggcagggaccccttgggaaat
cgaggaggctgcctcagtagtgggagcaccaggccctttgccatctgtggagcaagtgga
gcccagcagtcctgtatgaacgctgtagtcagtgtcgggtggagtatggggactctgggc
cacacggtgctggttctgacagcacttcacagaacacttgtttctgacctatcgggttct
ggcacaaccctatggtccttgggggcagtggctggcatgattgctaaacaattgctggat
ggcttggacagaatctacattgaactgtctcccggggggtgtcaattgcctcctctgcaa
gaagggctgattaagaggatgcaggaaagggagagaacctgtgaggctctcagcccgtgg
ccagcagtgggtgttcggtaa

>gi568815595f:5087806_5315915|GENSCAN_predicted_peptide_4|265_aa
MAAVQSSFGSASEGDCGERGRGRPWEEGEGDRGERERESESENESESDSDYAWYWYQNKY
MDQWNRTEASEITPHIYNHLIFDKPDKNKQWGKDSQFNQWCWENWLAMCRKLTLDHFLTP
YTKINSKWMKDLNITPKTIKTLEENLGNTIQDRGMGKDFMTKTPKAMATKAKIEKSDLIK
LKNFCTAKETIIRVNRQPAEWEKIFAIYPSDKGLISRIYKELKQIHKKKKNNPIKKWAKD
MNRRFSKEGIYVANKHEKKLIITGH

>gi568815595f:5087806_5315915|GENSCAN_predicted_CDS_4|798_bp
atggcagcagtacagtccagctttggctcggcatcagagggagactgtggagagagaggg
agagggagaccgtgggaagagggagagggagaccgtggggagagggagagggaaagcgag
agcgagaacgagagcgagagcgacagcgactatgcatggtactggtaccaaaacaaatat
atggaccaatggaacagaacagaggcctcagaaataacaccacacatctacaaccatctg
atctttgacaaacctgacaaaaataagcagtggggaaaggattcccaatttaatcaatgg
tgttgggaaaactggttagccatgtgtagaaaactgacactggaccacttccttacacct
tatacaaaaattaactcaaaatggatgaaagacttaaacataacacctaaaaccataaaa
accctagaagaaaacctaggcaataccattcaggacagaggcatgggcaaagactttatg
actaaaacaccaaaagcaatggcaacaaaagccaaaattgagaaatcggatctaattaaa
ctaaagaacttctgcacagcaaaagaaactatcattagagtgaacagacaacctgcagaa
tgggagaaaatttttgcaatctatccatctgacaaagggctaatatccagaatctacaag
gaacttaaacaaattcacaagaaaaaaaaaaacaaccccattaaaaagtgggcaaaagat
atgaacagacgcttctcaaaagaaggcatttatgtggccaacaaacatgaaaaaaagctc
atcatcactggtcattag

>gi568815595f:5087806_5315915|GENSCAN_predicted_peptide_5|208_aa
MGLLSQLSNLPTPTEMQFLAKHSTANERSSGTPSVELFSLAISSSSSAELSGAESEGTIH
DMQGQWIWTNRVGYPESKANRPKFIGRIVRVCRDLSMLLWVDGVERLCFTLKSAGFDLCR
VFVILLRVRRWRKRAANQEMQAPLVAENSSWLTTSQEIRISVLQPHGTEFCQQLSELGSE
SIPRASGKEYNPANTLILTLWNLEQRTS

>gi568815595f:5087806_5315915|GENSCAN_predicted_CDS_5|627_bp
atggggctccttagtcaactttccaatctcccaactcccactgagatgcagtttctggca
aagcactcaaccgccaacgaaaggtcatcaggaactccatcagtggagttgttttctctg
gctatcagctcatctagcagtgctgagctctcaggagcagagagtgagggtactattcat
gacatgcaagggcaatggatatggaccaatagggttggatacccagaaagcaaagccaat
aggcccaaattcataggtaggattgtacgcgtatgtagggacctgtccatgttgctgtgg
gtggatggggttgagaggctgtgctttaccttaaagtctgcaggatttgatctgtgccgt
gtatttgtcatcctgctcagagtacgcagatggaggaaaagggccgccaaccaggaaatg
caagcacccctagttgctgagaacagctcctggctgacaaccagccaggaaataaggatc
tcagtcttacaaccacatggaactgaattctgccaacaactaagtgagcttggaagtgaa
tccatccctagggcctctggaaaggaatacaaccctgccaacaccttgattttgaccttg
tggaacctggagcagagaaccagttga

>gi568815595f:5087806_5315915|GENSCAN_predicted_peptide_6|130_aa
MPKPFPAVSAVSVSLLIWVDETKSSEMRLQYRDIKGLMSELVVLISGKIKASRVSLAHPC
KNSLRAEIMRQGGMSILLQRRNKIENIFMVQMCHDEVTEPLPARVPEYICDAGHPGHFCW
MCSLSKKYTK

>gi568815595f:5087806_5315915|GENSCAN_predicted_CDS_6|393_bp
atgcccaagccatttccagctgtctcagctgtgtcagtctccctgctgatctgggtggat
gaaactaaatctagtgaaatgcgtttgcaatatagagacatcaaaggcctgatgagtgag
ctggtggtgctgatcagcggaaagatcaaagcaagcagggtctcgctagcccacccgtgc
aaaaacagccttagagcagaaatcatgcgtcagggtggaatgtccatcttgcttcagaga
agaaacaaaatagaaaacatcttcatggtccagatgtgccacgacgaggtcacggaacct
ttgccggccagagtccctgagtacatctgtgatgcagggcaccctgggcatttctgctgg
atgtgtagtttaagcaagaaatataccaagtga