GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:44:07 Sequence gi568815595f:123109218_123371719 : 262502 bp : 45.28% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1727 1787 61 1 1 92 76 69 0.267 4.41 1.02 Intr + 7014 7081 68 1 2 54 97 60 0.187 2.12 1.03 Intr + 13440 13493 54 2 0 44 96 46 0.387 0.18 1.04 Intr + 14849 14940 92 1 2 119 42 148 0.846 12.09 1.05 Intr + 15055 15126 72 1 0 116 100 118 0.994 14.32 1.06 Intr + 21263 21399 137 2 2 99 84 158 0.999 16.71 1.07 Intr + 36305 36501 197 0 2 95 -14 140 0.317 3.63 1.08 Intr + 36882 37042 161 2 2 75 106 198 0.808 19.09 1.09 Intr + 37156 37274 119 1 2 52 31 174 0.949 8.21 1.10 Intr + 41040 41147 108 1 0 74 89 165 0.716 15.46 1.11 Intr + 45754 45824 71 2 2 102 116 82 0.990 11.30 1.12 Intr + 52104 52238 135 2 0 76 98 271 0.998 27.66 1.13 Intr + 54346 54478 133 0 1 104 33 38 0.590 0.12 1.14 Intr + 55253 55338 86 0 2 121 74 65 0.656 7.94 1.15 Intr + 78383 78496 114 1 0 34 77 71 0.015 1.24 1.16 Intr + 115945 116065 121 0 1 46 99 51 0.008 2.07 1.17 Intr + 136682 136797 116 0 2 99 84 36 0.450 4.47 1.18 Term + 157140 157172 33 2 0 122 54 38 0.201 1.39 1.19 PlyA + 160047 160052 6 1.05 2.05 PlyA - 161322 161317 6 1.05 2.04 Term - 165471 165430 42 0 0 112 55 42 0.713 0.46 2.03 Intr - 169335 169284 52 2 1 97 82 -7 0.115 -1.49 2.02 Intr - 169666 169506 161 1 2 51 89 111 0.185 6.29 2.01 Init - 172146 172069 78 0 0 52 94 54 0.879 3.46 2.00 Prom - 172234 172195 40 -2.26 3.29 PlyA - 174739 174734 6 1.05 3.28 Term - 175519 175391 129 1 0 139 43 255 0.998 24.28 3.27 Intr - 175815 175730 86 1 2 61 76 92 0.315 4.84 3.26 Intr - 176200 176170 31 1 1 75 97 5 0.267 -2.20 3.25 Intr - 177592 177468 125 2 2 134 109 296 0.719 36.60 3.24 Intr - 180773 180533 241 2 1 85 109 608 0.966 59.72 3.23 Intr - 182159 181878 282 2 0 77 28 725 0.814 63.02 3.22 Intr - 182505 182185 321 0 0 107 -3 115 0.356 0.36 3.21 Intr - 186999 186867 133 2 1 83 94 172 0.995 17.95 3.20 Intr - 188165 188136 30 1 0 108 89 43 0.923 3.65 3.19 Intr - 191078 190903 176 2 2 101 22 575 0.997 50.94 3.18 Intr - 194002 193838 165 1 0 118 69 254 0.999 26.66 3.17 Intr - 194945 194850 96 2 0 113 64 161 0.793 16.41 3.16 Intr - 205105 205018 88 0 1 133 113 63 0.993 13.27 3.15 Intr - 208900 208803 98 1 2 96 81 145 0.900 13.41 3.14 Intr - 210601 210457 145 0 1 129 87 201 0.994 24.38 3.13 Intr - 211554 211532 23 0 2 134 89 49 0.996 5.94 3.12 Intr - 216245 216105 141 2 0 120 101 236 0.998 28.65 3.11 Intr - 218542 218401 142 0 1 107 25 319 0.999 27.86 3.10 Intr - 219585 219427 159 2 0 103 96 148 0.975 16.20 3.09 Intr - 221491 221279 213 0 0 96 53 61 0.643 1.23 3.08 Intr - 221799 221672 128 0 2 62 99 64 0.977 4.38 3.07 Intr - 223458 223347 112 2 1 81 73 276 0.190 25.78 3.06 Intr - 238686 238565 122 0 2 79 75 162 0.375 13.29 3.05 Intr - 240999 240784 216 0 0 34 44 117 0.438 0.60 3.04 Intr - 243364 243215 150 1 0 92 109 136 0.997 16.46 3.03 Intr - 243608 243457 152 0 2 0 59 117 0.579 -0.12 3.02 Intr - 246233 246173 61 2 1 86 81 29 0.054 0.31 3.01 Init - 260236 260174 63 1 0 92 110 36 0.687 5.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:123109218_123371719|GENSCAN_predicted_peptide_1|625_aa DFRRLLKKEEKPLLIMFYAPWCSMCKRMMPHFQKAATQLRGHAFSKEPFDGHIFQQGTSV RVLAGMNVYSSEFENIKEEYSVRGFPTICYFEKGRFLFQYDNYGSTAEDIVEWLKNPQPP QPQVPETPWADEGGSVYHLTDEDFDQFVKEHSSVLVMFHAPWCGHCKKMKPEFEKAAEAL HGEADVSFLSFPLTVLFERITDRWNHYDFPLQALLQLPWSVEASSTSSSGVLAAVDATVN KALAERFHISEFPTLKYFKNGEKYAVPVLRTKKKFLEWMQKSQDGSDFLNPKTTVHMLIT CLDQVTAGSPSGRSFIDALNQPTWEEQQTSVLHLVGDNFRETLKKKKHTLVMFYAPWCPH CKKVIPHFTATADAFKDDRKIACAAVDCVKDKNQDLCQQEAVKGYPTFHYYHYGKFAEKY DSDRTHPLPWQWPLDFTQGLDMKDRSIHLDIIETQLLSPFLLGTMNILQEVFGFQELFCA GHGKVMLPGSEAGFLHRNGCLAGSVNAPPRAVPALCSCIPSPRIALISQQASRRTQINLS DMQTEIKLRPPYQISMCELGSANGVTSAFSVDCKGAAHQRLEPATLSGIVGFILSLLCGA LNLIRGFHAIESLLQVLEVGAYMIG >gi568815595f:123109218_123371719|GENSCAN_predicted_CDS_1|1878_bp gacttcagacggctcctgaagaaggaagagaagccgctcctgatcatgttttatgccccc tggtgcagcatgtgcaagaggatgatgccgcatttccagaaggctgcgactcagctgcga ggccacgccttctctaaagagccctttgatggtcacatcttccagcaagggacctcagtg agggtgctggccgggatgaatgtctactcctctgaatttgaaaacatcaaggaggagtac agcgtgcgcggcttccccaccatctgctattttgagaaaggacggttcttgttccagtat gacaactatgggtccacagctgaggacattgtggagtggctgaagaatccgcagccgcca cagccccaggtccctgagactccctgggcagatgagggcggctccgtttatcacctgacc gatgaagactttgaccagtttgtgaaggaacactcctctgtcctcgtcatgttccacgcc ccatggtgtggccactgtaagaaaatgaagccggagtttgagaaggcagcagaagccctc catggagaagcggatgtaagcttcctttccttccccctcaccgttctctttgaaagaatc actgacaggtggaatcattatgactttcccctgcaagccctgctgcagctcccgtggtcc gtggaggccagcagtacctctagctctggtgtccttgcagctgtcgatgccactgtcaac aaggccctggcagaaagattccacatctcagagtttcctacgttgaagtattttaagaat ggagagaaatacgcagtgcctgtgctcaggacaaagaagaagtttctcgagtggatgcaa aagagtcaagacggatctgacttcctcaaccctaaaaccaccgttcatatgctgatcacg tgtcttgatcaggtcactgctggcagtccttctggacgcagcttcatcgacgcactaaac cagcccacgtgggaagagcagcagacaagcgtgttgcacctggtgggggacaacttccgg gagaccctgaagaagaagaaacacaccttggtcatgttctacgccccttggtgcccacac tgtaagaaggtcattccgcactttactgctactgctgatgccttcaaagatgaccgaaag attgcctgtgccgctgttgactgtgtcaaagacaagaaccaagacctgtgccagcaggag gcggtcaagggctaccccactttccactactaccactatgggaagttcgcagaaaagtat gacagcgaccgcacacacccacttccttggcagtggcccctggacttcacccaaggattg gatatgaaagacagaagcatacatttagacatcattgaaacacaattattgagtcccttc ctgttggggaccatgaacatcttacaagaggtgtttggcttccaggagctgttctgcgcc gggcacgggaaggtgatgcttccaggttcagaagctggttttctccaccgcaacggttgc ttggctgggtcagtcaacgcacctccaagagctgtgcctgccctgtgctcctgcatcccc agcccacggattgccctgatctcccagcaagcgtccagacgcacacagataaatctttct gacatgcagacggaaatcaagctgaggcctccttatcaaatttccatgtgcgaactgggg tcagccaatggagtcacatcagcattttctgttgactgtaaaggtgctgctcaccagcga ctggaaccagcaactctgtcagggattgtaggatttatccttagtcttttatgtggagct ctgaatttaattcgaggctttcatgctatagaaagtctcctgcaggttcttgaagtagga gcttacatgattggttga >gi568815595f:123109218_123371719|GENSCAN_predicted_peptide_2|110_aa MWPPAPKKVDIIKDFNVPAERCHFTEHLHDTAAEGCGCLPTPLIFPSTRVHTHEGPQSVG TRAAAIHEQIVDQIRMFQVRRTLRWAKHLAYSRHLANSEGSYYCSYATKL >gi568815595f:123109218_123371719|GENSCAN_predicted_CDS_2|333_bp atgtggcctcctgctccaaagaaggtagacatcattaaggacttcaatgtgcctgcagaa agatgtcattttactgagcatctgcatgatactgcagctgaaggctgtggctgcctcccc acacccctcatcttcccatcaacgcgtgtgcacactcacgagggccctcagagtgtgggc acaagggctgctgccatccatgagcaaattgtagatcagattcgaatgttccaagtcagg aggactttaaggtgggcaaagcacctggcatacagcaggcatttggcaaatagtgagggc agctactactgctcctatgctacaaaactatga >gi568815595f:123109218_123371719|GENSCAN_predicted_peptide_3|1275_aa MAVSTTLIFHLILMWAWAFLQIQPAMDGKYVQLPYRRVLIQGDSVSLEDISHDKPAKLLA LRTSESPIDYARLLTVVGRVWGALSICLLAPGLVSNVLIFSCTNIVGVCTHYPAEVSQRQ AFQETRECIQARLHSQRENQQQRVGEGMPAARIGPPPGKRETLKEGEEQRRRRVMPAEEL VHNSKKDPRKCGGLCVLLYLEWSVDFISVLLGQQERLLLSVLPRHVAMEMKADINAKQED MMFHKIYIQKHDNVSILFADIEGFTSLASQCTAQELVMTLNELFARFDKLAAENHCLRIK ILGDCYYCVSGLPEARADHAHCCVEMGMDMIEAISVPVPYSSLGSFSQRKCVGGEGSADS RSGGEAGSHVRILCRVRPLHFRGLMFPTDTQPGIELLVPLLPRCERLVREVTGVNVNMRV GIHSGRVHCGVLGLRKWQFDVWSNDVTLANHMEAGGKAGRIHITKATLNYLNGDYEVEPG CGGERNAYLKEHSIETFLILRCTQKRKEEKAMIAKMNRQRTNSIGHNPPHWGAERPFYNH LGGNQVSKEMKRMGFEDPKDKNAQESANPEDEVDEFLGRAIDARSIDRLRSEHVRKFLLT FREPDLEKKYSKQVDDRFGAYVACASLVFLFICFVQITIVPHSIFMLSFYLTCSLLLTLV VFVSVIYSCVKTLSRKIVRSKMNSTLVGVFTITLVFLAAFVNMFTCNSRDLLGCLAQEHN ISASQVNACHVAESAVNYSLGDEQGFCGSPWPNCNFPEYFTYSVLLSLLACSVFLQISCI GKLVLMLAIELIYVLIVEVPGVTLFDNADLLVTANAIDFFNNGTSQCPEHATKVALKVVT PIIISVFVLALYLHAQQVESTARLDFLWKLQPGSGLWQTGSLQPHLQACSCSGLPAAGSA PPERLAERVRGSEMSFLIVKVLLFGENLEVETAERGGETAVLANILGTSWHLRDWCFFSR PRRGGGGGADIWAHIGIWATEEKEEMEELQAYNRRLLHNILPKDVAAHFLARERRNDELY YQSCECVAVMFASIANFSEFYVELEANNEGVECLRLLNEIIADFDEVHSGAWAWGDPGLA SFIQIISEDRFRQLEKIKTIGSTYMAASGLNDSTYDKVGKTHIKALADFAMKLMDQMKYI NEHSFNNFQMKIGLNIGPVVAGVIGARKPQYDIWGNTVNVASRMDSTGVPDRIQQWDSEI SEAIGQSVHGPQATLKQHTQAARHSAWHEQVLRVTTDMYQVLAANTYQLECRGVVKVKGK GEMMTYFLNGGPPLS >gi568815595f:123109218_123371719|GENSCAN_predicted_CDS_3|3828_bp atggcagttagtaccacactcatctttcatttaattctgatgtgggcctgggcattcctg cagattcaaccagccatggatggaaaatatgtacagttgccatatcggagggttttgatc caaggtgattcagtgtctttagaggacatatcccatgacaagccagccaaacttctggct ttgaggacatcagagagcccgatcgattatgctcggctgctcacagtggttggccgcgtg tggggtgccctcagcatttgtttgctagcgccaggtcttgtctccaatgttctcattttc tcctgcaccaacatcgtgggtgtctgcacccactatccggctgaggtctcccagagacag gctttccaggagacccgagagtgcatccaggcgcggctccactcgcagcgggagaaccag cagcagcgagtgggggaggggatgccagcagccagaattggacctcctcctgggaaaagg gagaccctgaaggagggtgaggagcagcggaggaggagggtgatgccagctgaggagctt gtgcacaacagcaagaaagaccccaggaagtgtgggggcctctgtgttctgctctatctg gagtggagtgtggatttcatttcagtgctgctgggacagcaggaacggctcctgctgtct gtccttccccgtcatgttgccatggagatgaaagcagacatcaacgccaagcaggaggat atgatgttccataagatttacatccagaaacatgacaacgtgagcatcctgtttgctgac atcgagggcttcaccagcctggcgtcccagtgcactgcacaggaactggtcatgaccctc aacgagctcttcgcccgctttgacaagctggccgcagagaatcactgtttacgtattaag atccttggggattgttattactgcgtctcggggctgcctgaagcaagggctgaccacgcc cactgctgtgtggagatgggcatggacatgatcgaggccatctcggttcctgttccctac agcagcctgggcagcttttcccagaggaagtgtgtgggaggggagggctctgcagactcc aggagtggaggagaggctggcagccatgtgaggatcctgtgccgtgtgaggcccttgcac ttccggggcctcatgtttcctacagacacccagccaggcatcgaactgctggttccactt cttcccagatgtgaaaggttggtccgggaggtgacaggggtgaacgtgaacatgcgtgtg ggaattcacagcgggcgagtacactgcggtgtccttggtctcaggaagtggcagttcgac gtctggtctaacgatgtcacgctagccaaccacatggaggctggcggcaaggcaggacgc atccacatcaccaaggctacactcaactacctgaatggggactacgaggtggagccaggc tgtgggggcgagcgcaacgcctacctcaaggagcacagtatcgagaccttcctcatcctg cgctgcacccagaagcggaaagaagagaaggccatgatcgccaagatgaaccgccagaga accaactccatcgggcacaacccaccacactggggggctgagcgccccttctacaaccac ctgggtggcaaccaggtgtccaaggagatgaagcggatgggctttgaagaccccaaggac aagaacgcccaggagagtgcgaaccctgaggatgaagtggatgagtttctgggccgtgcc attgacgccaggagcattgataggcttcggtctgagcacgtccgcaagttcctcctgacc ttcagggagcctgacttagagaagaagtactccaagcaggtagacgaccgatttggtgcc tatgtggcgtgtgcctcgctcgtcttcctcttcatctgctttgtccagatcaccatcgtg ccccactccatattcatgctcagcttctacctgacctgttccctgctgctgaccttggtg gtgtttgtgtctgtgatctactcctgcgtaaagaccctctccaggaagatcgtgcggtcc aagatgaacagcaccctggttggggtgttcaccatcaccctggtgttcctggcggctttt gtcaacatgttcacgtgcaactccagggacctgctgggctgcttggcacaggagcacaac atcagcgcgagccaggtcaacgcgtgtcacgtggcggagtcggccgtcaactacagcctg ggcgatgagcagggcttctgtggcagcccctggcccaactgcaacttccccgagtacttc acctacagcgtgctgctcagcctgctggcctgctccgtgttcctgcagatcagctgcatc gggaagctggtgctcatgctggccatcgagctcatctacgtgctcatcgtggaggtgcca ggtgtcacgctcttcgacaacgccgacctgctggtcaccgccaacgccatagacttcttc aacaacgggacctcccagtgccctgagcatgcaaccaaggtggcattgaaggtggtgacg cccatcatcatctcagtctttgtgctggccctgtacctgcacgcccagcaggtggagtcc actgcccgcctcgacttcctctggaaactgcagcctggttctggcctttggcagacagga tccttgcagcctcacctccaggcttgttcctgctcggggctccctgctgcaggctcagca cccccagagaggctggcagagagagttcgaggctcagaaatgtcttttctgattgtcaaa gtcctgctctttggagaaaacctggaagtagaaacagcagagagaggaggagagacagcg gtacttgccaacatcctgggcacaagttggcatttgcgtgactggtgctttttctccagg ccacggagaggaggaggtggaggggccgacatctgggctcacattggcatctgggccaca gaggagaaagaggagatggaggagctgcaggcctacaaccggcggctgctgcacaacatc ctgcccaaggacgtggccgctcacttcctggcccgcgagcggcgcaatgatgagctctac tatcagtcctgtgagtgtgtggcggtcatgttcgcctccatcgccaacttctccgagttc tacgttgagctggaggccaacaacgagggtgtcgagtgcctgcggctactcaatgagatc atcgctgactttgatgaggtgcacagcggggcctgggcttggggtgacccaggtttggcc tccttcatccagatcatcagcgaggatcggttccggcagctggagaagatcaagaccatc ggcagcacctacatggctgcctccggcctcaacgactctacctacgacaaggtgggcaag acccacatcaaggcactggccgactttgccatgaagctgatggaccagatgaagtacatc aatgagcactccttcaacaacttccagatgaagatcgggctcaacatcggccccgtggtg gccggggtgataggggcacgaaagcctcagtacgacatctggggcaataccgtgaacgtg gccagccgcatggacagcaccggtgtacccgaccgcatccagcaatgggatagtgaaatt agtgaggccatcgggcagtcagtccatggaccacaggccacactgaaacagcacacgcaa gctgcccgtcacagtgcctggcacgagcaggtgctgagggtcaccacagacatgtaccag gtgctggctgccaacacgtaccagctggagtgccggggcgtggtcaaggtcaagggcaaa ggcgagatgatgacctacttcctcaatggagggcccccgctcagttag