GENSCAN 1.0 Date run: 4-Nov-116 Time: 13:43:10 Sequence gi568815586f:68548725_68757184 : 208460 bp : 40.13% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 65 60 6 1.05 1.01 Sngl - 5158 4271 888 1 0 60 43 948 0.989 83.32 1.00 Prom - 5891 5852 40 -7.85 2.00 Prom + 10208 10247 40 -6.85 2.01 Init + 11814 12024 211 2 1 91 55 218 0.856 17.79 2.02 Intr + 25790 25844 55 0 1 64 66 65 0.001 -0.98 2.03 Intr + 41152 41326 175 1 1 54 97 91 0.028 5.62 2.04 Intr + 47158 47263 106 0 1 78 36 53 0.272 -2.03 2.05 Term + 47547 47860 314 1 2 78 36 163 0.582 4.28 2.06 PlyA + 48525 48530 6 1.05 3.03 PlyA - 51066 51061 6 1.05 3.02 Term - 62386 62168 219 1 0 74 45 213 0.602 11.66 3.01 Init - 69241 69239 3 1 0 108 81 0 0.290 1.35 3.00 Prom - 71805 71766 40 -6.05 4.02 PlyA - 71873 71868 6 1.05 4.01 Sngl - 78616 78146 471 1 0 54 55 431 0.368 30.27 4.00 Prom - 84268 84229 40 -3.35 5.03 PlyA - 84278 84273 6 1.05 5.02 Term - 94217 93790 428 0 2 -5 38 482 0.431 28.38 5.01 Init - 94342 94285 58 1 1 49 -3 78 0.465 -3.68 5.00 Prom - 95536 95497 40 -6.75 6.00 Prom + 97320 97359 40 -7.65 6.01 Init + 100001 100057 57 1 0 69 111 1 0.668 1.98 6.02 Intr + 101676 101744 69 2 0 67 71 80 0.819 2.66 6.03 Intr + 105388 105528 141 0 0 108 69 125 0.966 12.23 6.04 Intr + 107582 107725 144 1 0 58 64 99 0.098 4.06 6.05 Intr + 137803 138025 223 0 1 68 13 132 0.363 0.48 6.06 Intr + 141907 142022 116 2 2 46 80 86 0.453 2.85 6.07 Intr + 143244 143388 145 2 1 88 58 170 0.864 12.93 6.08 Intr + 148095 148198 104 1 2 75 115 61 0.359 6.47 6.09 Intr + 152002 152129 128 0 2 113 63 19 0.097 0.46 6.10 Intr + 156921 157233 313 0 1 65 87 299 0.057 22.76 6.11 Intr + 157361 157696 336 1 0 14 38 529 0.962 35.29 6.12 Intr + 157818 157976 159 2 0 -11 64 219 0.932 8.86 6.13 Intr + 160514 160585 72 1 0 64 80 51 0.586 0.58 6.14 Intr + 161281 161369 89 0 2 111 111 37 0.940 6.15 6.15 Intr + 165006 165084 79 0 1 80 27 68 0.862 -1.47 6.16 Intr + 166903 167016 114 0 0 138 78 97 0.876 13.42 6.17 Term + 170617 170760 144 0 0 95 36 78 0.446 0.23 6.18 PlyA + 170939 170944 6 1.05 7.00 Prom + 171703 171742 40 -5.55 7.01 Init + 173105 173262 158 1 2 43 87 183 0.918 13.13 7.02 Intr + 173380 173428 49 1 1 46 94 66 0.976 0.76 7.03 Intr + 182386 182536 151 0 1 91 98 69 0.998 7.01 7.04 Intr + 184728 184888 161 1 2 63 77 228 0.995 17.99 7.05 Intr + 185984 186109 126 1 0 46 100 99 0.991 6.86 7.06 Intr + 186507 186620 114 2 0 105 99 94 0.999 11.92 7.07 Intr + 193089 193256 168 2 0 97 119 33 0.918 6.52 7.08 Term + 193631 193738 108 1 0 75 29 64 0.886 -3.27 7.09 PlyA + 193785 193790 6 1.05 8.00 Prom + 194254 194293 40 -7.85 8.01 Init + 197654 198055 402 1 0 88 105 411 0.916 37.47 8.02 Intr + 199206 199320 115 2 1 103 19 50 0.533 -1.30 8.03 Intr + 201758 201969 212 0 2 4 99 112 0.476 1.61 8.04 Intr + 203308 203466 159 0 0 70 111 183 0.971 18.06 8.05 Intr + 203539 203604 66 0 0 61 116 59 0.669 4.18 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 28955 29086 132 1 0 65 105 129 0.979 12.49 S.002 Term + 38686 38813 128 2 2 57 43 145 0.917 4.36 S.003 Term + 107582 107797 216 1 0 58 40 169 0.887 5.26 S.004 Init + 156910 157233 324 0 0 86 87 305 0.898 27.58 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:68548725_68757184|GENSCAN_predicted_peptide_1|295_aa MSGALDVLQMREEDVLKFLAAGTHLGGTNLDFQMEQYIYERKSDGIYIINLKRTWEKLLL AAHAIVAIENPADVSVISSRNTGQTAVLKFAAATGATPIAGRFTPGTFTNQIQAAFREPR LLVVTDPRADHQPLTEASYVNLPTIALCNTDSPLRYVDIAIPCNNKGAYSVGLMWWMLAR EVLCMLGTISCEHPWEVMPDLYFYRDPEEIEKEEQAAAEKAVTKEEFQGEWTAPAPEFTA IQPEVADWSEGVQVPSVPIQQFPTEDWSSQPAMEDWSAAPTAQATEWVGATTDWS >gi568815586f:68548725_68757184|GENSCAN_predicted_CDS_1|888_bp atgtccggagcccttgatgtcctgcaaatgagggaggaggatgtccttaagttccttgca gcaggaacccacttaggtggcaccaatcttgacttccagatggaacagtacatctatgaa aggaaaagtgatggcatctatatcataaatctgaagaggacctgggagaagcttctgctg gcagctcatgctattgttgccattgaaaaccctgctgatgtcagtgttatatcctccagg aatactggccagacggctgtgctgaagtttgctgctgccactggagccactccaattgct gggcgcttcactcctggaaccttcactaaccagatccaggcagccttccgggagccacgg cttcttgtggttactgaccccagggctgaccaccagcctctcacggaggcatcttatgtt aacctacctaccattgcgctgtgtaacacagattctcctctgcgctatgtggacattgcc atcccatgcaacaacaagggagcttactcagtgggtttgatgtggtggatgctggctcgg gaagttctgtgcatgcttggcaccatttcctgtgaacacccatgggaggtcatgcctgat ctgtacttctacagagatcctgaagagattgaaaaagaagagcaggctgctgctgaaaag gcagtgaccaaggaggaatttcagggtgaatggactgctccagctcctgagttcactgct attcagcctgaggttgcagactggtctgaaggtgtacaggtgccctctgtgcctattcag caattccctactgaagactggagctctcagcctgccatggaagactggtctgcagctccc actgctcaggccactgaatgggtaggagcaaccactgactggtcttaa >gi568815586f:68548725_68757184|GENSCAN_predicted_peptide_2|286_aa MDKSRGDIAADRWRTLQEEGSHLAGKLPAGKNAGGCLSWEEFPSPLQAQKPGNEWEAAGV GQASQGKWPRNSWAPQVQAQMRWESMDPSPRGLGRKNGFHGPGPVPCCPEQPHSLGTLLP ASWLLRLPQWLTGNQVKLTLLLQRVQAGPQSAPSLLRFQDLRPDLLTSLSPRALSTSDTH LSSSSFFPLAERRLEKKEPPLHSIRDSSSSKVISERAAAPKVKLAPVIVTSGLFERRSMG FEAKSTYIQNSFERRSMSFEAKSTYIQNSGSLHTQASSEPLSLPLK >gi568815586f:68548725_68757184|GENSCAN_predicted_CDS_2|861_bp atggacaagtctagaggcgatatagctgcagacaggtggaggacactgcaagaggaaggc tcccatttagcaggaaaactgccagctgggaagaatgctggtggctgcctgagttgggaa gaattcccaagccctctgcaagctcagaagccaggaaatgaatgggaagcagctggagtg ggccaagcaagtcaaggcaaatggcccagaaattcatgggcccctcaagtccaagcacag atgcgttgggaatctatggatcccagtcccagaggcctaggaaggaagaatggtttccat gggccaggtccagtgccctgctgccctgagcagcctcacagcctcgggacactgcttcct gcatcctggctgctccggctcccacagtggctaacagggaaccaggttaagctcacactg ctgcttcagagggtgcaagccggtccacaatctgccccgtcactgctcaggtttcaggac ctccggccagacctgctgacctctctgagccctagggcactgagcactagtgacactcac ttgtcaagcagcagcttctttcccttagcagagcgcaggctagagaagaaagagccacca ctacattccatcagagacagcagtagttccaaagtgatttcagaacgtgcagcagctcca aaagtgaagctagctcctgtgatagtaacaagtggcttgtttgagagaaggagcatgggc tttgaagccaaatcaacctatattcaaaatagctttgagagaaggagcatgagctttgaa gccaaatcgacctatattcaaaattctggctctctgcacacacaggcaagttctgagcct ctttctttgcctttaaaatag >gi568815586f:68548725_68757184|GENSCAN_predicted_peptide_3|73_aa MPAPGPARPPPRLPSAATHSLCEPLTLSPGPCSQRRPALPPGRYSRRHGGPAAAAPLRLV YTPESGRGLAMSP >gi568815586f:68548725_68757184|GENSCAN_predicted_CDS_3|222_bp atgccggcgcccggccccgcgcggccgccgccgcggcttccctcagcggccacgcactca ctctgcgaacctctcacgctgtcaccgggtccctgcagccagcgtcgccccgcgctcccc ccgggtcgctactctaggcgccacggcggtcctgccgctgccgcgccgctccggctggtt tacacgcctgaatctgggcgaggtttggcgatgtcgccttga >gi568815586f:68548725_68757184|GENSCAN_predicted_peptide_4|156_aa MGGPSVQWLQTAASLVVYAACVLYCMGCSKGPWRQSFQMDVHVSDLALPQCRFQTGMRGA FGKPQGTVARVHTGQVIISIHTKLQNKEHVIEALRRAKFKFSGRQKIHISKKWGFTKFNA NEFEDMVTEKRLIPDGCRVKYISNRGPVDKWRALHS >gi568815586f:68548725_68757184|GENSCAN_predicted_CDS_4|471_bp atgggggggcccagtgtgcaatggctgcaaacagcagcttccttggtagtgtatgcagcc tgtgtgttgtattgtatgggttgctctaagggaccctggagacagtcctttcagatggat gttcatgtttctgaccttgcactaccccagtgtaggttccaaacaggcatgcgaggtgcc tttgggaagccccagggcactgtggccagggttcacactggccaagttatcatctccatt cacaccaagctgcagaacaaggagcatgtgattgaggccctgcgcagggccaagttcaag ttttctggccgccagaagatccacatctcaaagaagtggggcttcaccaagttcaatgcc aatgagtttgaagacatggtgactgagaagcggctcatcccagatggctgtcgggtcaag tacatttccaatcgtggccctgtggacaagtggcgggccctgcactcatga >gi568815586f:68548725_68757184|GENSCAN_predicted_peptide_5|161_aa MNNLQEVKGGCSGPWAARVEIIPQNQKAIASFLKSWNETLTSRLAALPENPPVIDWAYYK ANVAKAGLVDDFKKFNALKVPVPEDKYTAQVDAEEKDVKSCAEWVSLSKARIVEYEKQME KMKNLIPFDQMTTEDLNEAFPETKLDEKKYPYWPHQLIENL >gi568815586f:68548725_68757184|GENSCAN_predicted_CDS_5|486_bp atgaacaaccttcaagaagttaaaggcggctgcagtggaccgtgggcagccagggtcgag atcataccccagaaccaaaaggccattgctagtttcctgaaatcctggaatgagaccctc acctccaggttggctgctttacctgagaatccaccggttatcgactgggcttactacaag gccaacgtggccaaggccggtttggtggatgactttaagaagtttaatgccctgaaggtt cccgtgccagaggataaatatactgcgcaggtggatgccgaagaaaaagatgtgaaatct tgtgctgagtgggtgtctctctcaaaggccaggattgtagaatatgagaaacagatggag aagatgaagaacttaattccatttgatcagatgaccactgaggacttgaacgaagctttc ccagaaaccaaattagacgagaaaaagtatccttattggcctcaccaactaatcgagaat ttataa >gi568815586f:68548725_68757184|GENSCAN_predicted_peptide_6|810_aa MREYKLVVLGSGGVGKSALTVQFVQGIFVEKYDPTIEDSYRKEQFTAMRDLYMKNGQGFA LVYSITAQSTFNDLQDLREQILRVKDTDDVPMILVGNKCDLEDERVVGKEQGQNLARQWN NCAFLESSAKSKINVNEERTPSKEQIAVMENSCKNCRGLQNKGLSVGREIEIPEVWGRAH ASEFQEQALSDREYQEMTSHFSRLSTYFILPITPTSRSLLRQPDISCILGTGGKSPRLTQ SSGFFGNLSMVTNLDDSNWAAAFSSQRSGLFTNTEPHSITEDVTISAVMLREDDPGEAAS MSMFSDFLQSFLKHSSSTVFDLVEEYENICGSQVNILSKIVSRATPGLQKFSKTASMLWL LQQEMVTWRLLASLYRVTQKSFKVSNSGPRAFSSHSYMCGPSACISSSSFSRMGSSSFRG SLGGDFGRASGRGGITPVTVNQSLLSPLNLEVDPNIQAVCTQVKEQIKTLNNKFASFIDK LYTLGQEKLKLEVELDNMQGLVEDFKNKYEDDIKKHTEMENEFVLIKKDMDEAYKNKVEL ESHLEGLTEEISFLRQLYEEEIRELQSQISDTSVVLSIDNSRSLDMNSIIAETEISKMNR NISRLQDEIEGLKGQRASLEAAIADAEQHGELAVKDANTKLSKLEAVNASEKTVVEALFQ RDSLVRQSQLVVDWLESIAKDEIGEFSDNIEFYAKSVYWENTLHTLKQRQLTSYVGSVRP LVTELDPDAPIRQKMPLDDLDREDEVRLLKYLFTLIRAGMTEEAQRLCKRCGQAWRAATL EGWKLYHDPNVNGGILVDFILTVEDKGISL >gi568815586f:68548725_68757184|GENSCAN_predicted_CDS_6|2433_bp atgcgtgagtataagctagtcgttcttggctcaggaggcgttggaaagtctgctttgact gtacaatttgttcaaggaatttttgtagaaaaatacgatcctacgatagaagattcttat agaaaggagcaatttacagcaatgagggatttatacatgaaaaatggacaaggatttgca ttagtttattccatcacagcacagtccacatttaacgatttacaagacctgagagaacag attcttcgagttaaagacactgatgatgttccaatgattcttgttggtaataagtgtgac ttggaagatgaaagagttgtagggaaggaacaaggtcaaaatctagcaagacaatggaac aactgtgcattcttagaatcttctgcaaaatcaaaaataaatgttaatgaggaaagaaca ccaagtaaagaacaaatagcagttatggaaaatagttgcaaaaactgcagggggctacag aacaaaggactgagtgtaggcagagagatagagatccccgaggtgtggggcagagcccac gcatctgaattccaagaacaggcgctttctgacagagaataccaggaaatgacttcacat ttttccagactatcaacatacttcattcttcccattaccccaacaagccgaagcttacta aggcagccagatatttcctgcattcttggaacaggagggaagtcgccccgacttacgcag tcttcagggttctttggaaatctctccatggttactaatctggatgacagtaactgggca gctgcattttcatcacagcgttccgggctgttcacaaacacagagccccacagtataaca gaagatgtaactatcagtgctgttatgttacgtgaggatgatcctggagaagctgcatcc atgagtatgttttctgatttcctgcagtcttttctgaagcactcttcgagtacagttttt gatcttgtggaagagtatgaaaacatctgtggtagtcaggtgaatatactgagtaaaata gtgagtcgagcaacacctggacttcaaaaattttcaaaaacagccagtatgctctggctt cttcaacaggagatggtcacatggaggctgctggcttctttgtatagagtgacccagaag tccttcaaggtgtccaactctggcccaagggccttcagtagccattcctacatgtgtggg cccagtgcctgcatcagctcctcgagcttctcccgaatgggcagcagcagtttccggggt agcctgggtggagactttggcagggccagtggtaggggaggcatcaccccagtcacggtc aaccagagcctgctgagcccccttaacctggaggtggaccccaacatccaagctgtgtgc acccaggtgaaggagcagatcaagaccctcaacaacaagtttgcctccttcattgacaag ctgtacactctgggccaagagaagctgaagctggaggtggaacttgataacatgcagggg ctggtggaggacttcaagaacaagtacgaggatgatatcaaaaagcatacagagatggag aatgaatttgtcctcatcaagaaggatatggatgaagcttacaagaacaaggtagagctg gagtctcacctggaagggcttactgaagagatcagcttcctcaggcaactgtatgaagag gagatccgggagctgcagtcccagatctcggacacatctgtggtgctgtccattgacaac agtcgctccctggacatgaacagcatcatcgctgagactgagatctccaagatgaacagg aacatcagtcggctccaggatgagattgagggcctcaaaggccagagggcttccctggag gccgccatcgcagatgctgagcagcatggggagctggcagttaaggatgccaacaccaag ctgtccaagctggaggctgttaatgccagtgaaaaaacagttgtggaagcgttatttcag agggattcacttgttcgacaaagtcagctggtggtagattggttagagagtattgccaaa gatgaaattggagaattttctgataatattgagttttatgcaaaatcagtatattgggaa aatactctgcataccttaaaacaacggcagctgacttcttacgttggaagtgttcgtccg cttgtcactgaattggaccctgatgctcccataagacagaaaatgccccttgatgatctg gatagagaagatgaagttagattactcaaatatctctttactctaatccgtgctggaatg acagaagaggcacaacgactctgtaaacgctgtggtcaagcatggagagctgcaacactt gaaggctggaaactgtaccatgaccctaatgttaatggaggtattttagtagattttatt ctgacagttgaggacaaaggcatttcgctctaa >gi568815586f:68548725_68757184|GENSCAN_predicted_peptide_7|344_aa MGKKLLPVCDTWEDTVWAYFRVMVDSLVEQEIQTSVATLDETEELPREYLGANWTLEKVF EELQATDKKLLIREKHTNLIAFYTCHLPQDLAVAQYALFLESVTEFEQRHHCLELAKEAA SKKHEAAKEVFVKIPQDSIAEIYNQCEEQGMESPLPAEDDNAIREHLCIRAYLEAHETFN EWFKHMNSVPQKPALIPQPTFTEKVAHEHKEKKYEMDFGIWKGHLDALTADVKEKMYNVL LFVDGGWMVDVREDAKEDHERTHQMVLLRKLCLPMLCFLLHTILHSTGQYQECLQLADMV SSERHKLYLVFSKEELRKLLQKLRESSLMLLDQGLDPLGYEIQL >gi568815586f:68548725_68757184|GENSCAN_predicted_CDS_7|1035_bp atggggaaaaagctgcttcctgtctgtgacacctgggaagacacagtttgggcctacttc cgggtgatggtggacagtctggtagaacaggagatccagacatcagtagcaactctggat gaaactgaagaactccctagagaatatctgggagcaaactggacgttagaaaaggttttt gaggaacttcaagctactgacaaaaagcttttaataagagagaaacatacaaatcttata gcattttatacctgtcatttgcctcaagacctagctgttgcccagtatgcattatttttg gaaagtgttacagaatttgaacagcgccaccattgcctggagttggctaaagaagcagca tcaaaaaagcacgaagctgcaaaagaagtatttgtgaaaattcctcaggattctatagca gaaatctataatcagtgcgaggaacaaggaatggaaagtccacttcctgctgaagatgat aatgctatccgagaacatttgtgcatcagagcttatttggaagcccatgaaacctttaat gagtggtttaagcatatgaattcagttccacaaaaacctgctttgatacctcaaccaact tttactgagaaagtggctcatgaacacaaagaaaagaaatatgaaatggattttggtatt tggaaagggcatttggatgccctaactgctgatgtgaaggagaaaatgtataacgtcttg ttgtttgttgatggagggtggatggtggatgttagagaggatgccaaagaagaccatgaa agaacacatcaaatggtcttactgagaaagctttgtctgccaatgttgtgttttctgctt catacgatattgcacagtactggtcagtatcaggaatgcctacagttagcagatatggta tcctctgagcgccacaaactgtacctggtattttctaaggaagagctaaggaagttgctg cagaagctcagagagtcctctctaatgctcctagaccagggacttgacccattagggtat gaaattcagttatag >gi568815586f:68548725_68757184|GENSCAN_predicted_peptide_8|318_aa MALLVDRVRGHWRIAAGLLFNLLVSICIVFLNKWIYVYHGFPNMSLTLVHFVVTWLGLYI CQKLDIFAPKSLPPSRLLLLALSFCGFVVFTNLSLQNNTIGTYQLAKAMTTPVIIAIQTF CYQKTFSTRIQLTLIPITLGVILNSYYDVKFNFLGMVFAALGVLVTSLYQVVGSAKRARR KGHRESGHVHGDHSALRRALEGWWQEPGCAALRDSGGLRAPAVPGSPVAKTRGCLKRLQK ERQWVGAKQHELQVNSMQLLYYQAPMSSAMLLVAVPFFEPVFGEGGIFGPWSVSALFSEH YVRSILTFIVIITIKGGK >gi568815586f:68548725_68757184|GENSCAN_predicted_CDS_8|954_bp atggcattgctggtggaccgagtgcggggccactggcgaatcgccgccgggctcctgttc aacctgctggtgtccatctgcattgtgttcctcaacaaatggatttatgtgtaccacggc ttccccaacatgagcctgaccctggtgcacttcgtggtcacctggctgggcttgtatatc tgccagaagctggacatctttgcccccaaaagtctgccgccctccaggctcctcctcctg gccctcagcttctgtggctttgtggtcttcactaacctttctctgcagaacaacaccata ggcacctatcagctggccaaggccatgaccacgccggtgatcatagccatccagaccttc tgctaccagaaaaccttctccaccagaatccagctcacgctgattcctataactttaggt gtaatcctaaattcttattacgatgtgaagtttaatttccttggaatggtgtttgctgct cttggtgttttagttacatccctttatcaagtggttggttctgcaaagagggctcgcagg aagggccacagggagtctggacatgtccatggagatcatagtgcccttaggagagcatta gaggggtggtggcaggagccaggctgtgctgcattgagggacagtggaggcctgagagcc cctgctgttccaggcagtcctgtggcgaagacacgcggctgcctgaagaggctgcaaaag gaaaggcagtgggtaggagccaaacagcatgaattacaagtgaactcaatgcagctgctg tactaccaggctccgatgtcatctgccatgttgctggttgctgtgcccttctttgagcca gtgtttggagaaggaggaatatttggtccctggtcagtttctgctttgttttcagagcac tatgtccgatccattctcacatttatcgtcataatcaccataaaaggaggtaag