GENSCAN 1.0 Date run: 5-Nov-116 Time: 18:56:54 Sequence gi568815596r:151170696_151382948 : 212253 bp : 40.09% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 9410 9561 152 1 2 77 71 166 0.674 11.37 1.02 Term + 15416 15887 472 2 1 -38 48 347 0.574 11.52 1.03 PlyA + 15896 15901 6 1.05 2.00 Prom + 16660 16699 40 -5.95 2.01 Init + 21547 21619 73 0 1 62 81 21 0.533 0.08 2.02 Intr + 21785 21991 207 0 0 40 89 168 0.860 10.23 2.03 Term + 27958 28145 188 2 2 47 55 141 0.711 3.37 2.04 PlyA + 30249 30254 6 1.05 3.04 PlyA - 30948 30943 6 1.05 3.03 Term - 52457 52402 56 0 2 79 55 65 0.329 -0.96 3.02 Intr - 54825 54711 115 0 1 51 72 88 0.529 2.60 3.01 Init - 65194 65186 9 1 0 110 107 27 0.824 5.90 3.00 Prom - 65906 65867 40 -4.25 4.00 Prom + 67429 67468 40 -7.35 4.01 Init + 70455 70457 3 2 0 113 89 0 0.867 2.65 4.02 Term + 72559 72687 129 0 0 87 42 145 0.936 7.00 4.03 PlyA + 72810 72815 6 1.05 5.06 PlyA - 73294 73289 6 1.05 5.05 Term - 75171 75151 21 0 0 131 54 10 0.290 -0.77 5.04 Intr - 80969 80220 750 2 0 100 96 296 0.613 22.32 5.03 Intr - 82160 82060 101 0 2 46 116 35 0.794 1.01 5.02 Intr - 85048 84838 211 1 1 100 63 144 0.948 10.66 5.01 Init - 85468 85466 3 1 0 113 22 0 0.802 -4.05 5.00 Prom - 85805 85766 40 -6.25 6.00 Prom + 87009 87048 40 -8.75 6.01 Sngl + 90973 91350 378 0 0 115 44 301 0.890 24.31 6.02 PlyA + 94433 94438 6 1.05 7.08 PlyA - 95338 95333 6 1.05 7.07 Term - 100180 99998 183 1 0 78 38 196 0.819 10.16 7.06 Intr - 101037 100931 107 1 2 67 103 26 0.830 1.01 7.05 Intr - 104975 104789 187 2 1 58 63 193 0.980 12.14 7.04 Intr - 105169 105063 107 2 2 73 92 28 0.742 0.71 7.03 Intr - 108295 108133 163 1 1 35 86 94 0.767 2.53 7.02 Intr - 111348 111253 96 0 0 25 78 88 0.560 0.79 7.01 Init - 112253 112173 81 2 0 82 93 102 0.991 11.12 7.00 Prom - 117559 117520 40 -6.95 8.00 Prom + 122443 122482 40 -5.85 8.01 Init + 125528 125719 192 1 0 97 44 83 0.841 1.98 8.02 Intr + 126391 126652 262 0 1 -16 33 251 0.223 5.34 8.03 Intr + 132672 132731 60 1 0 82 96 38 0.583 1.79 8.04 Term + 134605 134771 167 2 2 -13 39 276 0.976 9.70 8.05 PlyA + 135561 135566 6 1.05 9.00 Prom + 136027 136066 40 -8.65 9.01 Init + 136999 137058 60 0 0 34 98 63 0.702 1.24 9.02 Intr + 139782 139806 25 2 1 136 31 25 0.438 -1.72 9.03 Intr + 140629 140774 146 2 2 28 81 150 0.730 7.38 9.04 Intr + 142147 142722 576 0 0 74 65 321 0.501 20.39 9.05 Intr + 144702 144974 273 2 0 64 81 85 0.662 2.21 9.06 Term + 145693 145905 213 0 0 65 42 141 0.817 3.45 9.07 PlyA + 146207 146212 6 1.05 10.00 Prom + 151407 151446 40 -5.85 10.01 Init + 151823 151839 17 1 2 50 116 9 0.011 -0.39 10.02 Intr + 174290 174437 148 2 1 64 60 105 0.284 4.62 10.03 Intr + 179181 179309 129 2 0 49 75 55 0.088 0.27 10.04 Intr + 184868 185014 147 1 0 148 94 6 0.188 6.71 10.05 Intr + 193175 193385 211 1 1 26 21 208 0.174 5.46 10.06 Intr + 195361 195522 162 2 0 114 78 105 0.996 11.13 10.07 Intr + 199325 199553 229 0 1 97 102 216 0.991 19.91 10.08 Intr + 202854 202894 41 0 2 89 110 72 0.969 6.45 10.09 Term + 208669 208838 170 2 2 73 42 111 0.854 2.06 10.10 PlyA + 209332 209337 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_1|207_aa MMWARLRRGRFCLATLLLPFLGSQTTRQQSNKDLLHFLRALYGSTRLPFNRTQTSTLLTP IPLRWFRTLRTPACAHHGATVQQLEGRWRLADSKGFDAYMKKLGVGISLRNMGAMAKPDC IITCDGKNLTIKTESTLKTTQFSCTLGEKFEETTAVGRKTQTVCSFTDGALVPHQEWDGK ENTITRKLKDAISGGLCHEQCHLYSDL >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_1|624_bp atgatgtgggcaaggctgaggagaggaaggttctgcttagccacactcctcctgcccttc ctgggctcccagacaactcgccagcagagtaataaagatcttctccactttctgcgagct ttatatggaagtactcgtctaccctttaatagaacacaaaccagtacgctgctcacgccg atcccgctccgctggttccgcacgctccgcacaccagcctgcgcgcaccatggggccacc gttcagcagctggaaggaagatggcgcctggcggacagcaaaggctttgatgcatacatg aagaaactaggagtgggaatatctttgcgcaatatgggcgcaatggccaaaccagactgt atcatcacttgtgatggcaaaaacctcaccataaaaactgagagcactttgaaaacaaca cagttttcttgtaccctgggagagaagtttgaagaaaccacagctgttggcagaaaaact cagactgtctgcagctttacagatggtgcattggttccgcatcaggagtgggatgggaag gaaaacacaataacaagaaaattgaaagatgcaatcagtggtggattgtgtcacgaacaa tgtcacctgtactcggatctatga >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_2|155_aa MTSLMINKYPIQVQYQGYFYGIKQVVKGSDSWSLVDEHSKEWRRHETVKQVRCVPVANLS ESSDKTHSYATSSRKQRYYLQIGSKEQKKHEIHGKCTFTQEHRLTEHDINEKDKQSLTPP PNFSNPVGKKRCTSSHREEIISSQHSKQGVAHADD >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_2|468_bp atgacttcccttatgattaataaatatcccattcaggtgcaatatcagggctatttctat ggcattaagcaagtggttaaggggtctgatagttggtccttggtggatgaacacagtaag gaatggagaaggcatgagactgtgaagcaggttcgctgtgtaccagttgccaacttgtct gagtctagtgacaaaacacattcatatgcaacaagttccaggaagcagaggtattactta cagataggcagcaaggaacaaaagaagcatgagatccatgggaagtgtaccttcacacag gaacacagacttacagaacatgatattaatgaaaaggacaagcagagtttaacgcctcct cccaacttctcaaatccagtgggaaagaaaaggtgtacttctagccacagagaggagatc atttcttctcaacattccaagcagggtgtggcccacgctgatgactga >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_3|59_aa MALREQECIWPYIAAANDLASVRKVNWRTKPAHDGKITEKQATLAADWMVPTHIVGGSS >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_3|180_bp atggcgctgagggaacaagaatgcatatggccttatattgctgctgcaaatgatcttgcc tccgtgagaaaagttaactggaggacaaagccagcacatgatgggaaaattacagagaaa caggccacactggcagccgattggatggtgcctactcacattgtgggtgggtcttcctga >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_4|43_aa MKVQPSAMATGLGQHDDKDCTKMTDNWRQFVKQDGFPWQTGQV >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_4|132_bp atgaaagtacagccatcagcgatggccacaggacttggtcagcatgatgacaaggactgc actaaaatgacagacaactggagacaatttgttaaacaggatgggttcccgtggcaaact gggcaagtttaa >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_5|361_aa MASVLNVKESKAPERTVVVAGLPVDLFSDQLLAVLVKSHFQDIKNEGGDVEDVIYPTRTK GVAYVIFKEKKVAENVIRQKKHWLARKTRHAELTVSLRVSHFGDKIFSSVNAILDLSVFG KEVTLETLVKDLKKKIPSLSFSPLKPNGRISVEGSFLAVKRLRESLLARACSLLEKDRNF TSEERKWNRQNPQRNLQRSNNSLASVRTLVPETARSGEMLVLDTDVFLYLKHKCGSYEST LKKFHILSQEKVDGEITTICLKSIQVGSQPNNAKHVKELIEEWSHALYLKLRKETFILEG KENREKRMIKRACEQLSSRYLEVLINLYRTHIDIIGSSSDTYLFKKGVMKLIGQKIQEII N >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_5|1086_bp atggcatcagttttgaatgtcaaggaatccaaagctcctgaaagaacggttgtagttgct ggtcttccagttgacctttttagtgatcaattattggccgtattagtgaagagccacttc caagacattaagaatgagggcggagatgttgaagatgtgatatatccgacaagaaccaag ggagttgcatatgtaatattcaaagaaaaaaaagttgcagagaatgtcatcagacaaaag aaacactggctagcaaggaagactagacatgctgaactcacagtctctctcagagtctct cattttggtgacaagatcttcagctctgtaaatgccatccttgatctttctgtttttgga aaagaagttactctagaaactctggtaaaagacctgaaaaaaaaaatcccgagtttaagc ttcagtcctttgaaacccaatggaagaatctccgtggaaggatcatttctggctgtcaag aggctcagagaatctttgctagcaagagcatgttctctcttagaaaaagacagaaatttt accagtgaggagagaaagtggaatagacaaaatccccagaggaatctacagagaagtaat aactctttggcatcagtcaggaccttagtacctgagactgctagaagtggagaaatgctt gtgcttgacacagatgtttttctttacctgaaacacaagtgtggatcttatgaaagcaca ctgaaaaaattccacattctgagtcaggagaaagtggatggtgaaatcaccacaatttgt ctaaaaagcattcaagttggttctcagccaaacaatgcaaaacatgtaaaagagctcatt gaggaatggtcacatgctctttacttaaagcttagaaaagagacatttattttggaagga aaggaaaatagagagaaaagaatgatcaaaagggcatgtgaacaattaagttcgagatac cttgaagtcctgattaacctttataggacacacattgacattataggatcttcttctgac acttacctgtttaaaaaaggggtcatgaaattaatagggcaaaagatccaggagataatc aactga >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_6|125_aa MATVQARTCTAKPKAKATCHGGGETRPQRPQKAHNQRNAGDGERSEQAGFGFVFCSSSLG GYEEKGGPSTRSGCSATSWPLSGSSDSGVLLPAPAQTPSVVLQLVEALWREAERIPRQEI WEFSG >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_6|378_bp atggcgacggtacaggcacggacctgcaccgcaaaacccaaagcaaaagcaacttgccat ggcggaggggagacgcgccctcagcggccgcagaaagcccacaaccagcggaacgcaggc gatggggagaggagcgagcaggcaggttttggtttcgttttttgttccagctcccttgga ggctacgaagaaaagggcggtccttccacccgatccggctgttctgcgacctcgtggcct ctgagtgggagctcggactcaggagtgctgttgccagcgcctgcccagacgccctccgta gttttgcaacttgtggaagcactctggagagaggccgagaggattcctcgacaggaaatt tgggaattctctgggtga >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_7|307_aa MEADKDDTQQILKEHSPDEFIKDEQNKGLIDEITKKNIQLKKEIQKLETELQEATKEFQI KEDIPETKMKFLSVETPENDSQLSNISCSFQVSSKVPYEIQKGQALITFEKEEVAQNVVS MSKHHVQIKDVNLEVTAKPVPLNSGVRFQVYVEVSKMKINVTEIPDTLREDQMRDKLELS FSKSRNGGGEVDRVDYDRQSGSAVITFVEIGVADKILKKKEYPLYINQTCHRVTVSPYTE IHLKKYQIFSGTSKRTVLLTGMEGIQMDEEIVEDLINIHFQRAKNGGGEVDVVKCSLGQP HIAYFEE >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_7|924_bp atggaagctgataaagatgacacacaacaaattcttaaggagcattcgccagatgaattt ataaaagatgaacaaaataagggactaattgatgaaattacaaagaaaaatattcaacta aagaaggagatccaaaagcttgaaacggagttacaagaggctaccaaagaattccagatt aaagaggatattcctgaaacaaagatgaaattcttatcagttgaaactcctgagaatgac agccagttgtcaaatatctcctgttcgtttcaagtgagctcgaaagttccttatgagata caaaaaggacaagcacttatcacctttgaaaaagaagaagttgctcaaaatgtggtaagc atgagtaaacatcatgtacagataaaagatgtaaatctggaggttacggccaagccagtt ccattaaattcaggagtcagattccaggtttatgtagaagtttctaaaatgaaaatcaat gttactgaaattcctgacacattgcgtgaagatcaaatgagagacaaactagagctgagc ttttcaaagtcccgaaatggaggcggagaggtggaccgcgtggactatgacagacagtcc gggagtgcagtcatcacgtttgtggagattggagtggctgacaagattttgaaaaagaaa gaataccctctttatataaatcaaacctgccatagagttactgtttctccatacacagaa atacacttgaaaaagtatcagatattttcaggaacatctaagaggacagtgcttctgaca ggaatggaaggcattcaaatggatgaagaaattgtggaggatttaattaacattcacttt caacgggcaaagaatggaggtggagaagtagatgtggtcaagtgttctctaggtcaacct cacatagcatactttgaagaatag >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_8|226_aa MPEPPPRLRGFLWGPSLPDERRPLLHAPSPMDHPRAEECGRTAPDWQAAPPAAPVQDPAG LLSLKVCSFTPEASETTNPPGGTNNSRCAALRAVTLTAKVCSFTPEPVRPRTHQKEETPN TSEHQKEQTQDMPPLRAVTLTARVRSFVLEVTNLQHGLKTKPPEGGSWGKEVRKEEDEKR GEEGRRRKKKKKEEEEEEEEEEEKGGGGGEGGGEGEGEGEGEAEDS >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_8|681_bp atgcctgagcctcccccgcgcctccgtgggttcctgtggggcccgagcctccccgacgag cgccgccccctgctccacgcacccagtcccatggaccacccaagggctgaggagtgcggg cgcacggcaccggactggcaggcagctccacctgcggcccccgtgcaggatccagctggg ctcctgagtctgaaggtctgcagcttcactcctgaagccagtgagaccacgaacccaccg ggaggaacgaacaactccagatgcgccgccttaagagctgtaacgctcaccgcaaaggtc tgcagcttcactcctgagccagtgagaccacgaacccaccagaaggaagaaactccaaac acatccgaacatcagaaggaacaaactcaggacatgccgcctttaagagctgtaacactc actgcgagggtccgcagcttcgttcttgaagtcactaatttacaacatggcctcaaaacc aagccaccggaaggaggtagttggggtaaggaggttagaaaagaggaagatgaaaagaga ggagaggaagggagaagaagaaagaagaagaaaaaagaagaagaggaagaagaggaggaa gaagaagaaaaaggaggaggaggaggggaagggggaggggaaggggaaggggaaggagaa ggggaagcagaagattcctga >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_9|430_aa MPNLAYAAPQYLPLPTFPLTAAPRQAKPEASGTIVDAELLVTLTVEGKSVPFLINMEATH STLPYFQGPVSLASKTVLYSLFVESPTITIVPGLDFNLAFHIILDTTPDPHDCISLIHLT FTPFLLISFFRVRYPDHTWLIDGSSIKPNHHSPAKAGYAIVSSTSIIEATALPPSTTSQQ VKLIALIRVLTLVKGLLINIYADPISCTTMLLYELKVFLTTQGSSIINASLIKTLLKAAL LPKTARIIHCKGHQKASDPITQDNAYAHKESLGRAARGTGKRPQGKGNLQLNCVTIPTEC KISWPELRGGSDSGVQTPKAGRHESPACFCSWETGSLQQVLSPARPLLETDLVLLGVGGK TIQMRRNQKNNSGNMTKQGSLTAPKNDTSSPAMNPNQEEIYDLPKKEFRKSVIKLIKEAP EKDEVQVKGI >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_9|1293_bp atgcccaacctggcttacgcagccccacagtacctgccactgcctacctttcccctcaca gctgctcctcgccaggccaagccagaagcctccgggaccatcgtggatgccgagcttctg gtaactcttacagtggagggtaagtccgtccccttcttaatcaacatggaggctactcac tccacattaccttattttcaagggcctgtttccctggcctccaaaactgttctatactca ctctttgttgagtctcccacaattaccattgttcctggcctggacttcaatctggccttc cacattattctggataccacacctgacccccatgactgtatctctctgatccacctgaca ttcactccatttctccttatttccttctttcgtgttcgttaccctgatcacacttggctt attgatggcagttccatcaagcctaatcaccactcaccagcaaaggcaggctatgctata gtatcttccacatctatcattgaggctactgctctgcccccctccactacctctcagcaa gtcaaactcattgccttaattcgggtcctcactcttgtaaagggactactcatcaatatt tatgctgaccccatatcctgcaccaccatgctgctttatgagctgaaagttttcctcact acacaagggtcctccatcattaatgcctctttaataaaaactcttctcaaggctgcttta cttccaaagacagctagaattattcactgcaaaggccatcaaaaggcatcagatcccatc actcaggacaatgcttatgctcataaggagtccttggggagggctgccagaggaactggg aaaagaccacagggaaaaggaaacctccagctgaactgtgtaacaattccaacagaatgc aaaatctcctggccagaactccggggagggagtgattctggtgtgcagactccaaaagca ggcagacacgaaagccctgcttgcttttgcagctgggagactggtagcctgcagcaagtt ctcagccctgctcgcccactgctggaaacagacttggtgctgttgggagtcgggggaaag actatccaaatgagaaggaaccagaaaaacaattctggtaatatgacaaagcaaggctct ttaacagccccaaaaaatgacactagctcaccagcaatgaatccaaaccaagaagaaatc tatgatttgcctaaaaaagaattcagaaaatcagtcattaaactaatcaaggaggcacca gagaaagatgaagtccaagttaagggaatttaa >gi568815596r:151170696_151382948|GENSCAN_predicted_peptide_10|417_aa MTTELSSYGYLGSENSALFNRVCTSYCEEGVESAALLGCDNSSSTGNTSFSSLLRLESSH LTAQNGKGQGNRVCDCAQEKENLCGNLPTISATILSLKDYPLHLFHMKTPFPLSFIECPS KSELTSLGIILYFLDDMEDEIFRHYAELRPQNFPCSVRRNNSAFMTSSDFAERAAGVYHR EARSGKYKLTYAEAKAVCEFEGGHLATYKQLEAARKIGFHVCAAGWMAKGRVGYPIVKPG PNCGFGKTGIIDYGIRLNRSERWDAYCYNPHAKECGGVFTDPKQIFKSPGFPNEYEDNQI CYWHIRLKYGQRIHLSFLDFDLEDDPGCLADYVEIYDSYDDVHGFVGRYCGDELPDDIIS TGNVMTLKFLSDASVTAGGFQIKYVAMDPVSKSSQGKNTSTTSTGNKNFLAGRFSHL >gi568815596r:151170696_151382948|GENSCAN_predicted_CDS_10|1254_bp atgacaactgaattaagttcatatgggtatttgggatccgagaattctgctttgttcaat agagtctgcaccagttactgtgaagaaggagtagagtctgctgccctcttgggatgtgac aatagctcatctactggaaataccagtttctcttcccttctgaggctagaatctagccac ttgactgcacaaaatggcaaagggcagggaaatagagtctgtgactgtgcccaggaaaaa gagaacctttgtggtaatcttccaacaatctctgctaccattttgtctttgaaggattac cctctgcatcttttccatatgaaaaccccatttcctctttcctttattgaatgcccctca aaatcagaacttacatcattaggcattattctctatttccttgatgatatggaggatgag attttcagacactatgcagagctgaggccacagaatttcccctgttccgtaagaaggaac aacagtgcttttatgacatcatctgattttgcagaacgagcagccggtgtgtaccacaga gaagcacggtctggcaaatacaagctcacctacgcagaagctaaggcggtgtgtgaattt gaaggcggccatctcgcaacttacaagcagctagaggcagccagaaaaattggatttcat gtctgtgctgctggatggatggctaagggcagagttggataccccattgtgaagccaggg cccaactgtggatttggaaaaactggcattattgattatggaatccgtctcaataggagt gaaagatgggatgcctattgctacaacccacacgcaaaggagtgtggtggcgtctttaca gatccaaagcaaatttttaaatctccaggcttcccaaatgagtacgaagataaccaaatc tgctactggcacattagactcaagtatggtcagcgtattcacctgagttttttagatttt gaccttgaagatgacccaggttgcttggctgattatgttgaaatatatgacagttacgat gatgtccatggctttgtgggaagatactgtggagatgagcttccagatgacatcatcagt acaggaaatgtcatgaccttgaagtttctaagtgatgcttcagtgacagctggaggtttc caaatcaaatatgttgcaatggatcctgtatccaaatccagtcaaggaaaaaatacaagt actacttctactggaaataaaaactttttagctggaagatttagccacttataa