GENSCAN 1.0 Date run: 4-Aug-121 Time: 20:32:16

Sequence gi568815578f:58552007_58776288 : 224282 bp : 48.37% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  14713  14782   70  0  1   75   76    64 0.284   3.14
 1.02 Intr +  21570  21924  355  1  1   84   84    60 0.099  -0.45
 1.03 Term +  22203  22308  106  0  1   63   50    99 0.188   1.48
 1.04 PlyA +  22790  22795    6                               1.05

 2.06 PlyA -  22941  22936    6                               1.05
 2.05 Term -  26458  26415   44  2  2   38   49   131 0.317   1.52
 2.04 Intr -  31213  31138   76  1  1   77   52    59 0.129   0.29
 2.03 Intr -  33842  33740  103  1  1  138   75   -11 0.241   2.78
 2.02 Intr -  40454  40294  161  2  2   87   68    10 0.302  -2.31
 2.01 Init -  40758  40702   57  0  0   68   92    59 0.686   5.61
 2.00 Prom -  41369  41330   40                              -2.96

 3.03 PlyA -  42205  42200    6                               1.05
 3.02 Term -  44871  44774   98  1  2   92   54   116 0.974   6.73
 3.01 Init -  46602  46497  106  0  1   74   70    66 0.587   2.64
 3.00 Prom -  49516  49477   40                              -0.76

 4.00 Prom +  50653  50692   40                              -5.56
 4.01 Init +  53720  53814   95  1  2   67   52    58 0.510  -0.05
 4.02 Intr +  53888  54010  123  2  0   98   65    95 0.876   7.90
 4.03 Intr +  56038  56165  128  1  2   85  101     5 0.714   1.82
 4.04 Intr +  56502  56784  283  1  1   84   86    74 0.359   3.28
 4.05 Intr +  63633  63769  137  0  2   87   80    18 0.083   1.11
 4.06 Term +  67365  67639  275  1  2   89   55    90 0.062   1.43
 4.07 PlyA +  67699  67704    6                               1.05

 5.00 Prom +  73938  73977   40                              -6.46
 5.01 Init +  83071  83295  225  0  0   77    5   236 0.077  10.67
 5.02 Intr +  97849  98447  599  0  2   33   34   304 0.200  10.84
 5.03 Intr + 110403 110460   58  0  1  119   26     9 0.026  -3.31
 5.04 Intr + 115484 115591  108  1  0  102  100   149 0.999  17.98
 5.05 Intr + 115981 116121  141  0  0   10   94   155 0.867   8.85
 5.06 Intr + 117285 117447  163  2  1   72   92   219 0.989  20.25
 5.07 Intr + 118506 118597   92  1  2   45   87    48 0.538   0.11
 5.08 Intr + 119148 119291  144  2  0   41   98   196 0.957  16.38
 5.09 Intr + 121625 121705   81  1  0   66   85    35 0.331   0.83
 5.10 Intr + 123417 123517  101  2  2  108   67    72 0.309   6.01
 5.11 Intr + 134266 134356   91  1  1   25   86    96 0.067   3.10
 5.12 Intr + 139718 139824  107  1  2  114   27    69 0.025   2.41
 5.13 Intr + 140891 141044  154  2  1  -15   83   233 0.026  12.57
 5.14 Intr + 141731 141916  186  1  0   87   89   247 0.666  24.59
 5.15 Intr + 142416 142586  171  2  0   99   94   269 0.936  28.74
 5.16 Intr + 146678 146767   90  1  0   97   68   185 0.542  17.59
 5.17 Intr + 147191 147272   82  1  1   15   90   164 0.943   8.51
 5.18 Intr + 149010 149152  143  1  2   96  100   220 0.979  24.07
 5.19 Intr + 150917 151065  149  1  2   73   75   105 0.838   6.73
 5.20 Term + 151209 151236   28  0  1   86   48    50 0.325  -1.65
 5.21 PlyA + 153141 153146    6                               1.05

 6.00 Prom + 153928 153967   40                              -8.96
 6.01 Init + 155123 155194   72  1  0   86  109   150 0.961  15.98
 6.02 Intr + 155340 155405   66  2  0  109   56    34 0.534   1.40
 6.03 Intr + 160473 160573  101  2  2   95   62   107 0.999   7.71
 6.04 Intr + 161414 161537  124  2  1   41  109   240 0.928  21.99
 6.05 Intr + 161911 162087  177  0  0   72   78   263 0.999  23.82
 6.06 Intr + 162554 162664  111  1  0   92   68   164 0.982  15.38
 6.07 Intr + 163162 163316  155  0  2   92   30   217 0.465  15.17
 6.08 Intr + 164150 164305  156  2  0   26   94   104 0.283   3.93
 6.09 Intr + 165170 165256   87  2  0   12   94    81 0.153   0.19
 6.10 Intr + 165747 165989  243  0  0    0   57   181 0.153   2.81
 6.11 Term + 168111 168225  115  0  1   64   55   105 0.066   2.84
 6.12 PlyA + 168469 168474    6                               1.05

 7.08 PlyA - 169679 169674    6                              -3.44
 7.07 Term - 170143 170026  118  0  1   70   49   181 0.005  10.41
 7.06 Intr - 186345 186314   32  0  2   50  101    49 0.168  -0.67
 7.05 Intr - 189330 189183  148  2  1  111   98    31 0.762   6.64
 7.04 Intr - 191375 191232  144  1  0   72  115    26 0.732   3.10
 7.03 Intr - 193943 193858   86  2  2  126   52     7 0.051  -0.48
 7.02 Intr - 205984 205811  174  1  0   47   59   163 0.042   9.44
 7.01 Init - 214246 214082  165  1  0   53   64   159 0.482   9.87

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 140895 141044  150  2  0   90   83   243 0.958  22.14
S.002 Term + 171943 172061  119  2  2   21   32   215 0.917   7.90

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:58552007_58776288|GENSCAN_predicted_peptide_1|176_aa
MGPFSAWLIWSNLAGSWLALLLTGKVYWLLSWINSENETHRILSRPGTATWPIHVEPAVT
TSAVCTKLEGGWGCSKYKFPHQQTGKRKTDLEAGYGDKINMSLYMCIKPQHKQQSPHGSG
RHCLEPYVTVFSHYLPTGLGDSPQALLPGTGVGHSFQFASETIFMTVSCYLKKQQQ
>gi568815578f:58552007_58776288|GENSCAN_predicted_CDS_1|531_bp
atggggccttttagcgcctggctcatttggtcaaacctggctgggtcctggctcgccttg
ctgttgacgggaaaagtgtactggctcctgtcctggatcaactcagaaaatgaaacacat
cggattctgtccaggccgggcacagcaacctggcccatccatgtggagcctgcagtgacc
acttccgctgtctgcacaaaactggagggaggctgggggtgctccaagtataagtttcct
catcagcaaaccggaaagagaaagaccgacctggaggctggttatggggataaaataaat
atgtccttatatatgtgtataaagccccagcacaaacagcagtcgccgcatgggtcaggg
aggcactgcttggagccttatgtgaccgtgttctcacattatcttccaactggtttggga
gacagcccacaggccctgcttcctggaactggtgtgggtcactcattccagttcgcctct
gaaactattttcatgacagtaagctgctacctcaagaaacagcaacagtag
>gi568815578f:58552007_58776288|GENSCAN_predicted_peptide_2|146_aa
MTFAVGKLNAVVDAKVEKQPPCLCHSGSAQGGRWCCGQFLLLLAAGWCHETSRQGSNQDG
GSTMAPEMPAQPRPDSPAEEKQNNLLEANESIHHFLKLQSSIAGLGQQMDATPRPCWPQP
FLLIVGIIFQILDTGDDYNDDGFYSY
>gi568815578f:58552007_58776288|GENSCAN_predicted_CDS_2|441_bp
atgacgtttgcagtgggcaagctaaatgctgtggtggatgcaaaggtggaaaagcagccg
ccatgcctttgtcactctggaagtgcacagggtggaaggtggtgctgcgggcagtttctg
ctgctgttagctgctggctggtgccatgaaaccagcagacagggctcgaatcaggacggt
ggctccaccatggccccagagatgccagctcagcccaggcctgactcccccgcagaggag
aagcaaaacaatctcttagaagcaaatgaatcaattcaccatttcttgaagctgcagagt
tctatagctggcttggggcagcaaatggacgccaccccccggccctgctggccccagcct
ttcctcctcattgttggtatcatcttccagatcctggacacgggtgatgactacaacgat
gacggcttctactcctactag
>gi568815578f:58552007_58776288|GENSCAN_predicted_peptide_3|67_aa
MDRLSGAVGLKASGPRWLVARDSSFLPHGPLWLHQKMYIEDSEAVRKPAAGEMEKGSSSE
KAVCIPG
>gi568815578f:58552007_58776288|GENSCAN_predicted_CDS_3|204_bp
atggatcgcctctcaggagctgttggactgaaggcctcagggcctcgctggctggtggcc
agagacagcagcttcttgccacatgggcctctctggcttcatcagaaaatgtatattgaa
gactcagaggcggtgaggaagccagcagctggcgagatggagaagggctcctcgtcagag
aaagctgtctgcataccaggctga
>gi568815578f:58552007_58776288|GENSCAN_predicted_peptide_4|346_aa
MMSRKDSDNSSPCCGTWSIPVSGAVANTAFANTGSSGYTAITALLPTKAACLPGWAEHVG
QTQGTLQTWDETWCHARKVERRQRGKSWPGKPLWGGDIGTEQKHIQKTIISGDGSGPASW
AETAAWHFLPQLKETRQPTPAIRVACVGLGPGVTVGMGSAGPSLLCGRPGAAECPIVHQD
VEAQARDASYFGCWAKAWIYLPQAFFSEGRVQTVPRPNRPGLGCFEGSTGKSSQDSCPHA
LPLLNVFPLTKSSEAGRKHVRSAVGQSLSASQQGPSCSSRLLQQPHPLAQAGLAEHCRGQ
VQPPCSKAQLPDQTRSKAIRSSKDYFHSIFIDFYHSLGLYRALSLP
>gi568815578f:58552007_58776288|GENSCAN_predicted_CDS_4|1041_bp
atgatgtcacggaaagacagtgacaacagcagcccttgctgcggcacatggagcatccct
gtgtctggcgctgtcgcaaacactgcatttgctaacacagggtcctcaggctacacggca
attacagcactcctgccgacaaaagctgcctgcttaccaggatgggctgagcacgtgggc
cagacccaaggcaccctgcagacctgggacgaaacctggtgccacgccaggaaagtagag
cgaaggcagagagggaagagctggccagggaagcctctctggggaggtgacattggaaca
gagcagaaacatatccagaagacgatcatttcaggggatgggagcggccctgcctcgtgg
gctgagactgctgcctggcacttcctgccccagctgaaggaaaccaggcagcccacacct
gccattcgggtcgcatgtgtgggcctggggcccggggtcactgtcgggatgggctcagca
ggtcctagcttgctgtgtggaaggcctggggccgcagaatgccccattgttcatcaggac
gtggaggcccaggccagagatgccagctattttgggtgctgggcgaaggcctggatttac
ttgccacaagcgtttttctcagaaggaagggtgcaaacggttcctcggcccaacaggcca
gggctgggctgctttgagggaagtacaggaaagagtagccaggacagctgtccacacgct
cttcctctccttaatgtgtttcctctgaccaagagcagcgaggcaggccggaagcacgtg
cggtcagcggtgggccagagcctctcagcctctcagcaagggccttcctgtagctctcgg
ctcctccaacagccccatcccttagcacaggcggggctggcggagcactgcaggggccag
gtgcagccaccgtgcagcaaggcacagctgcctgatcaaaccagatcaaaagccattcga
tcatccaaggactattttcattcgatatttattgatttctaccactcgctggggctgtac
agggccctgagcttgccgtga
>gi568815578f:58552007_58776288|GENSCAN_predicted_peptide_5|970_aa
MTLLAQEWGAASHLLLWGRVSPGSGPRWRICSMSCDVLQPRVTESLLWATKSTQPLAMGH
PAGPLDTETCGSQPVLHNERAGTFSVREKARVGGGYRCFPRSPGGPARAPRVPLSGWGPW
VSRGHRLRVSPGATPLCKRSAHGKPSENTHVCKVCSTQKQRTRGGRFWGLGPLLEPRTAA
RPAASPRKGKPALHRPGARKAPPHSSAPPTRSGRALGQALGLKPPGRLRAGPGSPSRDAS
LFVGAVLGPGRPLPDAAPRSPCTNPADLARGGKSRVCMLVEKKGKSVKLNNSIRLADDRM
ALVSGISLDPEAAIGVTKRPPPKWVDGVDEIQYDVGRIKQKMKELASLHDKHLNRPTLDD
SSEEEHAIEITTQEITQLFHRCQRAVQALPSRARACSEQEGRLLGNVVASLAQALQELST
SFRHAQSGYLKRMKNREERSQHFFDTSVPLMDDGDDNTLYHRGFTEDQLVLVEQNTLMVE
EREREIRQIVQSISDLNEIFRDLGAMIVEQGTVLDRIDYNVEQSCIKTEDGLKQLHKCSA
SFASQVGPFGISLAGMMEFPLLPSFQDAVLREALRFRFALDSTNHAASPGEGEVFAEEHQ
VAGYEEETQNNRSGLWNYMEVTEEGTNQRPLSNQDVGKMANVGLQFQASAGDSDPQSRPL
LLLGQLHHLHRVPWSHVRGKLQPRVTEELWQAALSTLNPNPTDSCPLYLNYATVAALPCR
VSRHNSPSAAHFITRLVRTCLPPGAHRCIVMVCEQPEVFASACALARAFPLFTHRSGASR
RLEKKTVTVEFFLVGQDNGPVEVSTLQCLANATDGVRLAARIVDTPCNEMNTDTFLEEIN
KVGKELGIIPTIIRDEELKTRGFGGIYGVGKAALHPPALAVLSHTPDGATQTIAWVGKGI
VYDTGGLSIKGKMSAEQKHFGHSNSKCSHGMREVKERRFRVFVLTIPHSRVLFWVLCFDT
GRISNNDMRI
>gi568815578f:58552007_58776288|GENSCAN_predicted_CDS_5|2913_bp
atgacgctcctggcccaggagtggggggctgcctcacacctgctgctgtggggcagagtc
tctcctgggtcggggcctcgctggcggatctgctccatgtcgtgtgatgtgctgcagccc
cgcgtcaccgagagcctcttatgggccaccaagtccacacagccactggccatggggcac
cctgcagggcctctggacacggagacctgtggttctcagccagtgttgcacaacgaacgc
gcgggaacattctccgtgagagaaaaagcacgggtggggggcggttaccggtgcttcccc
cggtcgcctgggggtcccgcacgcgccccgcgggtgccgctgtctggctggggtccctgg
gtgagccgcgggcaccggcttcgcgtctccccgggcgcgactccgctttgcaagcgctca
gcacacgggaaaccctcggaaaacacacacgtgtgtaaagtttgttccacgcagaaacaa
aggacgcgtgggggccgcttctggggcctcggtcctttgttggaaccccgcaccgcagcc
cggcccgcagcctcgccccgcaaggggaagccggccctgcacaggcccggggcccggaag
gcgccgccgcacagctctgcgcccccgacccgctccggccgcgcgctgggccaggccctt
ggcctcaagcctcccgggcggctccgggccggacccgggtctccgtcgcgggacgccagc
ctgtttgtgggtgccgtgctcgggcccgggcggcccctgcccgacgcggccccacggagc
ccctgcacgaaccccgccgacctggcccggggcggcaagtcgagagtatgtatgttagtt
gagaaaaaaggaaaatcagtgaaacttaacaacagtatcaggcttgctgatgaccgtatg
gcactggtgtcaggcatcagcttagatccagaagcagcgattggtgtgacaaaacggcca
cctcctaagtgggtggatggagtggatgaaattcagtatgatgttggccggattaagcag
aagatgaaagaattggccagccttcatgacaagcatttaaacagacccaccctggatgac
agcagcgaagaggaacatgccattgagataactacccaagagatcactcagctcttccac
aggtgccagcgtgccgtgcaggccctgccgagccgggcccgggcctgctccgagcaggag
gggcggctgcttgggaacgtggtggcctcgctggcgcaggccctgcaggaactctccacc
agcttccggcacgcacagtcaggctacctcaaacgcatgaagaatcgagaggaaagatcc
cagcattttttcgacacatcagtaccactaatggatgatggagacgataacactctttac
catcggggttttacagaggaccagttagttctggtggagcagaacacactgatggtggaa
gagcgggaacgagagattcgccagattgtacagtccatttctgacctgaatgaaatattc
agggacttaggggcgatgattgtagaacagggtacagtccttgacagaattgactataac
gttgaacagtcctgtatcaaaactgaagatggtttgaaacagcttcacaagtgctctgcg
tctttcgccagccaggtggggccctttggcatttctctagcaggcatgatggagttcccg
ctcctgccgagcttccaggatgcggtgttaagagaggcgctgcgttttcgttttgcactg
gactccacaaatcatgctgccagccctggggagggggaggtctttgctgaagaacaccag
gtagctggttatgaagaagaaactcaaaataacaggagtggcttatggaactacatggag
gtaacagaggagggtaccaaccaaaggcccttgagcaatcaggatgttgggaagatggcg
aacgtggggctgcagttccaggcgagcgcgggggactcggacccacagagccggcccctg
ctgctgctcgggcagctgcaccacctgcaccgcgtgccctggagccacgtccgcgggaag
ctgcagccccgggtcaccgaggagctctggcaggctgccctgagcacgctcaaccccaac
cccacggacagctgtcccctctacctgaactacgccaccgtggctgccctgccctgcagg
gtgagccggcacaacagcccctcggccgcccacttcatcacgcggctggtgcggacctgc
ctgccgcccggagcgcatcgctgcattgtgatggtctgcgagcagccggaggtctttgct
tccgcctgtgccctggcccgggccttcccgctgttcacccaccgctcaggtgcctctcgg
cgcttggagaagaagacggtcaccgtggagtttttcctggtgggacaagacaacgggccg
gtggaggtgtccacattgcagtgcttagcgaatgccacagacggcgtgcggctagcagcc
cgcatcgtggacacaccctgcaatgagatgaacaccgacaccttcctcgaggagattaac
aaagttggaaaggagctggggatcatcccaaccatcatccgggatgaggaactgaagacg
agaggatttggaggaatctatggggttggcaaagccgccctgcatcccccagccctggcc
gtcctcagccacaccccagatggagccacgcagaccatcgcctgggtgggcaaaggcatc
gtctatgacactggaggcctcagcatcaaagggaagatgtctgcagaacaaaagcatttt
ggacacagtaactccaagtgttcacatggaatgagggaagtgaaggaacgcaggttccgc
gtcttcgttttgaccatcccgcactctcgtgttttattttgggtcctctgtttcgacacc
gggaggataagtaacaacgatatgcgaatttga
>gi568815578f:58552007_58776288|GENSCAN_predicted_peptide_6|468_aa
MPGMKRDCGGAAAVLGAFRAAIKQALLGPPELPAFIRAALGGPEGKGFKDNLHAVFCLAE
NSVGPNATRPDDIHLLYSGKTVEINNTDAEGRLVLADGVSYACKDLGADIILDMATLTGA
QGIATGKYHAAVLTNSAEWEAACVKAGRKCGDLVHPLVYCPELHFSEFTSAVADMKNSVA
DRDNSPSSCAGLFIASHIGFDWPGVWVHLDIAAPVHAGERATGFGVALLLALFGRASEDP
LLNLVSPLGCEVDVEEGDLGRDSKRRRLVVLEPALCEVDCASVPLGTVLGDKGQGPEANS
DLPGAPPGFREPLNLGSRKSRALGGNGCFPRPHQSPGNRVMGRLAVLGGRLKRALPASAR
KPGLWGCVIVDDHFQVVATAPVLVTAGPPILLRHPPLSHRAVFRASAGLSCCSQTLALRK
AGQSTSCSLRGCVFPACKWNGFRKEEPARPCRKLEEGCAPGTHRGGGD
>gi568815578f:58552007_58776288|GENSCAN_predicted_CDS_6|1407_bp
atgccggggatgaagcgagactgcgggggtgctgcggccgtcctgggggccttcagagcc
gcaatcaagcaggctcttctggggcccccagagcttcccgccttcatcagagctgctctg
ggaggccccgagggcaagggtttcaaagacaacctccacgctgtgttctgcttggctgag
aactcggtggggcccaatgcgacaaggccagatgacatccacctgctgtactcagggaag
acggtggaaatcaacaacacggatgccgagggcaggctggtgctggcagatggcgtgtcc
tatgcttgcaaggacctgggggccgacatcatcctggacatggccaccctgaccggggct
cagggcattgccacagggaagtaccacgccgcggtgctcaccaacagcgctgagtgggag
gccgcctgtgtgaaggcgggcaggaagtgtggggacctggtgcacccgctggtctactgc
cccgagctgcacttcagcgagttcacctcagctgtggcggacatgaagaactcagtggcg
gaccgagacaacagccccagctcctgtgctggcctcttcatcgcctcacacatcggcttc
gactggcccggagtctgggtccacctggacattgctgcaccggtgcatgctggtgagcga
gccacaggcttcggtgtggccctcctgctggcgctcttcggccgtgcctctgaggaccct
ctgctgaacctggtgtccccactgggctgtgaggtggatgtcgaggagggggacctgggg
agggactccaagagacgcaggcttgtggtcctggagcctgcgctgtgtgaggtcgactgt
gcttccgtgcccctgggcactgtcctaggtgataaagggcagggccccgaagccaattcg
gatctcccgggagccccgccaggattcagggaacccctgaacctgggctccaggaaaagc
agggctctaggaggcaacggctgcttccccaggccccaccagagccctggcaacagggtc
atggggcggctggcggtgctggggggcaggctgaagagagctttaccggcatccgccagg
aagcctgggctctggggctgtgtcattgtagatgaccatttccaggtcgtggccacagcc
ccggttctggtcacagctgggccaccaatcctcctgcgccacccaccacttagccatcgg
gccgtcttcagggcatcggccggcctcagctgctgcagccagaccctggcgttgcggaag
gctggccagtccacatcctgcagcctacgagggtgtgtcttcccagcctgcaagtggaac
ggcttccggaaggaggagccggcacggccctgcaggaagctggaggagggctgtgcccct
ggcacccacaggggcgggggtgactga
>gi568815578f:58552007_58776288|GENSCAN_predicted_peptide_7|288_aa
MTWINTSGAGYLVAFGYLMLHGGHLLLKPVKFILQPLDCLIACSVLVVAVKTLLSLQQRI
LSLESAKIALSESLISYRERAEIVEKQTQALTMRVADPQRKVDAQPRQVSTVKRLQHLLQ
SPYPPSGLLAECEFGEAVWPWWTFQVYHPICQFFHNFTHLQASWRLSPHVPGGRVQLCPH
PLWVLTRPSRFCLEPFLLEPRAVIDWVWMDTALSLSNWIHLGNLCAKVFIMKHWWESEKL
DFQCIGEEEWNESLVMNVEVASEHVVYYVDKETRDPLACTLTHTRESL
>gi568815578f:58552007_58776288|GENSCAN_predicted_CDS_7|867_bp
atgacatggatcaacacctcaggggctgggtacctggtggcctttggatacctcatgctg
catggaggccacctcctcttgaagcccgtgaagttcattctccagccactggactgcctg
attgcctgctcggtgctggtggttgctgtgaagactctcctttcgcttcagcagcggatc
ctgagccttgaatctgcgaagattgccctgagtgagagtcttatttcctatagagaaaga
gctgaaattgtggaaaaacagacacaagctcttactatgcgagtggctgacccgcaaaga
aaagtggatgcacagcctcgccaggtgtctactgttaaaagactccagcaccttctccag
tccccatatcctccatctggtttgctggccgagtgtgagtttggggaggctgtgtggcca
tggtggaccttccaggtctaccaccccatctgccaattcttccacaacttcacccacctg
caggccagctggaggttaagtcctcacgttcctggtggacgtgttcaactttgccctcat
cctctttgggtactgactaggccttcacggttctgccttgagcctttcctcttggagccg
agagctgtcatagactgggtatggatggacactgctctcagcctctctaactggatccat
ctggggaacctgtgtgcaaaggtcttcatcatgaagcactggtgggagtctgagaagcta
gactttcagtgcattggtgaagaggaatggaatgagtctctggtcatgaatgtggaagta
gccagcgagcacgtggtatactacgtggataaggaaacacgagacccactggcctgcacg
ctcacgcacacccgagagagcctgtag