GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:15:24

Sequence gi568815581f:76769030_77048835 : 279806 bp : 50.50% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    717    842  126  1  0   74   63   114 0.986   8.15
 1.02 Intr +   5968   6142  175  2  1   79   80   171 0.884  14.40
 1.03 Intr +   7377   7512  136  0  1   80   91    77 0.961   7.77
 1.04 Term +   9159   9323  165  2  0   76   39   119 0.996   3.72
 1.05 PlyA +   9770   9775    6                               1.05

 2.03 PlyA -  10452  10447    6                               1.05
 2.02 Term -  12322  12248   75  1  0  112   45    70 0.931   2.94
 2.01 Init -  37477  37340  138  1  0   88   57   121 0.024   7.14
 2.00 Prom -  49327  49288   40                              -5.86

 3.02 PlyA -  51567  51562    6                               1.05
 3.01 Sngl -  57701  57216  486  2  0   43   37   185 0.275   4.97
 3.00 Prom -  59620  59581   40                              -5.16

 4.00 Prom +  63057  63096   40                              -2.86
 4.01 Init +  70367  70454   88  1  1  114   49    38 0.656   3.33
 4.02 Intr +  81394  81644  251  2  2   43  110    79 0.130   2.76
 4.03 Term +  87991  88173  183  0  0   60   38   136 0.506   3.34
 4.04 PlyA +  88835  88840    6                               1.05

 5.00 Prom +  89884  89923   40                              -2.86
 5.01 Init +  93760  93845   86  0  2   74   97    29 0.035   2.75
 5.02 Intr +  99846  99953  108  0  0   57   86    65 0.033   2.60
 5.03 Intr + 103822 103934  113  1  2  165   77   113 0.196  17.92
 5.04 Intr + 113122 113269  148  2  1  101   94   253 0.585  26.49
 5.05 Intr + 133526 133641  116  2  2   90   59   237 0.812  21.09
 5.06 Intr + 134274 134347   74  1  2   51   78    82 0.937   2.63
 5.07 Intr + 135223 135393  171  0  0   65   80   211 0.773  18.14
 5.08 Intr + 136140 136304  165  2  0    2   96   228 0.781  15.16
 5.09 Intr + 136989 137158  170  2  2  121  105   234 0.861  27.14
 5.10 Term + 139267 139270    4  1  1  130   49     0 0.424  -3.32
 5.11 PlyA + 141924 141929    6                               1.05

 6.00 Prom + 142690 142729   40                              -6.16
 6.01 Init + 144686 144755   70  1  1   85   71    91 0.861   8.31
 6.02 Intr + 154031 154081   51  0  0  111   66    55 0.026   4.48
 6.03 Intr + 155894 156068  175  0  1   16   97   196 0.062  12.30
 6.04 Intr + 157568 157701  134  2  2   34   69   370 0.994  29.99
 6.05 Intr + 161456 161557  102  0  0   63   38    89 0.445   1.55
 6.06 Intr + 163616 163746  131  0  2   55  105   390 0.998  37.81
 6.07 Intr + 168959 169114  156  1  0  116   61   336 0.998  33.91
 6.08 Intr + 171373 171519  147  0  0  110  105   257 0.999  30.03
 6.09 Intr + 171703 171819  117  0  0   54   94   245 0.730  22.36
 6.10 Intr + 177347 177421   75  1  0  129   82   162 0.997  19.41
 6.11 Intr + 178801 179057  257  0  2  144   89   182 0.992  20.14
 6.12 Intr + 179611 179782  172  1  1   63   37   318 0.315  24.15
 6.13 Intr + 181650 181674   25  2  1   82   55    31 0.030  -3.20
 6.14 Term + 188596 188801  206  2  2   59   55   134 0.295   4.73
 6.15 PlyA + 189085 189090    6                               1.05

 7.13 PlyA - 190672 190667    6                               1.05
 7.12 Term - 193189 192845  345  1  0   36   47   177 0.302   2.89
 7.11 Intr - 197810 197659  152  0  2   79   64    39 0.196   0.48
 7.10 Intr - 200266 200149  118  1  1   81   60    51 0.290   1.64
 7.09 Intr - 206122 205974  149  2  2   52   59    76 0.095   1.05
 7.08 Intr - 207428 207388   41  1  2   83   87    13 0.036  -1.43
 7.07 Intr - 213874 213809   66  0  0   95   75    59 0.196   3.32
 7.06 Intr - 214187 214073  115  0  1   40   71    70 0.437   0.01
 7.05 Intr - 214739 214689   51  0  0  122   91    16 0.761   4.18
 7.04 Intr - 218044 217993   52  1  1   73  116    32 0.873   2.98
 7.03 Intr - 219418 219270  149  2  2   63   86    87 0.794   5.95
 7.02 Intr - 226700 226527  174  0  0  103   60    54 0.040   4.01
 7.01 Init - 248177 248009  169  2  1   81   50    70 0.266   2.20
 7.00 Prom - 256743 256704   40                              -1.96

 8.03 PlyA - 260021 260016    6                               1.05
 8.02 Term - 268981 268881  101  2  2   78   49    77 0.125   1.09
 8.01 Init - 270929 270824  106  2  1   85   54    89 0.187   5.59

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -  44088  44006   83  1  2  110   45    58 0.806   1.56
S.002 Init -  49134  49099   36  0  0   76   81    49 0.866   2.06
S.003 Init + 155980 156068   89  0  2   94   97   215 0.831  23.11

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:76769030_77048835|GENSCAN_predicted_peptide_1|200_aa
XLELTFFSGVYGTCIGATNKFGAEEKSLIGLSGIFIGIGEILGGSLFGLLSKNNRFGRNP
VVLLGILVHFIAFYLIFLNMPGDAPIAPVKGTDSSAYIKSSKEVAILCSFLLGLGDSCFN
TQLLSILGFLYSEDSAPAFAIFKFVQSICAAVAFFYSNYLLLHWQLLVMVIFGFFGTISF
FTVEWEAAAFVARGSDYRSI

>gi568815581f:76769030_77048835|GENSCAN_predicted_CDS_1|603_bp
ngtctggaattaactttcttctctggtgtatatggaacctgtattggtgctacaaataaa
tttggagcagaagagaaaagccttattggactttctggcattttcatcggcattggagaa
attttaggtggaagcctcttcggcctgctgagcaagaacaatcgttttggtagaaatcca
gttgtgctgttgggcatcctggtgcacttcatagctttttatctaatatttctcaacatg
cctggagatgccccgattgctcctgttaaaggaactgacagcagtgcttacatcaaatcc
agcaaagaagttgccattctctgcagttttctgttgggccttggagacagctgctttaat
acccagctgcttagtatcttgggctttctgtattctgaagacagcgccccagcatttgcc
atcttcaagtttgttcagtctatttgcgcagccgtggcatttttctacagcaactacctt
ctccttcactggcaactcctggtcatggtgatatttgggttttttggaacaatttctttc
ttcactgtggaatgggaagctgccgcctttgtagcccgcggctctgactaccgaagtatc
tga

>gi568815581f:76769030_77048835|GENSCAN_predicted_peptide_2|70_aa
MRLHSSALGWSMGLGALEQVAALIGEARAGQEPTAGAGRLRHSRLQLSLLVTFDGDSCHV
VGCPEDYSVA

>gi568815581f:76769030_77048835|GENSCAN_predicted_CDS_2|213_bp
atgcgcctgcactcctcagcccttgggtggtcgatgggactgggtgccctggagcaggtg
gcggcgctcattggggaggctcgggcagggcaggagcccacggcgggggcggggaggctc
agacatagcaggctgcagctgtccttgcttgtgacctttgatggagacagctgccatgtc
gtaggctgccctgaggactactccgtagcatga

>gi568815581f:76769030_77048835|GENSCAN_predicted_peptide_3|161_aa
MQFQQQKCTITPARKQQVQASYYSAFDSPALTWLLTAFTRRNLNLHRNDETGAEGDIPEP
GRLQASFPPQPILSAEWGREVLLASCWTAAALACGSYTHSLTGTSAGRNRIGYLHLNAPL
PLTGTCHCSVQSCPPRCLEGLCGTAEYTRTQRRGASLNTVT

>gi568815581f:76769030_77048835|GENSCAN_predicted_CDS_3|486_bp
atgcaatttcagcagcagaaatgcaccatcactccggccaggaagcagcaggtacaggca
tcttattactcggcatttgatagcccggctctcacttggttgttaacagccttcacaaga
agaaacctcaatctgcaccgaaatgatgagaccggtgcagagggagacatcccggagcct
ggacgccttcaggcctcgtttcctccccagcccatcctcagcgcagagtgggggagagaa
gtcctgcttgcctcatgttggacagcagccgcccttgcatgtggaagttacacacactca
ctcacagggaccagcgcagggagaaacagaattggctacctccacctcaatgctcccctt
ccgctcacgggcacctgccactgctctgtccagtcctgccctccacggtgccttgagggc
ctgtgtggtacagccgagtacaccaggacccagcgaagaggtgcctccctcaacactgtc
acatag

>gi568815581f:76769030_77048835|GENSCAN_predicted_peptide_4|173_aa
MPGPNRFFCHVGDVLSVLCNTVAMSHVQGGRIVRKPRRGLLEENDAEHPGPWTASSPEAD
RSLDNSGQGAWRAAPTCRSGPYLCCLNSSFKARAAKDSQDILPPKAVELGRPRRTKQGPR
LSHGILYTAMEAGAEMDQQRGPSATPPIYFPSVEDGKVMAVSVWQAPTWNWTP

>gi568815581f:76769030_77048835|GENSCAN_predicted_CDS_4|522_bp
atgcccggccccaacagatttttctgccatgttggagatgttctctctgtgctgtgcaac
acagtagccatgagccacgtgcaaggtggccggattgtcaggaaaccaaggaggggcctt
ttggaagagaatgatgctgagcatccgggtccctggacagccagcagccctgaggcagac
agaagtttggacaactcaggacagggagcttggagggcagcccctacatgtcggagtggt
ccctacctgtgttgcctcaactcttccttcaaggcaagagccgctaaggacagtcaggac
atcctaccgcccaaggctgttgagctcggccggccaaggaggacaaagcaggggccacgg
ctctcccacggcatcctctatacagccatggaggctggagcagagatggaccaacaaagg
ggtccatctgccacaccccccatctacttcccttctgttgaagatgggaaggtcatggct
gtctcagtgtggcaagctcccacctggaactggacaccataa

>gi568815581f:76769030_77048835|GENSCAN_predicted_peptide_5|384_aa
MVNGLFLFSLYTPQAPSMSSKNNLEAFRRPEQRGHRAARSQLRSDAASARRGFVARTRRE
LGPGRLFVLGIGFFTLCFLMTSLGGQFSARRLGDSPFTIRTEVMGGPESRGVLRKMSDLL
ELMVKRMDALARLENSSELHRAGGDLHFPADRMPPGAGLMERIQAIAQNVSDIAVKVDQI
LRHSLLLHSKVSEGRRDQCEAPSDPKFPDCSGKVEWMRARWTSDPCYAFFGVDGTECSFL
IYLSEVEWFCPPLPWRNQTAAQRAPKPLPKVQAVFRSNLSHLLDLMGSGKESLIFMKKRT
KRLTAQWALAAQRLAQKLGATQRDQKQILVHIGFLTEESGDVFSPRVLKGGPLGEMVQWA
DILTALYVLGHGLRVTVSLKELQR

>gi568815581f:76769030_77048835|GENSCAN_predicted_CDS_5|1155_bp
atggtaaatggcttgtttctcttctccctctacacaccccaggccccgtctatgagctct
aaaaacaaccttgaagccttcagaaggcctgagcagcgaggccaccgggccgcgcgctcc
cagcttcgctcggacgcggcttcggcccgcagagggttcgtggcccggacgcggcgagag
ctgggcccaggacggctttttgtcctgggcatcggcttcttcactctctgcttcctgatg
acgtctctgggaggccagttctcggcccggcgcctgggggactcgccattcaccatccgc
acagaagtgatggggggccccgagtcccgcggcgtcctgcgcaagatgagcgacctgctg
gagctgatggtgaagcgcatggacgcactggccaggctggagaacagcagtgagctgcac
cgggccggcggcgacctgcactttcccgcagacaggatgccccctggggccggcctcatg
gagcggatccaggctattgcccagaacgtctccgacatcgctgtgaaggtggaccagatc
ctgcgccacagtctgctcctgcacagcaaggtgtcagaaggccggcgggaccagtgtgag
gcacccagtgaccccaagttccctgactgctcagggaaggtggagtggatgcgtgcccgc
tggacctctgacccctgctacgccttctttggggtggacggcaccgagtgctccttcctc
atctacctcagtgaggtcgagtggttctgccccccgctgccctggaggaaccagacggct
gcccagagggcacccaagcccctccccaaagtccaggcagttttccgaagcaacctgtcc
caccttctggacctgatgggcagcgggaaggagtccctgatcttcatgaagaagcggacc
aagaggctcacagcccagtgggcgctggctgcccagcgcctggcacagaagctgggggcc
acccagagggaccagaagcagatcctggtccacatcggcttcctgacggaggagtccggg
gacgtgttcagccctcgggtcctgaagggcgggcccctaggggagatggtgcagtgggcg
gacattctgactgcactctatgtcctgggccatggcctgcgggtcacagtctccctgaag
gagctgcagagatag

>gi568815581f:76769030_77048835|GENSCAN_predicted_peptide_6|605_aa
MGACEKMLLEGDVASNTLLIEKPTLSAGDQKHHRHGTTLDGGFLGAQNLKGSSLSNLGVP
PGRGSCPLTMPLPFDLIYTDYHGLQQMKRHMGLSFKKYRCRIRVIDTFGTEPAYNHEEYA
TLHGYRTNWGYWNLNPKQFMTMFPPSVREGRLNILGLNKRFIRDPSAQKAGLVCLQIPHT
PDNSFMGFVSEELNETEKRLIKGGKASNMAVVYGKEASIWKGKEKFLGILNKYMEIHGTV
YYESQRPPEVPAFVKNHGLLPQPEFQQLLRKAKLFIGFGFPYEGPAPLEAIANGCIFLQS
RFSPPHSSLNHEFFRGKPTSREVFSQHPYAENFIGKPHVWTVDYNNSEEFEAAIKAIMRT
QVDPYLPYEYTCEGMLERIHAYIQHQDFCRAPDPALPEAHAPQSPFVLAPNATHLEWARN
TSLAPGAWPPAHALRAWLAVPGRACTDTCLDHGLICEPSFFPFLNSQDAFLKLQVPCDST
ESEMNHLYPAFAQPGQECYLQKEPLLFSCAGSNTKYRRLCPCRDFRKGQLAVTVAAREED
PGTHREEIPVKTEAEIRAMQLRQQTLRPAGDRRKLGGQRWTLLWRLQREYGPAGTLMLDF
WPLEL

>gi568815581f:76769030_77048835|GENSCAN_predicted_CDS_6|1818_bp
atgggtgcctgtgagaagatgcttctagaaggagatgtggccagcaacacattgctcatt
gaaaagccaacgctgtcggcaggtgaccagaaacatcatcgacacggcaccactttggac
ggtgggtttcttggggcccagaatctgaagggctcttctctcagtaacttaggggtaccg
ccaggccggggaagctgcccgctcaccatgcccctgcccttcgacctcatctacaccgac
taccacggcctgcagcagatgaagcggcacatgggactctccttcaagaagtaccggtgc
cgaatcagggtcatcgacaccttcgggacggaacctgcgtacaaccacgaggagtacgcc
acgctgcacggctaccggaccaactggggctactggaacctcaaccccaagcagttcatg
accatgtttcccccttccgtgcgggaaggtagattaaatatccttggactcaataaacgg
tttattcgggatcccagtgcgcagaaggcagggctcgtgtgcctgcagattcctcatacc
cccgacaactccttcatgggcttcgtgtccgaggagctcaacgagacggagaagcggctc
atcaaaggcggcaaggccagcaacatggccgtggtgtacggcaaggaggcgagcatctgg
aaggggaaggagaagttcctgggcatcctgaacaaatacatggagatccatggcaccgtg
tactacgagagccagcggccccccgaggtgccagcctttgtgaagaaccacggcctctta
ccgcagcctgagtttcagcagctgctgcgcaaggccaaactcttcatcgggtttggcttc
ccctacgagggccccgcccccctggaggccatcgccaatggttgcatcttcctgcagtcc
cgcttcagcccgccccacagctccctcaaccacgagttcttccgaggcaagcccacctcc
agagaggtgttctcccagcatccctacgcggagaacttcatcggcaagccccacgtgtgg
acagtcgactacaacaactcagaggagtttgaagcagccatcaaggccattatgagaact
caggtagacccctacctaccctacgagtacacctgcgaggggatgctggagcggatccac
gcctacatccagcaccaggacttctgcagagctccagaccctgccctaccagaggcccac
gccccgcagagcccctttgtcctggcccccaatgccacccacctcgagtgggctcggaac
accagcttggctcctggggcctggccccccgcgcacgccctgcgggcctggctggccgtg
cctgggagggcctgcaccgacacctgcctggaccacgggctaatctgtgagccctccttc
ttccccttcctgaacagccaggacgccttcctcaagctgcaggtgccctgtgacagcacc
gagtcggagatgaaccacctgtacccggcgttcgcccagcctggccaggagtgctacctg
cagaaggagcctctgctcttcagctgcgccggctccaacaccaagtaccgccggctctgc
ccctgccgcgacttccgcaagggccagctggctgtgacagtggccgcacgagaagaggac
ccggggacacacagggaagaaatccctgtgaagacagaggcagagatcagagccatgcag
ctgcgccagcaaacactaaggcctgctggtgaccgccggaagctgggagggcaaagatgg
acccttctctggcgccttcagagggaatacggccctgctggcaccctgatgctggacttc
tggcctctagaactgtga

>gi568815581f:76769030_77048835|GENSCAN_predicted_peptide_7|526_aa
MTTEVMYCQESVGDCCAQLSECVLNGSVKRLLNQSDSILDRGWENEFEACWAAFPGAQQP
DAKRCTFWGGTFLIFHKLNILKTKLRLCLQPSVATSYYLREGAGAMHLLMFASRAAPAGI
KLLLYIRGSQLLLPLGYSDTVLQLPGHCRGPLLPIATSPTDCDKGLDEAVSTNSLALVKP
QGSNPGSHVASSQTPVRIGRCAPEAVQAKVTRGGGHRLMQGCVGWQLGLTVQGGAVRELF
IYPQVFSEQNLPGCLDENSLYEDNICLLKKSKALADPGAFMGLREEEVYGQPWVGSEKAP
VPTSVSGTGSLAPSLQDLPGLKLIRTSGPKDANEGPRQVISLANEAAAPALFPLEYLINT
AEGASQFTQEGETLRTLPVIFSGKPFLSSPPTSSQQPGSALPELQGLAQALQWQGDAGPR
SDHKSHIDCQILDEQALKSCLCRLQSGSPGPGLGCQEHSGLRRGFQSVGQGWGKGGPGGA
TGHTGQKHLPPGWHFTFGKLLHWIMKTFVFSKARKKRITITTSQGT

>gi568815581f:76769030_77048835|GENSCAN_predicted_CDS_7|1581_bp
atgaccacagaagtcatgtactgtcaagagagcgtgggggactgttgtgcgcagctttca
gaatgtgtccttaacggaagtgtcaagagacttttgaaccagagtgactccatcttggat
aggggctgggaaaatgagtttgaggcctgctgggctgcattccctggagctcaacagcct
gatgccaaaaggtgcacattttggggcggcacatttctgatcttccacaagctgaatatt
ctgaagaccaaattaagactgtgtttgcaacccagcgtggccacgagctattacctaaga
gaaggagctggggccatgcatttgctaatgtttgcatccagggctgctccagcaggaatt
aaattgctcctctacatccggggatcccagttgctattgcccctcgggtactcggacact
gtcctccaactacctggtcactgcagaggccctctcctccccatcgctacctcccccacc
gactgtgacaagggtttggacgaagcagtctcaacaaactctttggccctcgtcaagcct
caaggatctaatccagggtcccacgttgcctctagtcagacaccagtcagaattgggagg
tgcgctccagaggctgtgcaggccaaggtgacacgaggaggcggccacaggctgatgcag
ggctgcgtgggctggcagttaggactgacggtacaaggaggggcagttagggagctgttc
atctacccacaagtgttctctgagcagaacctcccaggatgcttggatgagaacagcctt
tatgaagataacatatgcctgttgaaaaaatcaaaagctctggctgatcctggggctttt
atgggcctcagagaggaggaagtgtatgggcagccatgggtgggctcggaaaaggcacca
gttcccacttcagtcagtgggactggcagcctggcccctagccttcaggacctccctggc
ctgaagctaatcagaacctctgggccgaaagatgcaaatgaagggcctcgtcaagtcatt
tcccttgcaaatgaagctgccgctccggcgctttttcctcttgaatacctcatcaacacc
gcagagggggcttctcagttcacccaggagggagagaccctcagaactctgccagtcatc
ttctcagggaagcctttcctgtcctcacctcccacctcgtcccagcagcccggctctgcg
ttgccagaactccagggcttagctcaggccctacagtggcaaggggacgcagggccccgc
tcagaccacaagtcgcacatcgactgtcaaatcctcgatgagcaggctctgaagagttgc
ctgtgccggctgcaatctggctcgcctgggcccgggctggggtgccaggagcactctgga
ctgaggcggggcttccagagcgtggggcagggctgggggaaaggagggcctggaggagcc
acaggccacaccggacaaaaacaccttccaccagggtggcacttcaccttcgggaagctc
cttcactggataatgaagacttttgtcttcagcaaagcgaggaagaaaagaattaccatt
accaccagccaaggcacctga
>gi568815581f:76769030_77048835|GENSCAN_predicted_peptide_8|68_aa
MANAGFGRSGHMSSQVRTPGLQALELQAPNPSGDFEVSASNKLVSHQSSIGAMCLATFYC
VVWRIPTG

>gi568815581f:76769030_77048835|GENSCAN_predicted_CDS_8|207_bp
atggccaatgccggatttggcagatcaggccacatgagcagccaggttaggacaccgggg
cttcaagctttggaactgcaggctccaaacccctctggagactttgaagtatctgctagt
aataagctggtgtctcatcagtccagtattggtgccatgtgtctggccaccttctactgt
gtagtgtggaggattcccacagggtga