GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:05:56 Sequence gi568815577f:38710035_38922886 : 212852 bp : 43.57% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 12899 13073 175 2 1 51 38 160 0.349 4.53 1.02 PlyA + 14049 14054 6 1.05 2.07 PlyA - 16111 16106 6 -0.45 2.06 Term - 17396 17310 87 2 0 49 53 105 0.312 0.76 2.05 Intr - 28527 28276 252 0 0 16 100 89 0.145 0.53 2.04 Intr - 30380 30282 99 2 0 90 95 18 0.548 3.01 2.03 Intr - 31338 31287 52 2 1 76 89 49 0.574 2.71 2.02 Intr - 42461 42173 289 0 1 52 75 198 0.725 11.20 2.01 Init - 53352 53286 67 0 1 66 93 99 0.314 9.43 2.00 Prom - 56192 56153 40 -5.36 3.00 Prom + 57613 57652 40 -4.16 3.01 Init + 66797 66845 49 1 1 51 82 34 0.163 0.21 3.02 Intr + 76323 76392 70 1 1 102 24 99 0.024 3.14 3.03 Intr + 79654 79807 154 1 1 78 94 17 0.012 1.37 3.04 Intr + 85250 85350 101 1 2 83 60 49 0.114 0.51 3.05 Intr + 86840 86882 43 2 1 148 52 2 0.555 0.84 3.06 Intr + 88903 89019 117 0 0 29 80 79 0.511 1.86 3.07 Intr + 95871 96086 216 2 0 81 90 84 0.613 6.60 3.08 Intr + 100001 100072 72 1 0 111 113 62 0.978 10.60 3.09 Intr + 102969 103080 112 2 1 84 46 66 0.971 1.95 3.10 Intr + 104239 104358 120 2 0 64 89 87 0.806 6.87 3.11 Intr + 104747 104947 201 0 0 93 76 201 0.999 18.66 3.12 Intr + 106974 107057 84 1 0 98 68 14 0.532 0.19 3.13 Intr + 108391 108612 222 2 0 96 99 186 0.996 18.60 3.14 Intr + 109469 109732 264 0 0 50 89 294 0.992 23.18 3.15 Intr + 111552 111670 119 1 2 135 64 114 0.998 13.88 3.16 Term + 112640 112855 216 1 0 77 51 474 0.999 39.74 3.17 PlyA + 114844 114849 6 1.05 4.00 Prom + 118646 118685 40 -6.26 4.01 Init + 119040 119042 3 2 0 74 89 0 0.456 -1.30 4.02 Term + 119572 120684 1113 0 0 86 43 347 0.433 22.05 4.03 PlyA + 125559 125564 6 1.05 5.06 PlyA - 127313 127308 6 1.05 5.05 Term - 128538 128105 434 1 2 78 42 71 0.193 -3.04 5.04 Intr - 129204 129048 157 0 1 28 66 136 0.402 4.98 5.03 Intr - 159364 159213 152 2 2 104 63 14 0.029 0.38 5.02 Intr - 166887 166792 96 1 0 62 97 37 0.320 1.98 5.01 Init - 167344 167278 67 1 1 35 87 45 0.366 0.33 5.00 Prom - 174294 174255 40 -4.16 6.05 PlyA - 176137 176132 6 1.05 6.04 Term - 184991 184882 110 0 2 120 49 73 0.556 5.27 6.03 Intr - 191931 191867 65 2 2 114 92 37 0.300 5.06 6.02 Intr - 200656 200523 134 1 2 87 68 111 0.648 8.44 6.01 Intr - 205065 204957 109 2 1 84 87 20 0.339 1.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 43056 43017 40 0 1 72 64 62 0.918 2.66 S.002 Term + 197237 197535 299 0 2 71 53 170 0.808 7.23 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:38710035_38922886|GENSCAN_predicted_peptide_1|58_aa XVYSQTPRKTSSQHHGLGRREDRGRPLKTPSDRYIYPPPHLADQKASRQQRPRLHVCF >gi568815577f:38710035_38922886|GENSCAN_predicted_CDS_1|177_bp nnggtctacagccaaacacctaggaagaccagcagtcagcatcacgggctcggcagaaga gaggacagaggacgcccgctgaaaaccccgagcgaccgctacatctacccaccaccacat ctggctgatcaaaaggcctccagacagcaacggccacggcttcacgtctgcttttaa >gi568815577f:38710035_38922886|GENSCAN_predicted_peptide_2|281_aa MDSKAAKDWLTTKDLLEVTNVKERRGGKNANNLATRLFISNVVAFIPRIVLKDENWMERR YSAKVEEQGSEQRETTQPKGSVISCHAQPSSIQSMQTDAYPELSQTPGGKLPFKVLMSRD FKKLLKNPRGRWELLMSSNSLQYTYSFLGRSSCADRLQREGNCFTFIFLKAKEEKGTSYM AAGKRACGGELLFIKPSDLMRLIHYHGKSMTEIAPVIQLIRLIHYHENSVREIASVIQLS PPALDTRELLKFKEAPSEFADELDVRSERKREIKDDFKVST >gi568815577f:38710035_38922886|GENSCAN_predicted_CDS_2|846_bp atggacagcaaagcagccaaggactggctgaccactaaggacctgctggaggtaacaaat gtgaaagaaaggagaggaggcaaaaatgcaaacaacttggcaaccaggttgttcatatcc aacgtcgtggctttcatcccaagaatcgttctcaaagatgagaactggatggagaggcgc tattccgctaaggtggaggaacagggctcggaacagagagaaaccacacagcccaagggc agtgtcatctcctgccacgcacagccttcctccatccaatccatgcagactgacgcctac cctgagctcagccagaccccgggagggaagctgcccttcaaagtgctcatgtccagggac tttaagaagctgctgaagaacccaagaggaaggtgggagctgttgatgagcagtaatagt ttacaatatacctacagtttcctgggcagatcatcgtgtgcagaccggctgcagagagaa gggaactgtttcactttcatcttcctgaaggcaaaggaggagaaaggcacgtcttacatg gcagcaggcaagagagcttgtggaggggaactcctatttataaaaccatcagatctcatg agacttattcactaccacgggaagagtatgacagaaattgcccctgtgattcaattaata agacttattcactatcacgagaacagtgtgagagaaattgcctccgtgattcaattatct ccacctgcccttgacacacgggaattattaaaatttaaggaagcaccaagtgaatttgct gatgaattggatgtgagaagtgagagaaagagagaaatcaaggatgactttaaggtttcg acgtga >gi568815577f:38710035_38922886|GENSCAN_predicted_peptide_3|719_aa MLYCLKMKGGPEPKDAVCFWMLAVIQKHLLSTSYLLNEPTRLAGRLRCTWLLEVRSSHKR TNTSFKGLGVCEQTWTSGLVEGLPGSILVAQGEKPSGNFEERGGVIGLTSPKATLAAVCR VDSRRSLLVMRLDLFPGALDEKKETRRHDLLYLVKHSCYQETFTLRVEKGLPASRALWSA PRGGRPGYFLQRLTSAVSLQLRAPGAARPASGLPDRLWPAPSPSPGAHRAAAGAEQPPSR PSAGPARSGRMNDFGIKNMDQVAPVANSYRGTLKRQPAFDTFDGSLFAVFPSLNEEQTLQ EVPTGLDSISHDSANCELPLLTPCSKAVMSQALKATFSGFKKEQRRLGIPKNPWLWSEQQ VCQWLLWATNEFSLVNVNLQRFGMNGQMLCNLGKERFLELAPDFVGDILWEHLEQMIKEN QEKTEDQYEENSHLTSVPHWINSNTLGFGTEQAPYGMQTQNYPKGGLLDSMCPASTPSVL SSEQEFQMFPKSRLSSVSVTYCSVSQDFPGSNLNLLTNNSGTPKDHDSPENGADSFESSD SLLQSWNSQSSLLDVQRVPSFESFEDDCSQSLCLNKPTMSFKDYIQERSDPVEQGKPVIP AAVLAGFTGSGPIQLWQFLLELLSDKSCQSFISWTGDGWEFKLADPDEVARRWGKRKNKP KMNYEKLSRGLRYYYDKNIIHKTSGKRYVYRFVCDLQNLLGFTPEELHAILGVQPDTED >gi568815577f:38710035_38922886|GENSCAN_predicted_CDS_3|2160_bp atgctatactgcttgaagatgaaggggggccctgagccaaaggatgcagtctgtttctgg atgctggcagtcattcagaaacatttgctgagcacctcgtacctgctgaatgaacccacg aggttggctggaaggctcagatgcacatggctgctggaagtgaggtcctcccacaagaga accaacacgtccttcaagggtctcggggtctgtgaacagacatggacatcaggattggtt gaggggcttccaggttcaatcctggtggcacagggtgagaaaccttctgggaattttgag gagagaggtggcgtcataggattaacttctccaaaggctacattggctgctgtgtgcagg gtggattctaggaggtcacttctcgtgatgagattggacttgtttcctggggctctggat gaaaagaaagaaacgagaaggcacgacttgctgtatctagtcaaacattcttgctaccaa gaaacattcacgctccgagtggagaaggggctgcctgcatctcgtgccctctggagcgcg ccgcgtgggggacggcccggttacttcctccagagactgacgagtgcggtgtcgctccag ctcagagctcccggagccgcccggccagcgtccggcctccctgatcgtctctggccggcg ccctcgccctcgcccggcgcgcaccgagcagccgcgggcgccgagcagccaccgtcccga ccaagcgccggccctgcccgcagcggcaggatgaatgatttcggaatcaagaatatggac caggtagcccctgtggctaacagttacagagggacactcaagcgccagccagcctttgac acctttgatgggtccctgtttgctgtttttccttctctaaatgaagagcaaacactgcaa gaagtgccaacaggcttggattccatttctcatgactccgccaactgtgaattgcctttg ttaaccccgtgcagcaaggctgtgatgagtcaagccttaaaagctaccttcagtggcttc aaaaaggaacagcggcgcctgggcattccaaagaacccctggctgtggagtgagcaacag gtatgccagtggcttctctgggccaccaatgagttcagtctggtgaacgtgaatctgcag aggttcggcatgaatggccagatgctgtgtaaccttggcaaggaacgctttctggagctg gcacctgactttgtgggtgacattctctgggaacatctggagcaaatgatcaaagaaaac caagaaaagacagaagatcaatatgaagaaaattcacacctcacctccgttcctcattgg attaacagcaatacattaggttttggcacagagcaggcgccctatggaatgcagacacag aattaccccaaaggcggcctcctggacagcatgtgtccggcctccacacccagcgtactc agctctgagcaggagtttcagatgttccccaagtctcggctcagctccgtcagcgtcacc tactgctctgtcagtcaggacttcccaggcagcaacttgaatttgctcaccaacaattct gggactcccaaagaccacgactcccctgagaacggtgcggacagcttcgagagctcagac tccctcctccagtcctggaacagccagtcgtccttgctggatgtgcaacgggttccttcc ttcgagagcttcgaagatgactgcagccagtctctctgcctcaataagccaaccatgtct ttcaaggattacatccaagagaggagtgacccagtggagcaaggcaaaccagttatacct gcagctgtgctggccggcttcacaggaagtggacctattcagctgtggcagtttctcctg gagctgctatcagacaaatcctgccagtcattcatcagctggactggagacggatgggag tttaagctcgccgaccccgatgaggtggcccgccggtggggaaagaggaaaaataagccc aagatgaactacgagaagctgagccggggcttacgctactattacgacaagaacatcatc cacaagacgtcggggaagcgctacgtgtaccgcttcgtgtgcgacctccagaacttgctg gggttcacgcccgaggaactgcacgccatcctgggcgtccagcccgacacggaggactga >gi568815577f:38710035_38922886|GENSCAN_predicted_peptide_4|371_aa MVCQRSGSRKKENFRPISLMNIDAKILSKILANQIQQHIKNLIHHDQVGFIPGMQGWFNI RKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDK PTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVK LSLFADDMTVYPENPIVSAQNLLKLRSNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSE LPFTIASKRIKYLGIQLIRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMA ILPKVIYRFNAIPIKLPMPFFTELEKTTLKFIWNQKRARITKSILSQKNKAGGITLPDFK LHYKATVTKTA >gi568815577f:38710035_38922886|GENSCAN_predicted_CDS_4|1116_bp atggtttgtcaaagatcaggtagccgtaaaaaagagaattttagaccaatatccttgatg aacattgatgcaaaaatcctcagtaaaatactggcaaaccaaatccagcagcacatcaaa aaccttatccaccatgatcaagtgggcttcatccctgggatgcaaggctggttcaatata cgcaaatcaataaatgtaatccagcatataaacagagccaaagacaaaaaccacatgatt atctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaaact ctcaataaattaggtatcgatggaacatatttcaaaataataagagctatctatgacaaa cccacagctaatatcatactgaatgggcaaaaactggaagcattccctttgaaaactggc acaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctggcc agggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaa ttgtccctgtttgcagatgacatgactgtatatccagaaaaccccattgtctcagcccaa aatctccttaagctgagaagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaa aaatcacaagcattcctatacaccaacaacagacaaacagagagccaaatcatgagtgaa ctcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttataagggat gtgaaggacctcttcaaggagaactacaaaccgctgctcaaggaaataaaagaggataca aacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtgaaaatggcc atactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatgcctttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatc accaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttcaaa ctacactacaaggctacagtaaccaaaacagcatga >gi568815577f:38710035_38922886|GENSCAN_predicted_peptide_5|301_aa MAVDICKTLLDGIPEIGAPTIEVRDGQPRKCPPKGRPVPWIVQEEDNTKARLFLVLPSPS NVHKESQSEQMSQGQLRGPKSHSRGNCYCKNKGNLRRVCFPQTLLVGDKDQGQGLLHTHR MQAEVWLHMKHAQVFGKRSTLAVCRHELEEVIVKWGPRNAPVPSTSPDPALYLSKDSCCS TSHPAVGITWEQPGVSAPAARPDNLNLLSERSPSDSSTPSCVKSIGGSSYSFISVKLRDH LLQRAFLDVPTLPSRLQRQLEHGAQLQQCAVATLVGTSSSSFFRSQDSSSLGSDDTTSSS F >gi568815577f:38710035_38922886|GENSCAN_predicted_CDS_5|906_bp atggctgtagacatatgtaagacattattggatggtatacctgagattggagcacctaca atagaagtgagggatggacagcccaggaagtgccctccaaagggccggcctgtcccatgg attgtccaggaggaggacaacacaaaggcgaggttgtttctcgtgcttccttccccaagt aatgttcacaaagagtcccaaagtgagcagatgtcccagggccaactgaggggccccaag tcacactcaagagggaactgctattgcaagaacaaaggaaatctgcgcagggtttgcttc ccccaaacgctcctggtaggagacaaggaccaaggtcaagggctgctgcacactcaccgc atgcaggccgaggtctggcttcatatgaagcatgctcaggtctttggtaaaaggtccact ctggctgtgtgcagacatgaattggaggaagtaatagtgaagtgggggcctagaaatgct cctgtcccttcaacttccccggatcctgccctgtatttatcgaaggattcttgctgttcc acatctcatcctgctgtaggcatcacctgggagcaacccggggtctcagctcctgctgca agacctgataatctgaacctgcttagtgaaagatctccaagtgattcatctaccccttca tgtgtgaaaagcattggaggtagctcctactcattcatcagtgttaagctcagggaccac ctcctccagagagccttccttgatgttcccaccctccccagccggctgcagagacagttg gagcatggggcacagctccagcaatgcgccgtggccaccttagtaggcacctctagctct agtttcttcaggtcccaagacagttcttcactgggctcagatgacaccacatcctcctcc ttttag >gi568815577f:38710035_38922886|GENSCAN_predicted_peptide_6|139_aa XTGKESGGAAAPGRLGPPLLTGTLMPVQLGSLDDRQKVASGFLPQAKCVASESDDLLLLE MAGAILNVSGRFLQIPIDSGLSPLNMKMTRMKTFMMIHSHLLNRSLWNWRSDMAFHGCHR GATMHRTSWASIAYIRPVE >gi568815577f:38710035_38922886|GENSCAN_predicted_CDS_6|420_bp nncacaggaaaagagagtggaggggctgctgccccaggaaggctgggcccacccctcctc acaggcaccctgatgcctgttcagcttggctctctggatgacaggcagaaggtagcctcg ggctttcttccacaagcgaagtgtgtggccagtgagtcagatgacctgctccttctggag atggctggtgccattctcaatgtcagtggtcgcttcttacagatcccgatagattcaggc ctcagcccactcaacatgaagatgacaaggatgaagacctttatgatgatccattcccac ttactgaatagatctctgtggaactggagatcagacatggccttccatgggtgtcacagg ggtgccaccatgcacagaacttcctgggccagcatcgcctacatcagacctgtggagtga