GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:47:09 Sequence gi568815581f:44457759_44659383 : 201625 bp : 46.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 17145 17071 75 0 0 82 103 63 0.681 6.91 1.01 Init - 45612 45568 45 0 0 104 97 109 0.769 14.08 1.00 Prom - 51414 51375 40 -3.76 2.00 Prom + 56370 56409 40 -5.56 2.01 Init + 58423 58477 55 0 1 81 110 16 0.480 4.73 2.02 Intr + 65403 65485 83 1 2 49 70 73 0.095 0.96 2.03 Intr + 80042 80096 55 1 1 70 73 58 0.108 1.05 2.04 Intr + 84460 84543 84 2 0 120 94 -21 0.586 1.49 2.05 Term + 87997 88106 110 2 2 24 54 142 0.768 3.07 2.06 PlyA + 88110 88115 6 1.05 3.00 Prom + 99348 99387 40 -5.56 3.01 Sngl + 99931 101628 1698 0 0 96 55 3536 0.971 344.17 3.02 PlyA + 101711 101716 6 1.05 4.03 PlyA - 102777 102772 6 1.05 4.02 Term - 108636 108509 128 1 2 112 41 138 0.815 9.94 4.01 Init - 112971 112965 7 0 1 36 121 0 0.154 -0.73 4.00 Prom - 123178 123139 40 -3.76 5.00 Prom + 124813 124852 40 -5.46 5.01 Init + 138574 138648 75 0 0 110 57 64 0.786 6.59 5.02 Intr + 142403 142537 135 1 0 121 89 -28 0.597 1.36 5.03 Intr + 144102 144230 129 2 0 64 25 116 0.532 3.79 5.04 Intr + 157332 157463 132 2 0 57 115 56 0.230 6.04 5.05 Term + 171293 171358 66 1 0 57 48 98 0.343 0.64 5.06 PlyA + 171536 171541 6 1.05 6.00 Prom + 181195 181234 40 -1.66 6.01 Sngl + 187653 187832 180 2 0 85 53 292 0.909 20.40 6.02 PlyA + 188634 188639 6 1.05 7.00 Prom + 196197 196236 40 -4.56 7.01 Init + 198561 198656 96 2 0 71 97 138 0.996 11.31 7.02 Term + 199369 199512 144 0 0 78 39 165 0.935 8.51 7.03 PlyA + 201332 201337 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:44457759_44659383|GENSCAN_predicted_peptide_1|40_aa MADRFSRFNEDRDFQGNHFDQYEEGHLEIEQASLDKPIES >gi568815581f:44457759_44659383|GENSCAN_predicted_CDS_1|120_bp atggcggaccgcttctcccgcttcaacgaagaccgagactttcagggtaatcactttgat cagtatgaggaaggacacttggaaattgaacaagcgtcacttgacaagcctatagaatcg >gi568815581f:44457759_44659383|GENSCAN_predicted_peptide_2|128_aa MAGKVLVLIEFMFWLGEPGPGDRKRSNLEQVHWSKPSSVRFRLDPQASWVLGLTDFKNEA VDLRRFPHWYTAWHPWALTALADSHQSPRLGPGHMNTEKVAIYKPGRKPSPESNHAGIPI SDFQLPEL >gi568815581f:44457759_44659383|GENSCAN_predicted_CDS_2|387_bp atggcaggaaaggttcttgttctcattgagtttatgttctggctgggggagccaggtcct ggggaccgaaagaggtccaacttggagcaggtacattggtctaagcctagttctgttcgg ttccggctcgacccccaagcttcttgggttcttggtctcacagacttcaagaatgaagcc gtggaccttcgcaggttcccccactggtacactgcctggcatccctgggcactgacagcc ctggctgactcacatcaaagccccagactgggtccaggccacatgaacactgagaaggtg gccatctacaagccaggaaggaagccttcgccagaatccaaccatgctggcatcccgatc tcagactttcagcttccagaactatga >gi568815581f:44457759_44659383|GENSCAN_predicted_peptide_3|565_aa MRPRSALPRLLLPLLLLPAAGPAQFHGEKGISIPDHGFCQPISIPLCTDIAYNQTIMPNL LGHTNQEDAGLEVHQFYPLVKVQCSPELRFFLCSMYAPVCTVLEQAIPPCRSICERARQG CEALMNKFGFQWPERLRCEHFPRHGAEQICVGQNHSEDGAPALLTTAPPPGLQPGAGGTP GGPGGGGAPPRYATLEHPFHCPRVLKVPSYLSYKFLGERDCAAPCEPARPDGSMFFSQEE TRFARLWILTWSVLCCASTFFTVTTYLVDMQRFRYPERPIIFLSGCYTMVSVAYIAGFVL QERVVCNERFSEDGYRTVVQGTKKEGCTILFMMLYFFSMASSIWWVILSLTWFLAAGMKW GHEAIEANSQYFHLAAWAVPAVKTITILAMGQIDGDLLSGVCFVGLNSLDPLRGFVLAPL FVYLFIGTSFLLAGFVSLFRIRTIMKHDGTKTEKLERLMVRIGVFSVLYTVPATIVIACY FYEQAFREHWERSWVSQHCKSLAIPCPAHYTPRMSPDFTVYMIKYLMTLIVGITSGFWIW SGKTLHSWRKFYTRLTNSRHGETTV >gi568815581f:44457759_44659383|GENSCAN_predicted_CDS_3|1698_bp atgcggccccgcagcgccctgccccgcctgctgctgccgctgctgctgctgcccgccgcc gggccggcccagttccacggggagaagggcatctccatcccggaccacggcttctgccag cccatctccatcccgctgtgcacggacatcgcctacaaccagaccatcatgcccaacctt ctgggccacacgaaccaggaggacgcaggcctagaggtgcaccagttctatccgctggtg aaggtgcagtgctcgcccgaactgcgcttcttcctgtgctccatgtacgcacccgtgtgc accgtgctggaacaggccatcccgccgtgccgctctatctgtgagcgcgcgcgccagggc tgcgaagccctcatgaacaagttcggttttcagtggcccgagcgcctgcgctgcgagcac ttcccgcgccacggcgccgagcagatctgcgtcggccagaaccactccgaggacggagct cccgcgctactcaccaccgcgccgccgccgggactgcagccgggtgccgggggcaccccg ggtggcccgggcggcggcggcgctcccccgcgctacgccacgctggagcaccccttccac tgcccgcgcgtcctcaaggtgccatcctatctcagctacaagtttctgggcgagcgtgat tgtgctgcgccctgcgaacctgcgcggcccgatggttccatgttcttctcacaggaggag acgcgtttcgcgcgcctctggatcctcacctggtcggtgctgtgctgcgcttccaccttc ttcactgtcaccacgtacttggtagacatgcagcgcttccgctacccagagcggcctatc atttttctgtcgggctgctacaccatggtgtcggtggcctacatcgcgggcttcgtgctc caggagcgcgtggtgtgcaacgagcgcttctccgaggacggttaccgcacggtggtgcag ggcaccaagaaggagggctgcaccatcctcttcatgatgctctacttcttcagcatggcc agctccatctggtgggtcatcctgtcgctcacctggttcctggcagccggcatgaagtgg ggccacgaggccatcgaggccaactctcagtacttccacctggccgcctgggccgtgccg gccgtcaagaccatcaccatcctggccatgggccagatcgacggcgacctgctgagcggc gtgtgcttcgtaggcctcaacagcctggacccgctgcggggcttcgtgctagcgccgctc ttcgtgtacctgttcatcggcacgtccttcctcctggccggcttcgtgtcgctcttccgc atccgcaccatcatgaagcacgacggcaccaagaccgaaaagctggagcggctcatggtg cgcatcggcgtcttctccgtgctctacacagtgcccgccaccatcgtcatcgcttgctac ttctacgagcaggccttccgcgagcactgggagcgctcgtgggtgagccagcactgcaag agcctggccatcccgtgcccggcgcactacacgccgcgcatgtcgcccgacttcacggtc tacatgatcaaatacctcatgacgctcatcgtgggcatcacgtcgggcttctggatctgg tcgggcaagacgctgcactcgtggaggaagttctacactcgcctcaccaacagccgacac ggtgagaccaccgtgtga >gi568815581f:44457759_44659383|GENSCAN_predicted_peptide_4|44_aa MEAQAPMRARSRQRPSPTAKSGREKAVVVAKQGKRMGASCRLRQ >gi568815581f:44457759_44659383|GENSCAN_predicted_CDS_4|135_bp atggaagctcaggcgcccatgcgtgcccgcagcaggcaaaggccaagcccaacagccaag agtgggcgggagaaagccgtggtggtagccaaacagggcaaacggatgggcgcttcttgc cggctgcgccaataa >gi568815581f:44457759_44659383|GENSCAN_predicted_peptide_5|178_aa MENVKEKKRKVLAVPEIVEKRNFSEPFPLLLLSGSLGMAPLPGSVTEKGGHAVVGDVPGP LKPKIGWVNVRYSEEGKREHVDVGNNVLRVNEPVRIEPASAGTRHPCGEYMWGGRNQDCD APCCFPNTRAKVLTTYRWREEKEREEERKGGERKRRKESQAKPLDVEEEHVSGGTHIG >gi568815581f:44457759_44659383|GENSCAN_predicted_CDS_5|537_bp atggagaatgtcaaagagaagaaaaggaaggttctggctgtgccagaaatcgttgagaaa aggaatttctcagagccctttcctctcctccttctgtctgggagtctgggaatggctcct ttacctgggagtgtcacagagaaggggggacatgcggtggttggggatgttcctgggcct cttaagcccaagatcgggtgggtgaatgtgcggtacagtgaagaggggaaacgggaacac gtggatgttgggaataacgtgttacgtgtgaatgagccagttcgcattgagcctgcaagc gcaggcacccggcacccatgtggggagtacatgtggggtggcaggaaccaagactgtgat gctccctgctgcttccctaacacccgtgcaaaggtgcttactacttaccgatggagggag gaaaaggagagggaggaagaaagaaaaggaggagagagaaagagaagaaaggaaagccaa gccaagccgctggatgttgaggaggagcacgtcagcggaggcacacacattggctag >gi568815581f:44457759_44659383|GENSCAN_predicted_peptide_6|59_aa MRQLKGKPRKETWKDKKEQKQAMQEARQQITTVVLPTLAAVVLLIVVFVYAATRPTITK >gi568815581f:44457759_44659383|GENSCAN_predicted_CDS_6|180_bp atgaggcagctcaaagggaagcccaggaaggagacatggaaggacaagaaggagcagaag caagccatgcaggaggcccggcagcagatcaccacggtggtgctgcccacgctggccgcg gtcgtgctgctgattgtggtgtttgtgtatgcagccacacgccccaccatcaccaagtga >gi568815581f:44457759_44659383|GENSCAN_predicted_peptide_7|79_aa MAGCETGGAGLTAGRGAPVPCRVPSREPQAGQPKVAFPGGANRCWNLGADAGSRLTDVFG SVMLTGSASFYDCYTSQVL >gi568815581f:44457759_44659383|GENSCAN_predicted_CDS_7|240_bp atggcgggctgcgaaacgggaggggcgggcctcacagcggggcggggcgcgcctgtgccg tgtcgcgtgccgagccgcgagccccaggcaggccagcccaaagtcgcgttccccggaggt gcgaatcgctgttggaacctcggcgccgacgccggcagcaggttaaccgacgtcttcggc agcgtgatgttgactggctccgcttccttctacgattgctacacatcgcaggtcctttag