GENSCAN 1.0 Date run: 8-Nov-116 Time: 12:41:57 Sequence gi568815589r:96221245_96483573 : 262329 bp : 43.42% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 433 428 6 1.05 1.02 Term - 1442 1333 110 0 2 102 50 69 0.181 3.17 1.01 Init - 8099 8027 73 2 1 67 27 85 0.230 1.56 1.00 Prom - 8267 8228 40 -4.56 2.12 PlyA - 10572 10567 6 1.05 2.11 Term - 11693 11493 201 2 0 102 46 58 0.651 0.39 2.10 Intr - 14326 14225 102 1 0 98 77 95 0.929 9.77 2.09 Intr - 23150 23085 66 2 0 100 94 86 0.923 9.50 2.08 Intr - 25346 25312 35 0 2 93 116 0 0.235 1.24 2.07 Intr - 31666 31559 108 2 0 75 95 85 0.479 8.16 2.06 Intr - 33664 33624 41 0 2 21 109 66 0.005 -0.13 2.05 Intr - 58189 58127 63 0 0 100 95 61 0.015 5.83 2.04 Intr - 65808 65732 77 0 2 78 1 123 0.018 0.91 2.03 Intr - 68323 68234 90 1 0 84 59 40 0.409 0.89 2.02 Intr - 77218 77172 47 2 2 79 99 51 0.934 3.43 2.01 Init - 80860 80707 154 1 1 110 110 146 0.994 17.18 2.00 Prom - 82971 82932 40 -5.66 3.13 PlyA - 84598 84593 6 1.05 3.12 Term - 93655 93536 120 1 0 120 38 45 0.534 1.17 3.11 Intr - 98933 98841 93 2 0 120 13 94 0.670 5.26 3.10 Intr - 100836 100754 83 1 2 130 97 -49 0.204 -0.44 3.09 Intr - 102925 102847 79 1 1 84 113 28 0.164 4.02 3.08 Intr - 122165 122028 138 2 0 74 85 13 0.340 0.26 3.07 Intr - 122752 122660 93 1 0 89 99 52 0.651 6.56 3.06 Intr - 124157 124055 103 1 1 109 76 20 0.417 3.08 3.05 Intr - 129927 129859 69 2 0 82 86 89 0.390 6.40 3.04 Intr - 132749 132703 47 2 2 77 39 45 0.125 -4.29 3.03 Intr - 143306 143220 87 2 0 107 115 17 0.326 6.47 3.02 Intr - 147061 147028 34 0 1 91 109 7 0.201 1.33 3.01 Init - 162390 162233 158 0 2 70 76 418 0.649 36.18 3.00 Prom - 164632 164593 40 -4.46 4.05 PlyA - 164721 164716 6 1.05 4.04 Term - 167215 166993 223 0 1 78 33 299 0.999 19.79 4.03 Intr - 171314 171154 161 2 2 32 55 152 0.025 5.09 4.02 Intr - 173698 173579 120 1 0 72 64 91 0.025 5.79 4.01 Init - 196788 196369 420 0 0 67 105 737 0.908 69.69 4.00 Prom - 206565 206526 40 -3.76 5.00 Prom + 210867 210906 40 -3.86 5.01 Init + 229036 229384 349 0 1 92 75 649 0.977 59.35 5.02 Intr + 237135 237297 163 1 1 84 53 128 0.902 7.93 5.03 Intr + 244093 244254 162 1 0 90 101 27 0.757 3.29 5.04 Intr + 244466 244534 69 2 0 92 96 26 0.660 2.10 5.05 Intr + 245524 245563 40 1 1 85 75 77 0.829 4.33 5.06 Term + 251570 251722 153 1 0 96 43 76 0.721 1.82 5.07 PlyA + 252969 252974 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 65808 65728 81 0 0 78 48 122 0.908 4.89 S.002 Intr - 171292 171154 139 2 1 37 55 165 0.965 7.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:96221245_96483573|GENSCAN_predicted_peptide_1|60_aa MLRNIRQGTALPLPPTNTELSGPKWTSLPTLNPICPHLLLAVDAGIAKGEGDATLRVSVL >gi568815589r:96221245_96483573|GENSCAN_predicted_CDS_1|183_bp atgcttcggaacatccgacagggcacagcactgcccctacccccaaccaacacagaatta tctggacccaaatggacttccttgcctactctcaatcccatatgtccacatttgctgctg gctgtggatgcaggaatagccaagggagagggagatgctacactaagggtgtcggtgctg tag >gi568815589r:96221245_96483573|GENSCAN_predicted_peptide_2|327_aa MGDVLEQFFILTGLLVCLACLAKCVRFSRCVLLNYWKVLPKSFLRSMGQWAVITGAGDGI GKAYSFEEADEVNWETKVLNSLAILLMDSKTQKWRLEERGPDPDPKRVFSDLIKERIQGK SIKCLDKLFQPITNQEIFESTYDLRTLEKLEAIATEIERTTGRSVKIIQADFTKDDIYEH IKEKLAGLEIGILDDTANSETYGIKAFVCAFSKALQEEYKAKEVIIQAGFLSLIPAWAFY SGAFQRLLLTHYVAYLKLNTKPSICATSQEAEGRRVPFGWHWEQENEVIMGCCSKKYWQL LLGRLPGVSSLSCSCGWEPEHPTSKTL >gi568815589r:96221245_96483573|GENSCAN_predicted_CDS_2|984_bp atgggggacgtcctggaacagttcttcatcctcacagggctgctggtgtgcctggcctgc ctggcgaagtgcgtgagattctccagatgtgttttactgaactactggaaagttttgcca aagtctttcttgcggtcaatgggacagtgggcagtgatcactggagcaggcgatggaatt gggaaagcgtactcgttcgaggaggctgatgaagtgaattgggagacaaaggtgcttaat tctttagccatcctgttgatggattccaagactcaaaaatggaggctagaggaaaggggt cctgatccagaccccaagagagtgttctcggacctcatcaaagaaagaattcagggcaag tccataaagtgtttagataaactttttcaaccaattaccaatcaggaaatctttgaatcc acctatgacctccggacgctggaaaaactagaggccattgccacagagatcgagcggact acagggaggagtgtgaagattatacaagcagattttacaaaagatgacatctacgagcat attaaagaaaaacttgcaggcttagaaattggaattttagatgacacagctaattctgaa acatatggaatcaaggcgtttgtgtgcgcattttccaaggccctgcaagaggaatataaa gcaaaagaagtcatcatccaggcgggctttctgagcctgatcccggcctgggccttctac agcggtgccttccaaaggctgctcctgacacactatgtggcatacctgaagctcaacacc aagccttccatctgtgctacctcccaggaagccgaaggccgcagagtccctttcggatgg cactgggagcaggaaaatgaggtgattatgggctgctgctccaagaagtattggcagctg ttgctggggcggctccctggggtgtcatccctttcttgctcttgtggatgggaaccagag caccccacttcaaagactctgtaa >gi568815589r:96221245_96483573|GENSCAN_predicted_peptide_3|367_aa MTAGGQAEAEGAGGEPGAARLPSRVARLLSALFYGTCSFLIVLVNKALLTTYGFPSPIFL GIGQMAATIMILYVSKLNKIIHFPDFDKKIPVKLLSGCSVVDDVKPVLGKQYSLNIILSV FAIILGAFIAAGSDLAFNLEGYIFVFLNDIFTAANGVYTKQKMDPKELGKYGVLFYNACF MIIPTLIISVSTGDLQQLFVPLEYFKTYPSHHAILPLNIPGSTFEFWEGVVLSAFALKPY LLLVSADVLHGSVQLLQFSPDDSSGWSHQECIRCLHWDINRWRLHFLFVKLCRVKYLVCS GAFTTGAFPHMLTYRQPLVYGKEAQMDRLDLVEVLPQVGIAKKGSFSSRFLGNWIQLGED HLNSHPI >gi568815589r:96221245_96483573|GENSCAN_predicted_CDS_3|1104_bp atgacggccggcggccaggccgaggccgagggcgctggcggggagcccggcgcggcgcgg ctgccctcgcgggtggcccggctgctgtcggcgctcttctacgggacctgctccttcctc atcgtgcttgtcaacaaggcgctgctgaccacctacggtttcccgtcaccaattttcctt ggaattggacagatggcagccaccataatgatactatatgtgtccaagctaaacaaaatc attcacttccctgattttgataagaaaattcctgtaaagctgctctctggatgcagtgtg gttgatgatgtgaagcctgtgctggggaagcagtattcactcaacatcatcctcagtgtc tttgccattattctcggggctttcatagcagctgggtctgaccttgcttttaacttagaa ggctatatttttgtattcctgaatgatatcttcacagcagcaaatggagtttataccaaa cagaaaatggacccaaaggagctagggaaatacggagtacttttctacaatgcctgcttc atgattatcccaactcttattattagtgtctccactggagacctgcaacagctttttgtt cctctggagtattttaaaacctatcccagccatcacgccattttacctctaaatattcca ggatccacatttgaattttgggaaggtgttgttctttcagcatttgccctgaaaccgtat ctgctgctggtttctgctgatgtactccacggttctgtgcagctattacaattcagccct gacgacagcagtggttggagccatcaagaatgtatccgttgcctacattgggatattaat cggtggagactacattttctctttgttaaactttgtagggttaaatatttggtttgttca ggagcttttaccacaggtgcatttcctcacatgctcacctaccgccaacccctggtgtat gggaaggaggcccaaatggataggctcgacttggtggaagttctgcctcaggtgggcata gccaaaaaggggtcattctcttctcggttcctggggaattggatccaactgggagaagac cacctgaattctcatcccatctaa >gi568815589r:96221245_96483573|GENSCAN_predicted_peptide_4|307_aa MIRGFEAPMAENPPPPPPPVIFCHDSPKRVLVSVIRTTPIKPTCGGGGEPEPPPPLIPTS PGFSDFMVYPWRWGENAHNVTLSPGAAGAAASAALPAAAAAEHSGLRGRGAPPPAASASA AASGGEDEEEASSPDSGHLKVRGPICVTIQTVEKPLFKVDSSKHISVFTPERNLLFVQKM VLTLCFEGCLSRFTHANRHCPKHPYARLKREEPTDTLSKHQAADNKAAAEWLARYWEMRE QRTPTLKGKLVQKADQEQQDPLEYLQSDEEDDEKRGAQRRLQEQRERLHGALALIELANL TGAPLRQ >gi568815589r:96221245_96483573|GENSCAN_predicted_CDS_4|924_bp atgatccggggcttcgaggcgcccatggcggagaacccgccgccgccgccgccgcccgtc atcttctgccacgactccccgaagcgggtgctggtgtcggtcatcaggacgaccccgatc aagccaacgtgcggcggtggaggggagccggagccgccgccgccgctcatccccaccagc cccggcttcagcgacttcatggtgtacccgtggcgctggggcgagaacgcacacaacgtg acgctcagccctggggccgcgggggccgccgcctcggccgccctgcctgcagccgcagcc gccgagcactcggggcttcgtggccggggcgcgcccccgcccgccgcctcggcctccgcc gccgcctcgggaggtgaggacgaggaggaagcgagcagcccagacagcggccacctcaag gtgagaggccctatctgtgtgactatccagactgtggaaaagcctttgttcaaagtggac agctcaaaacacatcagcgtcttcacaccggagagaaaccttttgtttgttcagaaaatg gtgttaaccttgtgttttgaaggctgcctgagcagattcacccatgcaaaccgccactgt ccgaagcacccctacgccaggctgaagagagaggagcccacggacacactcagcaaacat caggctgccgacaacaaggccgcggccgagtggctggcgaggtattgggaaatgagagag cagcgcacccccactttgaaaggcaagctggttcagaaggctgatcaggagcagcaggac cctctggaataccttcagtctgatgaagaggacgacgagaagagaggggcccagcgccgg ctgcaggagcagcgggagcgcctgcatggagccctcgcgctcatagagcttgccaacctg actggggcgccactccgacagtag >gi568815589r:96221245_96483573|GENSCAN_predicted_peptide_5|311_aa MKGALGSPVAAAGAAMQESFGCVVANRFHQLLDDESDPFDILREAERRRQQQLQRKRRDE AAAAAGAGPRGGRSPAGASGHRAGAGGRRESQKERKSLPAPVAQRPDSPGGGLQAPGQKR TPRRGEQQGWNDSRGPEGMLERAERRSYREYRPYETERQADFTAEKFPDEKPGDRFDRDR PLRGRGGPRGGMRGRGRGGPGNRVFDAFDQRGKREFERYGGNDKIAVRTEDNMGGCGVRT WGSGKDTSWRDDAFLVSAEEESRPGFCPHCPSETVTMSPDLMDAAVCVFLDLSEAFDFID FFVNVLSCWFL >gi568815589r:96221245_96483573|GENSCAN_predicted_CDS_5|936_bp atgaagggcgctctggggagtcccgtggctgccgctggcgccgcgatgcaggagagtttc ggctgcgtggtggccaaccgcttccatcagctgctggacgacgagtcggacccgttcgac atcctgcgcgaggccgagcgccggcgccagcagcagctgcagcgcaagaggcgcgacgag gcggcggcggcggccggggccggtccccgcggcggcaggagcccagccggggcctcgggc cacagagccggcgcgggcggccggagggagtcgcagaaggagcgcaagagcctcccggcg cccgtcgctcagcggcccgatagccccgggggcggcctgcaggcgccgggccagaagcgg actcctagaagaggggagcagcaaggatggaatgacagccgtgggccggaggggatgctc gaaagagctgagcggagatcctacagggaataccgaccctatgagacagagaggcaggca gacttcacagctgagaagtttccagatgaaaaaccaggtgataggtttgatcgagacaga ccgttgagaggacgtggaggcccgagagggggtatgcgcggcagaggcagaggtggccct gggaacagagtttttgacgcttttgaccagagaggaaagcgagaatttgaaagatatggt gggaatgacaaaatagcagtcagaactgaagacaacatgggtggatgtggagttcgaacc tggggatcgggtaaagataccagctggagggatgatgccttcctggtgtctgcagaagaa gagtccaggcctggcttctgtcctcactgtccttctgaaacagtcaccatgtctccagat ctaatggatgctgcagtctgcgtctttcttgacctctcagaagcatttgactttattgat ttcttcgtaaacgtgctctcctgttggtttctgtga