GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:42:22 Sequence gi568815580f:74399974_74620065 : 220092 bp : 46.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 2643 2682 40 -0.66 1.01 Init + 12598 12733 136 0 1 85 83 147 0.744 12.32 1.02 Term + 20167 20267 101 2 2 116 45 63 0.335 3.09 1.03 PlyA + 20556 20561 6 1.05 2.00 Prom + 21626 21665 40 -1.76 2.01 Init + 30152 30187 36 1 0 114 98 19 0.766 5.51 2.02 Intr + 34874 34962 89 1 2 97 86 60 0.796 5.47 2.03 Term + 36869 36983 115 2 1 100 54 128 0.933 8.64 2.04 PlyA + 38366 38371 6 1.05 3.04 PlyA - 39116 39111 6 1.05 3.03 Term - 42143 41973 171 2 0 91 54 103 0.542 5.03 3.02 Intr - 47310 46633 678 0 0 127 98 800 0.781 76.71 3.01 Init - 57286 57089 198 1 0 97 82 304 0.881 27.60 3.00 Prom - 59266 59227 40 -6.36 4.05 PlyA - 59540 59535 6 1.05 4.04 Term - 61144 60899 246 1 0 48 52 189 0.288 7.09 4.03 Intr - 78842 78681 162 2 0 23 94 132 0.722 7.47 4.02 Intr - 87387 87301 87 0 0 129 51 91 0.144 9.67 4.01 Init - 89895 89848 48 0 0 71 37 75 0.707 1.65 4.00 Prom - 92002 91963 40 -7.16 5.00 Prom + 99223 99262 40 -5.66 5.01 Init + 100001 100060 60 1 0 86 116 64 0.856 10.25 5.02 Intr + 101356 101499 144 0 0 72 97 135 0.850 13.28 5.03 Intr + 105876 106038 163 2 1 50 101 262 0.968 23.25 5.04 Intr + 108867 108955 89 1 2 77 85 79 0.990 6.19 5.05 Intr + 110840 111040 201 1 0 89 77 227 0.900 21.18 5.06 Intr + 112475 112559 85 1 1 79 106 64 0.978 6.69 5.07 Intr + 113586 113746 161 1 2 107 47 430 0.933 40.51 5.08 Intr + 116094 116192 99 2 0 34 97 43 0.495 0.11 5.09 Intr + 116255 116423 169 1 1 90 6 228 0.677 14.32 5.10 Intr + 117406 117545 140 2 2 -7 37 99 0.438 -4.52 5.11 Intr + 118526 118667 142 1 1 64 47 153 0.914 8.73 5.12 Intr + 118976 119123 148 0 1 86 94 178 0.998 17.49 5.13 Term + 120026 120095 70 2 1 85 47 115 0.899 4.51 5.14 PlyA + 121124 121129 6 1.05 6.06 PlyA - 123567 123562 6 1.05 6.05 Term - 131369 131140 230 0 2 94 41 94 0.309 2.09 6.04 Intr - 132791 132653 139 2 1 55 64 45 0.275 -1.16 6.03 Intr - 138755 138579 177 2 0 37 86 133 0.699 8.12 6.02 Intr - 141241 141129 113 2 2 124 62 64 0.757 7.50 6.01 Init - 144574 144526 49 1 1 86 58 58 0.375 1.71 6.00 Prom - 153195 153156 40 -3.66 7.00 Prom + 153456 153495 40 -5.36 7.01 Init + 155052 155125 74 2 2 62 86 1 0.365 -2.15 7.02 Intr + 155429 155492 64 2 1 126 37 49 0.681 2.32 7.03 Intr + 156365 156493 129 1 0 57 91 202 0.938 18.29 7.04 Intr + 159350 159499 150 1 0 98 53 251 0.819 22.96 7.05 Intr + 160883 161045 163 1 1 55 73 235 0.944 18.25 7.06 Intr + 162074 162162 89 0 2 85 58 76 0.868 3.99 7.07 Intr + 167260 167460 201 0 0 72 53 194 0.871 13.78 7.08 Intr + 171815 171917 103 1 1 61 111 3 0.390 -0.35 7.09 Intr + 172139 172222 84 0 0 63 70 153 0.915 10.79 7.10 Intr + 176896 177056 161 2 2 112 86 141 0.996 16.01 7.11 Intr + 180157 180298 142 0 1 78 75 88 0.979 6.43 7.12 Intr + 183588 183735 148 1 1 93 76 218 0.953 20.39 7.13 Intr + 197782 197908 127 1 1 -13 101 121 0.412 3.98 7.14 Term + 198570 198689 120 2 0 121 42 57 0.405 2.87 7.15 PlyA + 199360 199365 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 204903 204863 41 0 2 84 101 32 0.853 3.76 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815580f:74399974_74620065|GENSCAN_predicted_peptide_1|78_aa MDLDRLPLPLQQGARCACTAAVLTNSSLSKESEVKVREARPAGTQESMSSLGYPSRAEDD SGLSALPSQPQPFILYAT >gi568815580f:74399974_74620065|GENSCAN_predicted_CDS_1|237_bp atggacttggatcggcttcccctgcccctgcagcagggagctcgctgtgcctgcactgct gcagtactgacaaattcatccctgtccaaggagagtgaagtgaaggtccgtgaggcgaga cctgctggcactcaggaatccatgtcctcacttgggtaccccagcagagcagaggatgac agtggcctgagcgccctgccttctcagccccagcccttcattctctatgccacgtga >gi568815580f:74399974_74620065|GENSCAN_predicted_peptide_2|79_aa MPSLPVTLEEEKNTIAKRVFMSMMRRPSMELCVIWKIPSGDGEHTPETIRFGNMPSLLPL KPFSTMNKRRLREEKPTSV >gi568815580f:74399974_74620065|GENSCAN_predicted_CDS_2|240_bp atgcccagcctgccagtgactttagaggaggaaaagaatacaattgcaaagcgagtcttc atgtccatgatgaggaggccctctatggaactctgtgtcatttggaagatccccagtggt gatggggagcacactccagagaccatccgatttggaaacatgccttcacttctgcccctt aaacccttcagcaccatgaacaaaagacgcctccgagaggagaagccaacatctgtgtga >gi568815580f:74399974_74620065|GENSCAN_predicted_peptide_3|348_aa MARAAGARGPAGWCRRRGRCGRGTLLAFAAWTAGWVLAAALLLRAHPGVLSERCTDEKSR RILAALCQDYQGGTLAGDLCEDLCVAGELLFQRCLHYNRGKKVLQADWRGRPVVLKSKEE AFSSFPPLSLLEEEAGEGGQDMPEAELLLMVAGEVKSALGLELSNSSLGPWWPGRRGPRW RGQLASLWALLQQEEYVYFSLLQDLSPHVLPVLGSCGHFYAVEFLAAGSPHHRALFPLDR APGAPGGGQAKAISDIALSFLDMVNHFDSDFSHRLHLCDIKPENFAIRSDFTVVAIDVDM AFFEPKMREILEQNCTGDEDCNFFDCFSRCDLRVNKCGAQRVNNNLQV >gi568815580f:74399974_74620065|GENSCAN_predicted_CDS_3|1047_bp atggcgcgggcggcgggcgcgcggggccctgccgggtggtgcaggaggcgcgggcgctgc gggcggggcacgctcctcgccttcgccgcgtggaccgcgggctgggtgctggcggccgcg ctgctgctccgcgcgcacccgggtgtcctctccgagcgctgcaccgacgagaagagccgg cgcatcctggccgcgctgtgccaggactaccagggcggcacgctggccggggacctctgc gaggacctgtgtgtggcgggagagctgctgttccaacgctgcctgcactacaacagaggc aagaaggtgctgcaggccgactggcgcggccggcccgtggtcctcaagtccaaggaggag gccttctccagcttcccgcccctcagcctgttggaagaggaggcaggggagggtggccag gacatgcccgaggccgaactcctcctgatggtggctggggaggtcaagagcgctctgggc ctggagttgtccaacagcagcctggggccgtggtggccgggcaggcggggcccacgctgg cggggacagctggccagcctgtgggccctgctgcagcaggaggagtacgtctacttcagc ctgctgcaggacctgagcccacacgtgctgcccgtgctgggttcctgcggccacttctac gcggtggagttcctggccgcgggcagcccccaccacagggcactcttccccctggaccgg gccccaggtgcccctgggggtggccaggccaaggccatcagtgacatcgcactcagcttc ttggacatggtgaaccattttgacagtgacttttcccaccgcctccacctctgcgacatc aagccggaaaactttgccatccggagcgacttcacagtggtggctattgatgtggacatg gccttttttgaacctaaaatgagggaaatccttgagcaaaactgcacaggagatgaagac tgcaatttctttgactgtttttcaagatgtgatttacgagtcaacaaatgcggagcgcag cgcgtaaacaacaacctgcaggtatga >gi568815580f:74399974_74620065|GENSCAN_predicted_peptide_4|180_aa MKFVGTLMSDFPDSRTVDVKLMADASAAVYGQQVSFQFVCPQPQGDIDNLRELSRSMDNL LGSFKEYSHIPSAPESLGDWSVVQQECQDAQAAGVSPSPFLCYGSHPSPSGDPYGHLECK TSPVALGLAFVRAAAAAALALAKPHGAGIVEGTDSDGHAASPFAQNPLESCLMVSLTKEG >gi568815580f:74399974_74620065|GENSCAN_predicted_CDS_4|543_bp atgaaatttgtcggcaccttgatgtcggacttcccagattccagaactgttgatgtgaag ctcatggctgatgccagtgctgctgtctatggacagcaggtgtcttttcaatttgtgtgt ccccagccgcagggggatattgataaccttcgagaattatctcggtccatggacaatctc ttgggaagctttaaagagtacagccatatccccagtgccccagaaagtctaggtgactgg tctgtggtgcagcaagagtgtcaagatgctcaagccgctggggtgtctccaagcccgttc ctctgctatggctctcacccgagtccttctggggatccttatggacatttggagtgcaag acctcacccgtggccctgggactggccttcgtgagggcagcagcagcagcagccctggct ctggcaaagccccatggggcagggatagtggaagggacagacagtgacggacacgcagca tctccctttgcacagaacccactggagagctgcttgatggtctccctcaccaaggagggc tga >gi568815580f:74399974_74620065|GENSCAN_predicted_peptide_5|556_aa MAALTTLFKYIDENQDRYIKKLAKWVAIQSVSAWPEKRGEIRRMMEVAAADVKQLGGSVE LVDIGKQKLPDGSEIPLPPILLGRLGSDPQKKTVCIYGHLDVQPAALEDGWDSEPFTLVE RDGKLYGRGSTDDKGPVAGWINALEAYQKTGQEIPVNVRFCLEGMEESGSEGLDELIFAR KDTFFKDVDYVCISDNYWLGKKKPCITYGLRGICYFFIEVECSNKDLHSGVYGGSVHEAM TDLILLMGSLVDKRGNILIPGINEAVAAVTEEEHKLYDDIDFDIEEFAKDVGAQILLHSH KSHLHLDLLPVVVRLLGQALFHTAHFPDNIPSSSKDILMHRWRYPSLSLHGIEGAFSGSG AKTVIPRKVVGKFSIRLVPNMTPEVVGEQASHLQVEELRCVLNHIAPAQMLHSSQATLSL GVRAHVVKGHRGATTNAVTSYLTKKFAELRSPNEFKVYMGHGGKPWVSDFSHPHYLAGRR AMKTVFGVEPDLTREGGSIPVTLTFQEATGKNVMLLPVGSADDGAHSQNEKLNRYNYIEG TKMLAAYLYEVSQLKD >gi568815580f:74399974_74620065|GENSCAN_predicted_CDS_5|1671_bp atggcggccctcactaccctgtttaagtacatagatgaaaatcaggatcgctacattaag aaactcgcaaaatgggtggctatccagagtgtgtctgcgtggccggagaagagaggcgaa atcaggaggatgatggaagttgctgctgcagatgttaagcagttggggggctctgtggaa ctggtggatatcggaaaacaaaagctccctgatggctcggagatcccgctccctcctatt ctgctcggcaggctgggctccgacccacagaagaagaccgtgtgcatttacgggcacctg gatgtgcagcctgcagccctggaggacggctgggacagcgagcccttcaccctggtggag cgagacggcaagctgtatgggagaggttcgactgatgataagggcccggtggccggctgg ataaacgccctggaagcgtatcagaaaacaggccaggagattcctgtcaacgtccgattc tgcctcgaaggcatggaggagtcaggctctgagggcctagacgagctgatttttgcccgg aaagacacattctttaaggatgtggactatgtctgcatttctgacaattactggctggga aagaagaagccctgcatcacctacggcctcaggggcatttgctactttttcatcgaggtg gagtgcagcaacaaagacctccattctggggtgtacgggggctcggtgcatgaggccatg actgatctcattttgctgatgggctctttggtggacaagagggggaacatcctgatcccc ggcattaacgaggccgtggccgccgtcacggaagaggagcacaagctgtacgacgacatc gactttgacatagaggagtttgccaaggatgtgggggcgcagatcctcctgcacagccac aagtcgcacctgcacctggacctcctgcctgtggttgtcaggctcctgggccaggccctg tttcataccgctcatttcccagacaacattcccagctcctcgaaagacatcctcatgcac cgatggcggtacccgtctctgtccctccatggcatcgaaggcgccttctctgggtctggg gccaagaccgtgattcccaggaaggtggttggcaagttctccatcaggctcgtgccgaac atgactcctgaagtcgtcggcgagcaggcatctcatctgcaggtggaggagcttcggtgc gtgctgaatcacatcgcacctgctcagatgttacattccagccaagccacactgagcctc ggtgtaagagcacatgtggtcaaaggtcacagaggcgcaaccactaatgcggtcacaagc tacctaactaagaagtttgctgaactacgcagccccaatgagttcaaggtgtacatgggc cacggtgggaagccctgggtctccgacttcagtcaccctcattacctggctgggagaaga gccatgaagacagtttttggtgttgagccagacttgaccagggaaggcggcagtattccc gtgaccttgacctttcaggaggccacgggcaagaacgtcatgctgctgcctgtggggtca gcggatgacggagcccactcccagaatgaaaagctcaacaggtataactacatagaggga accaagatgctggccgcgtacctgtatgaggtctcccagctgaaggactag >gi568815580f:74399974_74620065|GENSCAN_predicted_peptide_6|235_aa MGFHHVGLAGLELLTSGASAEPMTEAALTVSPQRVMAPNCSPISPSTFRTYKFPRMPGIT KSDNKCFEDVEKLEPSHTASEDAKWCSQHGKQSGSSSDTDPRSYCINRQFHSQAGLLELQ AQHSSSQEVGLALLPKAKLLGSYSEFRGLERKERVRLGPVWHLSVSGVAQARSALQSHPV EREMDLRQPVLPGGRERSATSAVFTSFHFLLFLNHLDQATATKDLCNDAPRCRSV >gi568815580f:74399974_74620065|GENSCAN_predicted_CDS_6|708_bp atggggtttcaccatgttggcctggctggtctcgaactcctgacctcaggtgcgtcagca gagcccatgaccgaggctgcactcactgtgagcccccagcgggtgatggctcccaactgc agccccatttccccctccactttcagaacctacaagtttccgaggatgccgggaatcaca aagtcagataacaagtgttttgaggatgtggagaaattggagccctcacacactgctagt gaggatgcaaaatggtgcagccagcatggaaagcagtctggcagttcctcagacactgat ccacggagctactgtatcaaccggcaattccactcccaggccgggctccttgagctccag gcccagcacagcagctcccaggaagtgggcctggcccttcttccgaaggccaagttgctg ggatcatactcagaattccgagggctggaaaggaaggagagggtgaggctgggcccagtg tggcatctttctgtatctggtgtagcacaagcacgtagcgccctgcagtcacacccagtg gagagagaaatggacctcaggcagcctgtactcccaggagggagggaaagatctgctacc tcagcggtattcaccagcttccattttctcctcttcctcaaccatctagaccaagctact gccaccaaggatctgtgtaatgacgcaccacgatgccgatcagtctga >gi568815580f:74399974_74620065|GENSCAN_predicted_peptide_7|584_aa MALRLHQPSSPCSPRNPCSPPPTESPPLMARAALGEMLLMAPPDVQAASLLAVLLLLLER GMFSSPSPPPALLEKVFQYIDLHQDEFVQTLKEWVAIESDSVQPVPRFRQELFRMMAVAA DTLQRLGARVASVDMGPQQLPDGQSLPIPPVILAELGSDPTKGTVCFYGHLDVQPADRGD GWLTDPYVLTEVDGKLYGRGATDNKGPVLAWINAVSAFRALEQDLPVNIKFIIEGMEEAG SVALEELVEKEKDRFFSGVDYIVISDNLWISQRKPAITYGTRGNSYFMVETGPSADQPAP CSMWVLGVGDQVCPCSATDHLYDLGCCGPEWHSDADEFPDCPGAVLDFPWIAGSLVDSSG HILVPGIYDEVVPLTEEEINTYKAIHLDLEEYRNSSRVEKFLFDTKVTRHLEDVFSKRNS SNKMVVSMTLGLHPWIANIDDTQYLAAKRAIRTVFGTEPDMIRDGSTIPIAKMFQEIVHK SVVLIPLGAVDDGEHSQNEKINSSKILFFGCGVRRCIHYGVALEATGSPSTEKQDVPPLP PKGKTVPWLRRCYSRLLPLSDAGVGGRRAHKCGGRKHRGGSEGS >gi568815580f:74399974_74620065|GENSCAN_predicted_CDS_7|1755_bp atggcactgagacttcaccagcctagttccccttgttctccccgcaacccctgctcacct ccacctacggaaagcccgcctctgatggctcgggctgcactgggggagatgttgctgatg gcccctccagatgtgcaggctgcgtccctgctggctgtgctgctgctgctgctggagcgc ggcatgttctcctcaccctccccgcccccggcgctgttagagaaagtcttccagtacatt gacctccatcaggatgaatttgtgcagacgctgaaggagtgggtggccatcgagagcgac tctgtccagcctgtgcctcgcttcagacaagagctcttcagaatgatggccgtggctgcg gacacgctgcagcgcctgggggcccgtgtggcctcggtggacatgggtcctcagcagctg cccgatggtcagagtcttccaatacctcccgtcatcctggccgaactggggagcgatccc acgaaaggcaccgtgtgcttctacggccacttggacgtgcagcctgctgaccggggcgat gggtggctcacggacccctatgtgctgacggaggtagacgggaaactttatggacgagga gcgaccgacaacaaaggccctgtcttggcttggatcaatgctgtgagcgccttcagagcc ctggagcaagatcttcctgtgaatatcaaattcatcattgaggggatggaagaggctggc tctgttgccctggaggaacttgtggaaaaagaaaaggaccgattcttctctggtgtggac tacattgtaatttcagataacctgtggatcagccaaaggaagccagcaatcacttacgga acccgggggaacagctacttcatggtggagactggcccttccgcagaccagcctgcacct tgtagcatgtgggttctgggtgtgggtgaccaggtatgtccttgctctgccacagaccac ctgtacgaccttggttgctgtggccccgagtggcacagcgacgccgacgagttccccgat tgtcctggtgctgtccttgatttcccatggattgctggtagcctggtagactcgtctggt catatcctggtccctggaatctatgatgaagtggttcctcttacagaagaggaaataaat acatacaaagccatccatctagacctagaagaataccggaatagcagccgggttgagaaa tttctgttcgatactaaggtgacacgacatcttgaagatgtgttctccaaaagaaatagt tccaacaagatggttgtttccatgactctaggactacacccgtggattgcaaatattgat gacacccagtatctcgcagcaaaaagagcgatcagaacagtgtttggaacagaaccagat atgatccgggatggatccaccattccaattgccaaaatgttccaggagatcgtccacaag agcgtggtgctaattccgctgggagctgttgatgatggagaacattcgcagaatgagaaa atcaacagttccaagattctgtttttcggctgtggcgtccggcgctgcatccactacgga gtcgccttggaagccaccggaagtccttcgacggagaaacaggatgtcccgcctcttcct cccaaagggaagacggtgccctggctgcgccgctgctactcgcgccttcttccgttaagc gacgctggtgtgggcgggcggagggcacacaagtgcggtggccgaaagcaccgcggtggc agcgaagggtcctag