GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:39:53 Sequence gi568815579f:4551870_4752427 : 200558 bp : 54.11% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 Intr - 770 553 218 2 2 79 76 342 0.638 29.93 1.08 Intr - 2607 2519 89 1 2 114 100 114 0.991 15.39 1.07 Intr - 3226 3107 120 2 0 72 100 152 0.954 15.77 1.06 Intr - 3695 3605 91 2 1 74 119 170 0.988 18.77 1.05 Intr - 4220 4119 102 2 0 89 75 231 0.982 22.77 1.04 Intr - 5144 5082 63 2 0 115 105 61 0.994 9.91 1.03 Intr - 5354 5294 61 1 1 100 72 145 0.996 13.33 1.02 Intr - 6280 6157 124 2 1 89 72 229 0.986 21.55 1.01 Init - 6588 6468 121 0 1 89 74 238 0.978 20.81 1.00 Prom - 18345 18306 40 -5.41 2.00 Prom + 18944 18983 40 -2.71 2.01 Init + 23551 23684 134 0 2 58 105 61 0.639 4.39 2.02 Intr + 24488 24524 37 2 1 79 94 26 0.447 1.05 2.03 Intr + 28327 28406 80 0 2 50 80 66 0.463 0.84 2.04 Intr + 28740 28903 164 0 2 46 86 54 0.410 1.13 2.05 Intr + 32448 32577 130 1 1 26 60 113 0.456 2.56 2.06 Intr + 33822 33933 112 0 1 103 42 65 0.194 4.28 2.07 Intr + 38979 39014 36 2 0 144 94 38 0.766 8.94 2.08 Intr + 39909 39925 17 2 2 90 105 1 0.458 -3.18 2.09 Intr + 40857 41028 172 0 1 72 58 100 0.279 5.86 2.10 Intr + 67756 67871 116 0 2 137 84 -1 0.010 4.15 2.11 Intr + 87540 87760 221 0 2 52 71 162 0.621 9.37 2.12 Term + 88828 88946 119 2 2 79 42 106 0.657 4.11 2.13 PlyA + 89489 89494 6 -0.45 3.00 Prom + 90021 90060 40 -0.21 3.01 Sngl + 100001 100561 561 1 0 83 46 1361 0.736 127.92 3.02 PlyA + 102242 102247 6 1.05 4.10 PlyA - 102479 102474 6 1.05 4.09 Term - 103004 102916 89 0 2 104 38 49 0.304 -0.38 4.08 Intr - 104536 104453 84 2 0 20 97 90 0.268 3.39 4.07 Intr - 106459 106374 86 0 2 90 68 56 0.798 3.76 4.06 Intr - 106801 106690 112 2 1 18 91 52 0.800 -1.56 4.05 Intr - 108134 108062 73 2 1 83 96 79 0.985 7.67 4.04 Intr - 108881 108800 82 1 1 156 83 93 0.860 15.74 4.03 Intr - 113068 113007 62 1 2 123 78 39 0.982 4.42 4.02 Intr - 116776 116726 51 1 0 110 105 25 0.978 5.99 4.01 Init - 118465 118292 174 1 0 109 71 291 0.989 26.82 4.00 Prom - 120024 119985 40 -1.71 5.28 PlyA - 121710 121705 6 1.05 5.27 Term - 124787 124695 93 2 0 128 52 131 0.993 11.63 5.26 Intr - 128077 127966 112 0 1 120 94 118 0.917 16.48 5.25 Intr - 130969 130827 143 1 2 127 52 365 0.984 36.46 5.24 Intr - 131760 131608 153 0 0 52 98 319 0.987 30.08 5.23 Intr - 132940 132794 147 1 0 108 47 245 0.757 23.34 5.22 Intr - 133902 133757 146 1 2 96 109 213 0.970 24.71 5.21 Intr - 137023 136888 136 1 1 79 81 267 0.981 25.75 5.20 Intr - 137853 137701 153 0 0 95 75 254 0.943 25.58 5.19 Intr - 138443 138375 69 2 0 104 72 33 0.505 3.17 5.18 Intr - 139088 139009 80 0 2 153 76 31 0.990 8.17 5.17 Intr - 142954 142792 163 1 1 88 105 116 0.945 13.36 5.16 Intr - 143686 143509 178 0 1 113 75 249 0.946 26.54 5.15 Intr - 145782 145682 101 0 2 127 73 128 0.947 14.61 5.14 Intr - 148408 148347 62 2 2 101 116 57 0.995 8.74 5.13 Intr - 150286 150158 129 2 0 41 109 218 0.980 20.37 5.12 Intr - 150847 150734 114 2 0 140 100 170 0.999 24.32 5.11 Intr - 152185 152017 169 1 1 127 81 218 0.999 25.03 5.10 Intr - 152435 152262 174 2 0 120 89 364 0.999 40.35 5.09 Intr - 154101 153989 113 1 2 109 115 124 0.986 17.80 5.08 Intr - 160926 160746 181 0 1 65 25 81 0.385 -0.54 5.07 Intr - 161657 161472 186 2 0 54 64 59 0.261 0.50 5.06 Intr - 162135 162029 107 1 2 85 47 62 0.853 2.23 5.05 Intr - 162468 162212 257 2 2 108 94 495 0.964 49.72 5.04 Intr - 168072 167982 91 1 1 111 100 75 0.514 10.55 5.03 Intr - 181179 181144 36 1 0 124 91 36 0.010 6.22 5.02 Intr - 191307 191170 138 1 0 116 81 41 0.146 7.14 5.01 Init - 196566 196509 58 0 1 84 61 52 0.240 1.52 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 154355 154289 67 2 1 114 44 -10 0.858 -1.49 S.002 Term - 160192 159993 200 2 2 62 45 141 0.890 5.08 S.003 Term - 186490 186448 43 0 1 100 28 132 0.831 5.32 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:4551870_4752427|GENSCAN_predicted_peptide_1|330_aa MQTPRASPPRPALLLLLLLLGGAHGLFPEEPPPLSVAPRDYLNHYPVFVGSGPGRLTPAE GADDLNIQRVLRVNRTLFIGDRDNLYRVELEPPTSTELRYQRKLTWRSNPSDINVCRMKG KQEGECRNFVKVLLLRDESTLFVCGSNAFNPVCANYSIDTLQPVGDNISGMARCPYDPKH ANVALFSDGMLFTATVTDFLAIDAVIYRSLGDRPTLRTVKHDSKWFKEPYFVHAVEWGSH VYFFFREIAMEFNYLEKVVVSRVARVCKNDVGGSPRVLEKQWTSFLKARLNCSVPGDSHF YFNVLQAVTGVVSLGGRPVVLAVFSTPSNS >gi568815579f:4551870_4752427|GENSCAN_predicted_CDS_1|990_bp atgcagaccccgcgagcgtcccctccccgcccggccctgctgcttctgctgctgctactg gggggcgcccacggcctctttcctgaggagccgccgccgcttagcgtggcccccagggac tacctgaaccactatcccgtgtttgtgggcagcgggcccggacgcctgacccccgcagaa ggtgctgacgacctcaacatccagcgagtcctgcgggtcaacaggacgctgttcattggg gacagggacaacctctaccgcgtagagctggagccccccacgtccacggagctgcggtac cagaggaagctgacctggagatctaaccccagcgacataaacgtgtgtcggatgaagggc aaacaggagggcgagtgtcgaaacttcgtaaaggtgctgctccttcgggacgagtccacg ctctttgtgtgcggttccaacgccttcaacccggtgtgcgccaactacagcatagacacc ctgcagcccgtcggagacaacatcagcggtatggcccgctgcccgtacgaccccaagcac gccaatgttgccctcttctctgacgggatgctcttcacagctactgttaccgacttccta gccattgatgctgtcatctaccgcagcctcggggacaggcccaccctgcgcaccgtgaaa catgactccaagtggttcaaagagccttactttgtccatgcggtggagtggggcagccat gtctacttcttcttccgggagattgcgatggagtttaactacctggagaaggtggtggtg tcccgcgtggcccgagtgtgcaagaacgacgtgggaggctccccccgcgtgctggagaag cagtggacgtccttcctgaaggcgcggctcaactgctctgtacccggagactcccatttc tacttcaacgtgctgcaggctgtcacgggcgtggtcagcctcgggggccggcccgtggtc ctggccgttttttccacgcccagcaacagn >gi568815579f:4551870_4752427|GENSCAN_predicted_peptide_2|445_aa MGVEVSGSAKCCWERKTWRMNLKKTKAGNRSQGKGPGVRADHKSCHTGLLGVPPTRQNPK ASEHPSNSKSPFRSSLGVKDDSLSSWGPPNPQAPTAAGGIRTGRPSPPTPLRPLPRGLDP RLVLFWAGQEAGEDSGAQVRVLRAPAATDTFFPVRPGLRPRVRGGPGRPGGGQKAFRVPG MGLIMIVTEDPGGLPGGGGSSILKCQGYSFIGTASVSEYVDLVNPSSFIPKVGKPWSFPE PSTGQKPYTFPRRWKEDSLVLGTEDFDKLVMRTSGQNHVCRGKTQPYPSFLSCLALGAHL PRGCQPLGSTLPTLLRHPPLCYFQIPHLLLPPRPGNSPVPSARAGRVEGVACRGARRVEG GAALPAPPRPRPRLRRCSHRRGNGPGPRRRLDGPPTVGRTDSASRVVTLASSCFSNTPGE VLSQGLCMGCALCHYGRSEGANFET >gi568815579f:4551870_4752427|GENSCAN_predicted_CDS_2|1338_bp atgggggtggaggtctcagggagtgccaagtgctgctgggaacggaagacctggcgaatg aacttgaagaagacaaaggcgggaaataggtcccaggggaaaggccccggggtcagagcg gaccacaagtcctgccatacgggcctacttggtgttcctccaacacgccagaaccccaaa gcctcagagcaccccagtaactccaagtccccgttccgctccagtctaggggtcaaagat gacagcctcagctcttgggggcccccgaacccccaggcccccactgcagcgggggggatc cggaccggccggccctcccctccaacaccactgcgacccctgccccgcggcctggacccg cgactcgtcctgttctgggctggacaggaggccggagaggactcgggcgcccaagtgcgg gttttgcgggcgcccgcggccaccgacaccttcttcccagtgcggcccgggctgcggccc cgggtccgaggaggcccggggagacccggaggaggtcagaaggccttcagggtccccggg atgggcctaatcatgattgtcactgaagatccaggagggcttcctggaggaggaggcagc tccattctgaaatgccaagggtacagctttattgggacagccagtgtctctgaatatgtt gacctggtcaacccgtcctccttcattcccaaggtgggaaaaccgtggagcttcccagag cccagcacagggcagaagccctacacgttccccaggagatggaaagaagattccctggtt ctggggactgaagattttgacaagctggtaatgcgcacctctggtcagaaccacgtctgc agaggaaagacccagccatatccctcttttctcagctgcctggctctgggtgcacatctc ccccggggctgccaacctctgggctccacactccccactctcctgcggcacccccctctc tgctacttccagatcccccacctgctcctgcctcccaggcctggcaactctccggtccct tccgcgcgggcggggcgagtggagggcgtggcctgccgaggggcgaggcgagtggagggc ggggccgcgctgcccgccccgccccggccccggccccggctccggcgctgctcccaccgc cgcggcaacggccccggcccacggaggcggctggacggacccccgacggttggacgtacg gactctgcttcgagagtagtcacactggcctcctcctgtttctccaacaccccaggcgag gtcctctctcagggcctttgcatgggctgtgccctctgccactacggcaggtccgagggc gccaactttgaaacataa >gi568815579f:4551870_4752427|GENSCAN_predicted_peptide_3|186_aa MDTFSTKSLALQAQKKLLSKMASKAVVAVLVDDTSSEVLDELYRATREFTRSRKEAQKML KNLVKVALKLGLLLRGDQLGGEELALLRRFRHRARCLAMTAVSFHQVDFTFDRRVLAAGL LECRDLLHQAVGPHLTAKSHGRINHVFGHLADCDFLAALYGPAEPYRSHLRRICEGLGRM LDEGSL >gi568815579f:4551870_4752427|GENSCAN_predicted_CDS_3|561_bp atggacaccttcagcaccaagagcctggctctgcaggcgcagaagaagctcctgagtaag atggcgtccaaggcagtggtggccgtgctggtggatgacaccagcagtgaggtgctggat gagctgtaccgcgccaccagggagttcacgcgcagccgcaaggaggcccagaagatgctc aagaacctggtcaaggtggccctgaagctgggactgctgctgcgtggggaccagctgggc ggtgaggagctggcgctgctgcggcgcttccgccaccgggcgcgctgcctggccatgacg gccgtcagcttccaccaggtggacttcaccttcgaccggcgcgtgctggccgccgggctg ctcgagtgccgcgacctgctgcaccaggccgtgggtccccacctgaccgccaagtcccac ggccgcatcaaccacgtgttcggccacctagccgactgcgacttcctggctgcgctctac ggccccgccgagccctaccgctcccacctgcgcaggatctgcgagggcctgggccggatg ctggacgagggcagcctctga >gi568815579f:4551870_4752427|GENSCAN_predicted_peptide_4|270_aa MAAPSGGWNGVGASLWAALLLGAVALRPAEAVSEPTTVAFDVRPGGVVHSFSHNVGPGDK YTCMFTYASQGGTNEQWQMSLGTSEDHQHFTCTIWRPQGKSYLYFTQFKAEVRGAEIEYA MAYSKAAFERESDVPLKTEEFEVTKTAAPVSSSHIQLHGANAKQKEESEAPSWGKGGSQG SGHSRSFCGFLLPAIYVSPTDLTLHITVFLSVTQDEHDRVARASCAEKAVGNSAYHQYPG QGTQNNLPDLQQHFAIFENPCPLLTNKNGK >gi568815579f:4551870_4752427|GENSCAN_predicted_CDS_4|813_bp atggcggcgcccagcggagggtggaacggcgtcggcgcgagcttgtgggccgcgctgctc ctaggggccgtggcgctgaggccggcggaggcggtgtccgagcccacgacggtggcgttt gacgtgcggcccggcggcgtcgtgcattccttctcccataacgtgggcccgggggacaaa tatacgtgtatgttcacttacgcctctcaaggagggaccaatgagcaatggcagatgagt ctggggaccagcgaagaccaccagcacttcacctgcaccatctggaggccccaggggaag tcctatctgtacttcacacagttcaaggcagaggtgcggggcgctgagattgagtacgcc atggcctactctaaagccgcatttgaaagggaaagtgatgtccctctgaaaactgaggaa tttgaagtgaccaaaacagcagcaccggtgtcctcaagccacattcagctgcatggcgcc aatgctaagcagaaagaggaatcggaggcgccgagctggggcaagggtgggagccaaggg agtggacacagcaggtctttctgtggcttcctgctgcctgccatctatgttagccccacc gaccttaccctccacatcactgttttcctgtctgtcacccaagatgagcacgacagggtt gccagggcctcctgtgcagaaaaggccgtgggaaacagtgcctaccaccagtaccctggc caaggcacccaaaataaccttcctgacctgcagcagcattttgcaatctttgagaacccc tgccccctgctgacaaacaaaaatggcaaatag >gi568815579f:4551870_4752427|GENSCAN_predicted_peptide_5|1162_aa MTTPALCWGGAGLRWATQAGPSLPAPHPYPCQEEENAALQSEQCPPPWLPGRWMGDSMRL ITLMAALAEVSWCDRTPGTPALLRSAERLMRKVKKLRLDKENTGSWRSFSLNSEGAERMA TTGTPTADRGDAAATDDPAARFQVQKHSWDGLRSIIHGSRKYSGLIVNKAPHDFQFVQKT DESGPHSHRLYYLEVHRSWKICSDGWALGMLSFEAQPQTALTTGKGCRGPLLQALDPHFL LWACACTQLLLQRLQCLTGLGSSAPRVPSGHGLGCMSSSSFCSHCGPGSRETDLCGNQMV VWQTKNLKGRIIQNRGRQVTAVFRSQVFLVPVTFLTSLAVLKLQQESHRNRGMPYGSREN SLLYSEIPKKVRKEALLLLSWKQMLDHFQATPHHGVYSREEELLRERKRLGVFGITSYDF HSESGLFLFQASNSLFHCRDGGKNGFMVSPMKPLEIKTQCSGPRMDPKICPADPAFFSFI NNSDLWVANIETGEERRLTFCHQGLSNVLDDPKSAGVATFVIQEEFDRFTGYWWCPTASW EGSEGLKTLRILYEEVDESEVEVIHVPSPALEERKTDSYRYPRTGSKNPKIALKLAEFQT DSQGKIVSTQEKELVQPFSSLFPKVEYIARAGWTRDGKYAWAMFLDRPQQWLQLVLLPPA LFIPSTENEEQRLASARAVPRNVQPYVVYEEVTNVWINVHDIFYPFPQSEGEDELCFLRA NECKTGFCHLYKVTAVLKSQGYDWSEPFSPGEDEFKCPIKEEIALTSGEWEVLARHGSKA SAKACSLLGSSSCFETISIPAQIWVNEETKLVYFQGTKDTPLEHHLYVVSYEAAGEIVRL TTPGFSHSCSMSQNFDMFVSHYSSVSTPPCVHVYKLSGPDDDPLHKQPRFWASMMEAASC PPDYVPPEIFHFHTRSDVRLYGMIYKPHALQPGKKHPTVLFVYGGPQVQLVNNSFKGIKY LRLNTLASLGYAVVVIDGRGSCQRGLRFEGALKNQMGQVEIEDQVEGLQFVAEKYGFIDL SRVAIHGWSYGGFLSLMGLIHKPQVFKVAIAGAPVTVWMAYDTGYTERYMDVPENNQHGY EAGSVALHVEKLPNEPNRLLILHGFLDENVHFFHTNFLVSQLIRAGKPYQLQIYPNERHS IRCPESGEHYEVTLLHFLQEYL >gi568815579f:4551870_4752427|GENSCAN_predicted_CDS_5|3489_bp atgacgaccccagcgctgtgttggggaggggcggggctcaggtgggcaacccaggcaggg ccctccctccctgctccccatccctatccctgccaggaggaggaaaacgcagccctgcag agcgagcaatgcccccctccttggttaccagggcgctggatgggggacagcatgcggctc attaccctaatggctgcccttgctgaagtttcctggtgtgatcggacaccaggcacccct gccctcctgaggtcagctgagcggttaatgcggaaggttaagaaactgcgcctggacaag gagaacaccggaagttggagaagcttctcgctgaattccgagggggctgagaggatggcc accaccgggaccccaacggccgaccgaggcgacgcagccgccacagatgacccggccgcc cgcttccaggtgcagaagcactcgtgggacgggctccggagcatcatccacggcagccgc aagtactcgggcctcattgtcaacaaggcgccccacgacttccagtttgtgcagaagacg gatgagtctgggccccactcccaccgcctctactacctggaggtccacaggagttggaag atctgctcggatggctgggccctgggcatgctgtccttcgaggctcagccacagacagct ctgactacagggaagggctgccgcgggcccctgctgcaggctctggatccacacttcctc ctctgggcgtgtgcctgcacgcagctcctcctgcagcgtctgcagtgcctcacgggactg ggctcctctgcaccacgagtgccttcagggcacgggttaggctgcatgtcctccagcagc ttctgcagccactgtgggcctggctcccgtgagacagacttgtgtgggaatcagatggtg gtctggcaaacaaagaatttaaaaggaagaatcatccagaacaggggtcggcaggtgaca gccgtcttcagaagtcaggttttcctggtccctgttactttcctgacatctttggctgtt ctcaagctacagcaggagagtcaccgcaacagaggaatgccatatggcagccgagagaac tccctcctctactctgagattcccaagaaggtccggaaagaggctctgctgctcctgtcc tggaagcagatgctggatcatttccaggccacgccccaccatggggtctactctcgggag gaggagctgctgagggagcggaaacgcctgggggtcttcggcatcacctcctacgacttc cacagcgagagtggcctcttcctcttccaggccagcaacagcctcttccactgccgcgac ggcggcaagaacggcttcatggtgtcccctatgaaaccgctggaaatcaagacccagtgc tcagggccccggatggaccccaaaatctgccctgccgaccctgccttcttctccttcatc aataacagcgacctgtgggtggccaacatcgagacaggcgaggagcggcggctgaccttc tgccaccaaggtttatccaatgtcctggatgaccccaagtctgcgggtgtggccaccttc gtcatacaggaagagttcgaccgcttcactgggtactggtggtgccccacagcctcctgg gaaggttcagagggcctcaagacgctgcgaatcctgtatgaggaagtcgatgagtccgag gtggaggtcattcacgtcccctctcctgcgctagaagaaaggaagacggactcgtatcgg taccccaggacaggcagcaagaatcccaagattgccttgaaactggctgagttccagact gacagccagggcaagatcgtctcgacccaggagaaggagctggtgcagcccttcagctcg ctgttcccgaaggtggagtacatcgccagggccgggtggacccgggatggcaaatacgcc tgggccatgttcctggaccggccccagcagtggctccagctcgtcctcctccccccggcc ctgttcatcccgagcacagagaatgaggagcagcggctagcctctgccagagctgtcccc aggaatgtccagccgtatgtggtgtacgaggaggtcaccaacgtctggatcaatgttcat gacatcttctatcccttcccccaatcagagggagaggacgagctctgctttctccgcgcc aatgaatgcaagaccggcttctgccatttgtacaaagtcaccgccgttttaaaatcccag ggctacgattggagtgagcccttcagccccggggaagatgaatttaagtgccccattaag gaagagattgctctgaccagcggtgaatgggaggttttggcgaggcacggctccaaggca tcagccaaagcctgctcgctcctgggcagcagcagttgtttcgagaccattagcatccca gcacagatctgggtcaatgaggagaccaagctggtgtacttccagggcaccaaggacacg ccgctggagcaccacctctacgtggtcagctatgaggcggccggcgagatcgtacgcctc accacgcccggcttctcccatagctgctccatgagccagaacttcgacatgttcgtcagc cactacagcagcgtgagcacgccgccctgcgtgcacgtctacaagctgagcggccccgac gacgaccccctgcacaagcagccccgcttctgggctagcatgatggaggcagccagctgc cccccggattatgttcctccagagatcttccatttccacacgcgctcggatgtgcggctc tacggcatgatctacaagccccacgccttgcagccagggaagaagcaccccaccgtcctc tttgtatatggaggcccccaggtgcagctggtgaataactccttcaaaggcatcaagtac ttgcggctcaacacactggcctccctgggctacgccgtggttgtgattgacggcaggggc tcctgtcagcgagggcttcggttcgaaggggccctgaaaaaccaaatgggccaggtggag atcgaggaccaggtggagggcctgcagttcgtggccgagaagtatggcttcatcgacctg agccgagttgccatccatggctggtcctacgggggcttcctctcgctcatggggctaatc cacaagccccaggtgttcaaggtggccatcgcgggtgccccggtcaccgtctggatggcc tacgacacagggtacactgagcgctacatggacgtccctgagaacaaccagcacggctat gaggcgggttccgtggccctgcacgtggagaagctgcccaatgagcccaaccgcttgctt atcctccacggcttcctggacgaaaacgtgcactttttccacacaaacttcctcgtctcc caactgatccgagcagggaaaccttaccagctccagatctaccccaacgagagacacagt attcgctgccccgagtcgggcgagcactatgaagtcacgttgctgcactttctacaggaa tacctctga