GENSCAN 1.0 Date run: 6-Nov-116 Time: 13:32:15 Sequence gi568815579r:4558008_4770334 : 212327 bp : 53.65% C+G : Isochore 3 (51 - 57 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 142 19 124 2 1 89 72 229 0.987 21.55 1.01 Init - 450 330 121 0 1 89 74 238 0.978 20.81 1.00 Prom - 12207 12168 40 -5.41 2.00 Prom + 12806 12845 40 -2.71 2.01 Init + 17413 17546 134 0 2 58 105 61 0.639 4.39 2.02 Intr + 18350 18386 37 2 1 79 94 26 0.447 1.05 2.03 Intr + 22189 22268 80 0 2 50 80 66 0.463 0.84 2.04 Intr + 22602 22765 164 0 2 46 86 54 0.410 1.13 2.05 Intr + 26310 26439 130 1 1 26 60 113 0.456 2.56 2.06 Intr + 27684 27795 112 0 1 103 42 65 0.194 4.28 2.07 Intr + 32841 32876 36 2 0 144 94 38 0.766 8.94 2.08 Intr + 33771 33787 17 2 2 90 105 1 0.458 -3.18 2.09 Intr + 34719 34890 172 0 1 72 58 100 0.279 5.86 2.10 Intr + 61618 61733 116 0 2 137 84 -1 0.010 4.15 2.11 Intr + 81402 81622 221 0 2 52 71 162 0.621 9.37 2.12 Term + 82690 82808 119 2 2 79 42 106 0.657 4.11 2.13 PlyA + 83351 83356 6 -0.45 3.00 Prom + 83883 83922 40 -0.21 3.01 Sngl + 93863 94423 561 1 0 83 46 1361 0.736 127.92 3.02 PlyA + 96104 96109 6 1.05 4.10 PlyA - 96341 96336 6 1.05 4.09 Term - 96866 96778 89 0 2 104 38 49 0.304 -0.38 4.08 Intr - 98398 98315 84 2 0 20 97 90 0.268 3.39 4.07 Intr - 100321 100236 86 0 2 90 68 56 0.798 3.76 4.06 Intr - 100663 100552 112 2 1 18 91 52 0.800 -1.56 4.05 Intr - 101996 101924 73 2 1 83 96 79 0.985 7.67 4.04 Intr - 102743 102662 82 1 1 156 83 93 0.860 15.74 4.03 Intr - 106930 106869 62 1 2 123 78 39 0.982 4.42 4.02 Intr - 110638 110588 51 1 0 110 105 25 0.978 5.99 4.01 Init - 112327 112154 174 1 0 109 71 291 0.989 26.82 4.00 Prom - 113886 113847 40 -1.71 5.31 PlyA - 115572 115567 6 1.05 5.30 Term - 118649 118557 93 2 0 128 52 131 0.993 11.63 5.29 Intr - 121939 121828 112 0 1 120 94 118 0.917 16.48 5.28 Intr - 124831 124689 143 1 2 127 52 365 0.984 36.46 5.27 Intr - 125622 125470 153 0 0 52 98 319 0.987 30.08 5.26 Intr - 126802 126656 147 1 0 108 47 245 0.757 23.34 5.25 Intr - 127764 127619 146 1 2 96 109 213 0.970 24.71 5.24 Intr - 130885 130750 136 1 1 79 81 267 0.981 25.75 5.23 Intr - 131715 131563 153 0 0 95 75 254 0.943 25.58 5.22 Intr - 132305 132237 69 2 0 104 72 33 0.505 3.17 5.21 Intr - 132950 132871 80 0 2 153 76 31 0.990 8.17 5.20 Intr - 136816 136654 163 1 1 88 105 116 0.945 13.36 5.19 Intr - 137548 137371 178 0 1 113 75 249 0.946 26.54 5.18 Intr - 139644 139544 101 0 2 127 73 128 0.947 14.61 5.17 Intr - 142270 142209 62 2 2 101 116 57 0.995 8.74 5.16 Intr - 144148 144020 129 2 0 41 109 218 0.980 20.37 5.15 Intr - 144709 144596 114 2 0 140 100 170 0.999 24.32 5.14 Intr - 146047 145879 169 1 1 127 81 218 0.999 25.03 5.13 Intr - 146297 146124 174 2 0 120 89 364 0.999 40.35 5.12 Intr - 147963 147851 113 1 2 109 115 124 0.986 17.80 5.11 Intr - 154788 154608 181 0 1 65 25 81 0.385 -0.54 5.10 Intr - 155519 155334 186 2 0 54 64 59 0.261 0.50 5.09 Intr - 155997 155891 107 1 2 85 47 62 0.853 2.23 5.08 Intr - 156330 156074 257 2 2 108 94 495 0.964 49.72 5.07 Intr - 161934 161844 91 1 1 111 100 75 0.514 10.55 5.06 Intr - 175041 175006 36 1 0 124 91 36 0.009 6.22 5.05 Intr - 185131 185032 100 1 1 94 81 61 0.007 6.18 5.04 Intr - 201703 201608 96 1 0 101 49 84 0.101 6.51 5.03 Intr - 204125 204009 117 2 0 30 99 45 0.150 0.97 5.02 Intr - 209435 209344 92 0 2 84 96 52 0.307 5.81 5.01 Init - 209826 209814 13 0 1 78 127 -5 0.550 2.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 148217 148151 67 2 1 114 44 -10 0.858 -1.49 S.002 Term - 154054 153855 200 2 2 62 45 141 0.890 5.08 S.003 Term - 180352 180310 43 0 1 100 28 132 0.832 5.32 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:4558008_4770334|GENSCAN_predicted_peptide_1|82_aa MQTPRASPPRPALLLLLLLLGGAHGLFPEEPPPLSVAPRDYLNHYPVFVGSGPGRLTPAE GADDLNIQRVLRVNRTLFIGDS >gi568815579r:4558008_4770334|GENSCAN_predicted_CDS_1|246_bp atgcagaccccgcgagcgtcccctccccgcccggccctgctgcttctgctgctgctactg gggggcgcccacggcctctttcctgaggagccgccgccgcttagcgtggcccccagggac tacctgaaccactatcccgtgtttgtgggcagcgggcccggacgcctgacccccgcagaa ggtgctgacgacctcaacatccagcgagtcctgcgggtcaacaggacgctgttcattggg gacagn >gi568815579r:4558008_4770334|GENSCAN_predicted_peptide_2|445_aa MGVEVSGSAKCCWERKTWRMNLKKTKAGNRSQGKGPGVRADHKSCHTGLLGVPPTRQNPK ASEHPSNSKSPFRSSLGVKDDSLSSWGPPNPQAPTAAGGIRTGRPSPPTPLRPLPRGLDP RLVLFWAGQEAGEDSGAQVRVLRAPAATDTFFPVRPGLRPRVRGGPGRPGGGQKAFRVPG MGLIMIVTEDPGGLPGGGGSSILKCQGYSFIGTASVSEYVDLVNPSSFIPKVGKPWSFPE PSTGQKPYTFPRRWKEDSLVLGTEDFDKLVMRTSGQNHVCRGKTQPYPSFLSCLALGAHL PRGCQPLGSTLPTLLRHPPLCYFQIPHLLLPPRPGNSPVPSARAGRVEGVACRGARRVEG GAALPAPPRPRPRLRRCSHRRGNGPGPRRRLDGPPTVGRTDSASRVVTLASSCFSNTPGE VLSQGLCMGCALCHYGRSEGANFET >gi568815579r:4558008_4770334|GENSCAN_predicted_CDS_2|1338_bp atgggggtggaggtctcagggagtgccaagtgctgctgggaacggaagacctggcgaatg aacttgaagaagacaaaggcgggaaataggtcccaggggaaaggccccggggtcagagcg gaccacaagtcctgccatacgggcctacttggtgttcctccaacacgccagaaccccaaa gcctcagagcaccccagtaactccaagtccccgttccgctccagtctaggggtcaaagat gacagcctcagctcttgggggcccccgaacccccaggcccccactgcagcgggggggatc cggaccggccggccctcccctccaacaccactgcgacccctgccccgcggcctggacccg cgactcgtcctgttctgggctggacaggaggccggagaggactcgggcgcccaagtgcgg gttttgcgggcgcccgcggccaccgacaccttcttcccagtgcggcccgggctgcggccc cgggtccgaggaggcccggggagacccggaggaggtcagaaggccttcagggtccccggg atgggcctaatcatgattgtcactgaagatccaggagggcttcctggaggaggaggcagc tccattctgaaatgccaagggtacagctttattgggacagccagtgtctctgaatatgtt gacctggtcaacccgtcctccttcattcccaaggtgggaaaaccgtggagcttcccagag cccagcacagggcagaagccctacacgttccccaggagatggaaagaagattccctggtt ctggggactgaagattttgacaagctggtaatgcgcacctctggtcagaaccacgtctgc agaggaaagacccagccatatccctcttttctcagctgcctggctctgggtgcacatctc ccccggggctgccaacctctgggctccacactccccactctcctgcggcacccccctctc tgctacttccagatcccccacctgctcctgcctcccaggcctggcaactctccggtccct tccgcgcgggcggggcgagtggagggcgtggcctgccgaggggcgaggcgagtggagggc ggggccgcgctgcccgccccgccccggccccggccccggctccggcgctgctcccaccgc cgcggcaacggccccggcccacggaggcggctggacggacccccgacggttggacgtacg gactctgcttcgagagtagtcacactggcctcctcctgtttctccaacaccccaggcgag gtcctctctcagggcctttgcatgggctgtgccctctgccactacggcaggtccgagggc gccaactttgaaacataa >gi568815579r:4558008_4770334|GENSCAN_predicted_peptide_3|186_aa MDTFSTKSLALQAQKKLLSKMASKAVVAVLVDDTSSEVLDELYRATREFTRSRKEAQKML KNLVKVALKLGLLLRGDQLGGEELALLRRFRHRARCLAMTAVSFHQVDFTFDRRVLAAGL LECRDLLHQAVGPHLTAKSHGRINHVFGHLADCDFLAALYGPAEPYRSHLRRICEGLGRM LDEGSL >gi568815579r:4558008_4770334|GENSCAN_predicted_CDS_3|561_bp atggacaccttcagcaccaagagcctggctctgcaggcgcagaagaagctcctgagtaag atggcgtccaaggcagtggtggccgtgctggtggatgacaccagcagtgaggtgctggat gagctgtaccgcgccaccagggagttcacgcgcagccgcaaggaggcccagaagatgctc aagaacctggtcaaggtggccctgaagctgggactgctgctgcgtggggaccagctgggc ggtgaggagctggcgctgctgcggcgcttccgccaccgggcgcgctgcctggccatgacg gccgtcagcttccaccaggtggacttcaccttcgaccggcgcgtgctggccgccgggctg ctcgagtgccgcgacctgctgcaccaggccgtgggtccccacctgaccgccaagtcccac ggccgcatcaaccacgtgttcggccacctagccgactgcgacttcctggctgcgctctac ggccccgccgagccctaccgctcccacctgcgcaggatctgcgagggcctgggccggatg ctggacgagggcagcctctga >gi568815579r:4558008_4770334|GENSCAN_predicted_peptide_4|270_aa MAAPSGGWNGVGASLWAALLLGAVALRPAEAVSEPTTVAFDVRPGGVVHSFSHNVGPGDK YTCMFTYASQGGTNEQWQMSLGTSEDHQHFTCTIWRPQGKSYLYFTQFKAEVRGAEIEYA MAYSKAAFERESDVPLKTEEFEVTKTAAPVSSSHIQLHGANAKQKEESEAPSWGKGGSQG SGHSRSFCGFLLPAIYVSPTDLTLHITVFLSVTQDEHDRVARASCAEKAVGNSAYHQYPG QGTQNNLPDLQQHFAIFENPCPLLTNKNGK >gi568815579r:4558008_4770334|GENSCAN_predicted_CDS_4|813_bp atggcggcgcccagcggagggtggaacggcgtcggcgcgagcttgtgggccgcgctgctc ctaggggccgtggcgctgaggccggcggaggcggtgtccgagcccacgacggtggcgttt gacgtgcggcccggcggcgtcgtgcattccttctcccataacgtgggcccgggggacaaa tatacgtgtatgttcacttacgcctctcaaggagggaccaatgagcaatggcagatgagt ctggggaccagcgaagaccaccagcacttcacctgcaccatctggaggccccaggggaag tcctatctgtacttcacacagttcaaggcagaggtgcggggcgctgagattgagtacgcc atggcctactctaaagccgcatttgaaagggaaagtgatgtccctctgaaaactgaggaa tttgaagtgaccaaaacagcagcaccggtgtcctcaagccacattcagctgcatggcgcc aatgctaagcagaaagaggaatcggaggcgccgagctggggcaagggtgggagccaaggg agtggacacagcaggtctttctgtggcttcctgctgcctgccatctatgttagccccacc gaccttaccctccacatcactgttttcctgtctgtcacccaagatgagcacgacagggtt gccagggcctcctgtgcagaaaaggccgtgggaaacagtgcctaccaccagtaccctggc caaggcacccaaaataaccttcctgacctgcagcagcattttgcaatctttgagaacccc tgccccctgctgacaaacaaaaatggcaaatag >gi568815579r:4558008_4770334|GENSCAN_predicted_peptide_5|1236_aa MSVYAFTLCQGQCQTFYLLSDLKASGQPHEIDPSQRGADRKTVPGSHSGGGGYTQGSPSE ALPFVSLLLLFPEQLSRSEPPTDMTGHPSGWTGHGEDHEPGQRAQREEENAALQSEQCPP PWLPGRWMGDSMRLITLMAALAEVSWCDRTPGTPALLRSAERLMRKVKKLRLDKENTGSW RSFSLNSEGAERMATTGTPTADRGDAAATDDPAARFQVQKHSWDGLRSIIHGSRKYSGLI VNKAPHDFQFVQKTDESGPHSHRLYYLEVHRSWKICSDGWALGMLSFEAQPQTALTTGKG CRGPLLQALDPHFLLWACACTQLLLQRLQCLTGLGSSAPRVPSGHGLGCMSSSSFCSHCG PGSRETDLCGNQMVVWQTKNLKGRIIQNRGRQVTAVFRSQVFLVPVTFLTSLAVLKLQQE SHRNRGMPYGSRENSLLYSEIPKKVRKEALLLLSWKQMLDHFQATPHHGVYSREEELLRE RKRLGVFGITSYDFHSESGLFLFQASNSLFHCRDGGKNGFMVSPMKPLEIKTQCSGPRMD PKICPADPAFFSFINNSDLWVANIETGEERRLTFCHQGLSNVLDDPKSAGVATFVIQEEF DRFTGYWWCPTASWEGSEGLKTLRILYEEVDESEVEVIHVPSPALEERKTDSYRYPRTGS KNPKIALKLAEFQTDSQGKIVSTQEKELVQPFSSLFPKVEYIARAGWTRDGKYAWAMFLD RPQQWLQLVLLPPALFIPSTENEEQRLASARAVPRNVQPYVVYEEVTNVWINVHDIFYPF PQSEGEDELCFLRANECKTGFCHLYKVTAVLKSQGYDWSEPFSPGEDEFKCPIKEEIALT SGEWEVLARHGSKASAKACSLLGSSSCFETISIPAQIWVNEETKLVYFQGTKDTPLEHHL YVVSYEAAGEIVRLTTPGFSHSCSMSQNFDMFVSHYSSVSTPPCVHVYKLSGPDDDPLHK QPRFWASMMEAASCPPDYVPPEIFHFHTRSDVRLYGMIYKPHALQPGKKHPTVLFVYGGP QVQLVNNSFKGIKYLRLNTLASLGYAVVVIDGRGSCQRGLRFEGALKNQMGQVEIEDQVE GLQFVAEKYGFIDLSRVAIHGWSYGGFLSLMGLIHKPQVFKVAIAGAPVTVWMAYDTGYT ERYMDVPENNQHGYEAGSVALHVEKLPNEPNRLLILHGFLDENVHFFHTNFLVSQLIRAG KPYQLQIYPNERHSIRCPESGEHYEVTLLHFLQEYL >gi568815579r:4558008_4770334|GENSCAN_predicted_CDS_5|3711_bp atgtctgtttacgcttttactctgtgccagggtcagtgccagacattctacctgctgagt gatctgaaggcctcagggcagccccatgagatagatcccagccagaggggagcagatagg aagacagttcctggcagccacagtggtggaggggggtacacccaggggagcccctcagag gctctgccatttgtgtctttgctcctgctgttcccggagcagctcagtcgctccgagccc cccacggacatgacaggacatccaagtgggtggacaggacacggcgaggatcacgaacca ggccagcgagcacagagggaggaggaaaacgcagccctgcagagcgagcaatgcccccct ccttggttaccagggcgctggatgggggacagcatgcggctcattaccctaatggctgcc cttgctgaagtttcctggtgtgatcggacaccaggcacccctgccctcctgaggtcagct gagcggttaatgcggaaggttaagaaactgcgcctggacaaggagaacaccggaagttgg agaagcttctcgctgaattccgagggggctgagaggatggccaccaccgggaccccaacg gccgaccgaggcgacgcagccgccacagatgacccggccgcccgcttccaggtgcagaag cactcgtgggacgggctccggagcatcatccacggcagccgcaagtactcgggcctcatt gtcaacaaggcgccccacgacttccagtttgtgcagaagacggatgagtctgggccccac tcccaccgcctctactacctggaggtccacaggagttggaagatctgctcggatggctgg gccctgggcatgctgtccttcgaggctcagccacagacagctctgactacagggaagggc tgccgcgggcccctgctgcaggctctggatccacacttcctcctctgggcgtgtgcctgc acgcagctcctcctgcagcgtctgcagtgcctcacgggactgggctcctctgcaccacga gtgccttcagggcacgggttaggctgcatgtcctccagcagcttctgcagccactgtggg cctggctcccgtgagacagacttgtgtgggaatcagatggtggtctggcaaacaaagaat ttaaaaggaagaatcatccagaacaggggtcggcaggtgacagccgtcttcagaagtcag gttttcctggtccctgttactttcctgacatctttggctgttctcaagctacagcaggag agtcaccgcaacagaggaatgccatatggcagccgagagaactccctcctctactctgag attcccaagaaggtccggaaagaggctctgctgctcctgtcctggaagcagatgctggat catttccaggccacgccccaccatggggtctactctcgggaggaggagctgctgagggag cggaaacgcctgggggtcttcggcatcacctcctacgacttccacagcgagagtggcctc ttcctcttccaggccagcaacagcctcttccactgccgcgacggcggcaagaacggcttc atggtgtcccctatgaaaccgctggaaatcaagacccagtgctcagggccccggatggac cccaaaatctgccctgccgaccctgccttcttctccttcatcaataacagcgacctgtgg gtggccaacatcgagacaggcgaggagcggcggctgaccttctgccaccaaggtttatcc aatgtcctggatgaccccaagtctgcgggtgtggccaccttcgtcatacaggaagagttc gaccgcttcactgggtactggtggtgccccacagcctcctgggaaggttcagagggcctc aagacgctgcgaatcctgtatgaggaagtcgatgagtccgaggtggaggtcattcacgtc ccctctcctgcgctagaagaaaggaagacggactcgtatcggtaccccaggacaggcagc aagaatcccaagattgccttgaaactggctgagttccagactgacagccagggcaagatc gtctcgacccaggagaaggagctggtgcagcccttcagctcgctgttcccgaaggtggag tacatcgccagggccgggtggacccgggatggcaaatacgcctgggccatgttcctggac cggccccagcagtggctccagctcgtcctcctccccccggccctgttcatcccgagcaca gagaatgaggagcagcggctagcctctgccagagctgtccccaggaatgtccagccgtat gtggtgtacgaggaggtcaccaacgtctggatcaatgttcatgacatcttctatcccttc ccccaatcagagggagaggacgagctctgctttctccgcgccaatgaatgcaagaccggc ttctgccatttgtacaaagtcaccgccgttttaaaatcccagggctacgattggagtgag cccttcagccccggggaagatgaatttaagtgccccattaaggaagagattgctctgacc agcggtgaatgggaggttttggcgaggcacggctccaaggcatcagccaaagcctgctcg ctcctgggcagcagcagttgtttcgagaccattagcatcccagcacagatctgggtcaat gaggagaccaagctggtgtacttccagggcaccaaggacacgccgctggagcaccacctc tacgtggtcagctatgaggcggccggcgagatcgtacgcctcaccacgcccggcttctcc catagctgctccatgagccagaacttcgacatgttcgtcagccactacagcagcgtgagc acgccgccctgcgtgcacgtctacaagctgagcggccccgacgacgaccccctgcacaag cagccccgcttctgggctagcatgatggaggcagccagctgccccccggattatgttcct ccagagatcttccatttccacacgcgctcggatgtgcggctctacggcatgatctacaag ccccacgccttgcagccagggaagaagcaccccaccgtcctctttgtatatggaggcccc caggtgcagctggtgaataactccttcaaaggcatcaagtacttgcggctcaacacactg gcctccctgggctacgccgtggttgtgattgacggcaggggctcctgtcagcgagggctt cggttcgaaggggccctgaaaaaccaaatgggccaggtggagatcgaggaccaggtggag ggcctgcagttcgtggccgagaagtatggcttcatcgacctgagccgagttgccatccat ggctggtcctacgggggcttcctctcgctcatggggctaatccacaagccccaggtgttc aaggtggccatcgcgggtgccccggtcaccgtctggatggcctacgacacagggtacact gagcgctacatggacgtccctgagaacaaccagcacggctatgaggcgggttccgtggcc ctgcacgtggagaagctgcccaatgagcccaaccgcttgcttatcctccacggcttcctg gacgaaaacgtgcactttttccacacaaacttcctcgtctcccaactgatccgagcaggg aaaccttaccagctccagatctaccccaacgagagacacagtattcgctgccccgagtcg ggcgagcactatgaagtcacgttgctgcactttctacaggaatacctctga