GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:04:33 Sequence gi568815597f:27730836_27950443 : 219608 bp : 45.58% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1768 1822 55 1 1 80 127 34 0.969 5.48 1.02 Intr + 3196 3348 153 0 0 91 94 121 0.995 13.27 1.03 Intr + 8350 8547 198 0 0 3 35 145 0.191 0.25 1.04 Intr + 9160 9576 417 0 0 33 5 263 0.069 6.72 1.05 Intr + 13819 13976 158 0 2 138 91 67 0.840 10.81 1.06 Intr + 24360 24495 136 0 1 28 86 155 0.957 9.87 1.07 Intr + 28691 28792 102 1 0 95 62 87 0.966 7.17 1.08 Intr + 38096 38194 99 1 0 41 93 53 0.105 1.41 1.09 Intr + 42389 42643 255 1 0 77 68 233 0.166 17.84 1.10 Intr + 62106 62159 54 2 0 101 91 4 0.257 1.18 1.11 Intr + 70843 70980 138 0 0 67 84 74 0.280 5.56 1.12 Intr + 79411 79454 44 0 2 64 90 18 0.140 -3.36 1.13 Intr + 81328 81433 106 1 1 83 78 169 0.892 15.62 1.14 Intr + 87016 87088 73 0 1 121 76 63 0.988 7.38 1.15 Term + 88815 88945 131 1 2 129 47 72 0.093 5.54 1.16 PlyA + 89090 89095 6 1.05 2.00 Prom + 89867 89906 40 -2.46 2.01 Init + 100001 100056 56 1 2 100 105 61 0.965 9.34 2.02 Intr + 101921 101981 61 2 1 111 64 11 0.554 -0.26 2.03 Intr + 107864 108017 154 1 1 23 86 115 0.977 4.45 2.04 Intr + 110179 110539 361 2 1 75 46 173 0.778 6.08 2.05 Intr + 111612 111657 46 0 1 113 36 14 0.769 -2.89 2.06 Intr + 112351 112495 145 0 1 41 94 167 0.908 12.46 2.07 Intr + 116193 116257 65 1 2 109 97 -2 0.954 1.24 2.08 Term + 119258 119611 354 1 0 34 40 264 0.987 10.69 2.09 PlyA + 120813 120818 6 1.05 3.00 Prom + 133772 133811 40 -4.46 3.01 Init + 141737 141830 94 1 1 109 84 153 0.994 17.33 3.02 Intr + 145753 145893 141 2 0 79 86 189 0.787 18.12 3.03 Intr + 148809 149219 411 1 0 135 79 323 0.961 30.26 3.04 Intr + 151136 152208 1073 0 2 95 74 1049 0.523 94.15 3.05 Intr + 154460 154616 157 1 1 99 95 91 0.873 10.48 3.06 Term + 156810 156862 53 1 2 77 55 60 0.415 -0.81 3.07 PlyA + 158665 158670 6 1.05 4.11 PlyA - 160239 160234 6 1.05 4.10 Term - 161412 161328 85 2 1 150 36 104 0.999 8.53 4.09 Intr - 163271 163177 95 2 2 67 108 43 0.991 2.96 4.08 Intr - 163562 163455 108 2 0 40 91 52 0.689 1.18 4.07 Intr - 166286 166170 117 2 0 70 109 106 0.977 11.56 4.06 Intr - 166872 166798 75 0 0 66 103 29 0.772 1.91 4.05 Intr - 176206 176093 114 1 0 103 95 131 0.989 15.94 4.04 Intr - 176447 176346 102 2 0 107 91 39 0.985 6.47 4.03 Intr - 183334 183228 107 2 2 71 39 152 0.970 8.53 4.02 Intr - 183906 183587 320 2 2 65 33 254 0.668 13.20 4.01 Init - 185710 185694 17 1 2 97 58 -4 0.557 -2.75 4.00 Prom - 188467 188428 40 -6.36 5.00 Prom + 192734 192773 40 -2.46 5.01 Init + 204349 204409 61 0 1 69 92 68 0.456 4.73 5.02 Intr + 207686 207739 54 0 0 74 88 47 0.419 2.25 5.03 Intr + 210713 210741 29 0 2 48 115 33 0.292 -0.17 5.04 Intr + 212732 212853 122 1 2 64 63 58 0.701 0.19 5.05 Intr + 214350 214610 261 0 0 113 92 232 0.790 22.70 5.06 Intr + 218230 218327 98 1 2 65 110 99 0.785 9.45 5.07 Term + 218595 218602 8 1 2 109 52 0 0.337 -3.47 5.08 PlyA + 219070 219075 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 29660 29746 87 1 0 96 48 75 0.926 1.96 S.002 Intr + 88815 88897 83 1 2 129 98 77 0.860 12.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:27730836_27950443|GENSCAN_predicted_peptide_1|706_aa XKTNTICKKCAQNVQLYGTPKPCQYCNIIAAFIGNKCQRCTNSEKKYGPPYSCEQCKQQC AFDRKDDRKKDAGGKAGAPIVCMLNQGLKHVSEGCVFAYCQVEEKPYWKDPNNDFRKNLE VTTVPTLLKYGTPQKLQPGQFRGQVPPYSRLWVHLDVMDGHFVPNVTFGYPVVESFQKQL VQDLFFDIHVMVSKLEQWVKSIAIAEASQYTFHLEATKYSEALIEDIWENGMKVGLTIKP GTTVEYLAPWTNQIDTALVITVEPGFGRQKFMDDMVDGKLLCWLCTLSYKRVLQKTKEQR KHLSSSSRAGHQEKEQYSRLSGGGHYNSFSPDLALDSPGTDHFVIIAQLKEEVATLKKML HQKDQMILEKEKKITELKADFQYQESQMRAKMNQMEKTHKEVTEQLQFGTSKNKYCYELH NNTKLETMQTSINKRMDNQTFLRQPAGPGWRLLPVGERCRASRSQLLVMSYGPLDMYRNP GPSGPQLRDFSSIIQTCSGNIQRISQASEPGYRAGGRELSRGQAWVRLLCFGGKSQKLRY KEQRQQRLQKERLMNDFSAALNNFQAVQRRVSEKEKESIARARAGSRLSAEERQREEQLV SFDSHEEWNQMQSQEDEVAITEQDLELIKERETAIRQLEADILDVNQIFKDLAMMIHDQG DLIDSIEANVESSEVHVERATEQLQRAAYYQVKAGTKESHSVLQTF >gi568815597f:27730836_27950443|GENSCAN_predicted_CDS_1|2121_bp nntaaaaccaatacaatatgcaagaaatgtgctcagaacgtgcagttgtatggaacgccc aaaccttgtcagtattgcaacataattgcagcatttattgggaataaatgccagcgctgc acaaattcagaaaagaagtatggaccaccctattcttgtgaacagtgcaagcagcagtgt gcatttgacaggaaagatgatagaaagaaggatgctggaggaaaagctggtgcccctatt gtatgcatgctgaaccaggggctgaagcatgttagtgaaggatgtgtgttcgcctactgc caagtagaagaaaagccttattggaaagatccaaataatgacttcagaaaaaacttggaa gtaaccacagtgcctacactacttaaatatggaacacctcaaaaactgcaacctggccag tttaggggtcaagtgcctccgtattctagactctgggtccacctggatgtcatggatggg cactttgttcccaacgtcacctttggttatcctgtggtagaaagctttcaaaagcagcta gtccaggaccttttctttgacatacatgtgatggtgtccaagctggaacagtgggtaaaa tcaatagctatagcagaagccagtcagtacacctttcatcttgaggctactaagtactca gaggctttgattgaagacatttgggagaatgggatgaaggttggccttaccatcaaacca ggaactacagttgagtatttggcaccatggactaatcaaatagatacggccttggttatc acagtggaacctgggtttggaaggcagaaattcatggatgatatggtagatgggaaattg ctgtgctggctgtgcacactttcatacaaacgggtccttcagaagaccaaagagcagagg aaacacctgagtagctcttctcgtgctggccaccaggagaaggagcagtatagtcgcctg agtggtggtggccattataacagcttctccccagacctggctctggactcaccaggcact gaccactttgtcatcattgcccaactgaaggaagaagtggctaccctgaagaagatgttg catcaaaaggatcaaatgattttagagaaagagaagaagattacagagttgaaggctgat tttcagtaccaggaatcgcagatgagagccaaaatgaaccagatggagaaaacccacaaa gaagtcacagaacaactgcagtttggaactagtaaaaataaatactgctatgaacttcat aataacaccaaattggaaacaatgcaaacatccatcaataagagaatggataatcaaact tttctccgtcagcctgcgggtcccggctggcggctgcttccggtaggagagcggtgtaga gcgagcaggtctcagctcctcgtcatgtcatacggtcccttagacatgtaccggaacccg gggccctcggggccccagctccgggacttcagcagcatcatccagacgtgcagcggcaac atccagcggatcagccaagccagtgagccggggtaccgagctggggggcgggagctgtcc cggggacaggcctgggtaaggcttctttgttttggagggaagagccagaaactcaggtat aaagagcagcgccagcagagacttcagaaggaacgcctcatgaatgacttctctgcagcc ttaaacaatttccaggctgtgcagagaagggtatctgaaaaggaaaaggagagtattgcc agagcaagagctggatctcgtctttctgcagaagagaggcaaagagaggagcagctggtc tcatttgacagccatgaggagtggaaccagatgcagagccaggaggatgaggtggccatc actgagcaggatttggaacttattaaagaaagagaaacggcaattcggcagctggaggct gacattttggatgtcaatcagatatttaaagatttggccatgatgatccatgaccagggt gatctgattgatagcatagaagccaatgtggaaagctcagaggtgcacgtcgaaagagcc actgaacagttacagcgagctgcttactatcaggtaaaagcgggtaccaaagaaagtcac tctgtgttgcagactttttag >gi568815597f:27730836_27950443|GENSCAN_predicted_peptide_2|413_aa MAAAANSGSSLPLFDCPTWAGKPPPGLHLDVVKGDKLIEKLIIDEKKYYLFGRNPDLCDF TIDHQSCSRVHAALVYHKHLKRVFLIDLNSTHGTFLGHIRLEPHKPQQIPIDSTVSFGAS TRAYTLREKPQTLPSAVKGDEKMGGEDDELKGLLGLPEEETELDVIPCLCHCFVFGDPGF WSIQLFYVAGNASDLAAGGSLESSNGPLAIWPPYPGCCYTSMAFSSNLTEFNTAHNKRIS TLTIEEGNLDIQRPKRKRKNSRVTFSEDDEIINPEDVDPSVGRFRNMVQTAVVPVKKKRV EGPGSLGLEESGSRRMQNFAFSGGLYGGLPPTHSEAGSQPHGIHGTALIGGLPMPYPNLA PDVDLTPVVPSAVNMNPAPNPAVYNPEAVNEPKKKKYAKEAWPGKKPTPSLLI >gi568815597f:27730836_27950443|GENSCAN_predicted_CDS_2|1242_bp atggcggcagccgcgaactccggctctagcctcccgctgttcgactgcccaacctgggca ggtaagccccctcccggtttacatctggatgtagtcaaaggagacaaactaattgagaaa ctgattattgatgagaagaagtattacttatttgggagaaaccctgatttgtgtgacttt accattgaccaccagtcttgctctcgggtccatgctgcacttgtctaccacaagcatctg aagagagttttcctgatagatctcaacagtacacacggcactttcttgggtcacattcgg ttggaacctcacaagcctcagcaaattcccatcgattccacggtctcatttggcgcatcc acaagggcatacactctgcgcgagaagcctcagacattgccatcggctgtgaaaggagat gagaagatgggtggagaggatgatgaactcaagggcttactggggcttccagaggaggaa actgagcttgatgtaattccctgtttatgtcattgttttgtctttggggaccctggtttt tggagtatacagctgttctatgtagctggaaatgccagtgacttagcagcagggggttct cttgagtccagtaatgggcctttggcaatctggccaccttatccaggctgctgctatact tccatggcttttagttcaaacctgacagagttcaacactgcccacaacaagcggatttct acccttaccattgaggagggaaatctggacattcaaagaccaaagaggaagaggaagaac tcacgggtgacattcagtgaggatgatgagatcatcaacccagaggatgtggatccctca gttggtcgattcaggaacatggtgcaaactgcagtggtcccagtcaagaagaagcgtgtg gagggccctggctccctgggcctggaggaatcagggagcaggcgcatgcagaactttgcc ttcagcggaggactctacgggggcctgccccccacacacagtgaagcaggctcccagcca catggcatccatgggacagcactcatcggtggcttgcccatgccatacccaaaccttgcc cctgatgtggacttgactcctgttgtgccgtcagcagtgaacatgaaccctgcaccaaac cctgcagtctataaccctgaagctgtaaatgaacccaagaagaagaaatatgcaaaagag gcttggccaggcaagaagcccacaccttccttgctgatttga >gi568815597f:27730836_27950443|GENSCAN_predicted_peptide_3|642_aa MEPVPLQDFVRALDPASLPRVLRVCSGVYFEGSIYEISGNECCLSTGDLIKVTQVRLQKV VCENPKTSQTMELAPNFQGYFTPLNTPQSYETLEELVSATTQSSKQLPTCFMSTHRIVTE GRVVTEDQLLMLEAVVMHLGIRSARCVLGMEGQQVILHLPLSQKGPFWTWEPSAPRTLLQ VLQDPALKDLVLTCPTLPWHSLILRPQYEIQAIMHMRRTIVKIPSTLEVDVEDVTASSRH VHFIKPLLLSEVLAWEGPFPLSMEILEVPEGRPIFLSPWVGSLQKGQRLCVYGLASPPWR VLASSKGRKVPRHFLVSGGYQGKLRRRPREFPTAYDLLGAFQPGRPLRVVATKDCEGERE ENPEFTSLAVGDRLEVLGPGQAHGAQGSDVDVLVCQRLSDQAGEDEEEECKEEAESPERV LLPFHFPGSFVEEMSDSRRYSLADLTAQFSLPCEVKVVAKDTSHPTDPLTSFLGLRLEEK ITEPFLVVSLDSEPGMCFEIPPRWLDLTVVKAKGQPDLPEGSLPIATVEELTDTFYYRLR KLPACEIQAPPPRPPKNQGLSKQRRHSSEGGVKSSQVLGLQQHARLPKPKAKTLPEFIKD GSSTYSKIPAHRKGHRPAKPQRQDLAPMPFSREVNLRCKGGP >gi568815597f:27730836_27950443|GENSCAN_predicted_CDS_3|1929_bp atggagccggtgccgctgcaggacttcgtgcgcgccttggaccccgcctccctcccgcgc gtgctgcgggtctgctcgggggtctacttcgagggctccatctatgagatctctgggaat gagtgctgcctctccacgggggacctgatcaaggtcacccaggtccgcctccagaaggtg gtctgtgagaacccgaagaccagccagaccatggagctcgcccccaacttccagggctac ttcacccccctcaacaccccacagagctatgaaaccctggaggagctggtctctgccaca actcagagctccaagcagctgcccacttgcttcatgtcgacccacaggattgtcacagag ggcagggtggtgactgaggaccagctcctcatgcttgaggctgtggtgatgcacctcggg atccgctctgcccgctgtgtcctgggcatggagggtcagcaggtcatcctgcacctgccc ctatcccagaaggggcccttctggacatgggagcctagtgcccctcgaactctgctccag gtcctacaggatccagccctgaaagacctcgtcctcacctgccccaccctgccctggcat tccctgatcctgcggccccagtatgagatccaagccatcatgcacatgcgcaggaccatt gtcaagatcccttctaccctggaggtcgacgtggaggacgtcaccgcctcctcccggcac gtccactttatcaaaccgctgctgctgagcgaggtcctggcctgggaaggccctttcccc ctgtccatggagatcctggaggttcctgagggccgccccatcttcctcagcccgtgggtg ggctccttgcaaaaaggccagaggctttgcgtctatggcctagcctcaccaccctggcgg gtcctggcctcaagcaagggccgcaaggtgcccaggcacttcctggtgtcagggggctac caaggcaagctgcggcggcggccaagggagttccccacggcctatgacctcctaggtgct ttccagccaggccggccactccgggtggtggccacaaaggactgtgagggcgagagggag gagaatcccgagttcacgtccctggctgtgggtgaccggctggaggtgctggggcctggc caggcccatggggcccagggcagtgacgtggatgtcttggtttgtcagcggctgagtgac caggctggggaggatgaggaggaagagtgcaaagaggaggcagagagcccagagcgggtc ctgctgcccttccacttccctggcagtttcgtggaggagatgagtgacagccggcgctac agcctggcagatctgactgcccagttttcactgccttgtgaggtcaaggtggtggccaag gacaccagccaccccactgaccctctgacctccttcctgggcctgcggctggaggagaag atcacagagccattcttggtggtgagcctagactctgagcctgggatgtgctttgagatc cctccccggtggctggacctgactgttgtgaaggccaaggggcagccagacttgccagag gggtctctccccatagccacagtggaggagctgacagacaccttctattatcgtcttcgg aagttaccagcctgtgagatccaagcccccccacccaggccccctaaaaatcagggcctc agcaagcagaggagacacagcagtgagggaggcgtcaagtcttctcaagtcttaggattg cagcaacacgctcggctgcccaaacccaaggcgaagaccttgccagagttcatcaaggat ggctccagtacgtacagcaagattcctgcccacaggaagggccacaggcccgctaagccc caaaggcaggatctagctcccatgcccttctccagagaagtcaacctgcgctgcaaagga ggaccctga >gi568815597f:27730836_27950443|GENSCAN_predicted_peptide_4|379_aa MADHLSAEGFALRKDGRGGFWSQLGAQYAFSGAWAEETGTSVASVSTDWLKRAPRELTGS AIRGKAFVVPEKSSQSGAVAAAFCGFPLFPQTRTFSASLRRIVTKMWNSNDGGGFESYGS SSYGGAGGYTQSPGGFGSPAPSQAEKKSRARAQHIVPCTISQLLSATLVDEVFRIGNVEI SQVTIVGIIRHAEKAPTNIVYKIDDMTAAPMDVRQWVDTDDTSSENTVVPPETYVKVAGH LRSFQNKKSLVAFKIMPLEDMNEFTTHILEVINAHMVLSKANSQPSAGRAPISNPGMSEA GNFGGNSFMPANGLTVAQNQVLNLIKACPRPEGLNFQDLKNQLKHMSVSSIKQAVDFLSN EGHIYSTVDDDHFKSTDAE >gi568815597f:27730836_27950443|GENSCAN_predicted_CDS_4|1140_bp atggcagatcacctaagtgcggagggttttgcccttcgtaaagatggccgcggaggcttt tggagccaactgggagcgcagtacgcgttttctggagcatgggcagaggagacaggaaca agcgtagcatccgtgagcaccgattggctgaagcgagcaccccgggagctgactggctcc gccattcgcgggaaggcgtttgtggtgccagagaaaagtagccagagcggcgcagtggcg gccgcgttctgtggttttccgctattcccccagacccgcaccttctcggcctctttgcgg agaatcgtgaccaagatgtggaacagtaatgatgggggtggattcgaaagctatggcagc tcctcatacgggggagccggcggctacacgcagtccccggggggctttggatcgcccgca ccttctcaagccgaaaagaaatcaagagcccgagcccagcacattgtgccctgtactata tctcagctgctttctgccactttggttgatgaagtgttcagaattgggaatgttgagatt tcacaggtcactattgtggggatcatcagacatgcagagaaggctccaaccaacattgtt tacaaaatagatgacatgacagctgcacccatggacgttcgccagtgggttgacacagat gacaccagcagtgaaaacactgtggttcctccagaaacatatgtgaaagtggcaggccac ctgagatcttttcagaacaaaaagagcctggtagcctttaagatcatgcccctggaggat atgaatgagttcaccacacatattctggaagtgatcaatgcacacatggtactaagcaaa gccaacagccagccctcagcagggagagcacctatcagcaatccaggaatgagtgaagca gggaactttggtgggaatagcttcatgccagcaaatggcctcactgtggcccaaaaccag gtgttgaatttgattaaggcttgtccaagacctgaagggttgaactttcaggatctcaag aaccagctgaaacacatgtctgtatcctcaatcaagcaagctgtggattttctgagcaat gaggggcacatctattctactgtggatgatgaccattttaaatccacagatgcagaataa >gi568815597f:27730836_27950443|GENSCAN_predicted_peptide_5|210_aa MRLLAWLIFLANWGGARAEPGMLAEERLNVMALSEIMPGKMDNKQANKVTQEMGEEVIAI AEGEEILGVILVLSRNRMNRGQICSTLLQLRERPALKEDVFSPAGKFWHIADLHLDPDYK VSKDPFQVCPSAGSQPVPDAGPWGDYLCDSPWALINSSIYAMKEIEPEPDFILWTGDDTP HVPDEKLGEAAVLEIVERLTKLIREVFPDS >gi568815597f:27730836_27950443|GENSCAN_predicted_CDS_5|633_bp atgaggctgctcgcctggctgattttcctggctaactggggaggtgccagggctgaacca gggatgttagcagaggagagattaaatgtgatggccttaagtgagatcatgccagggaag atggacaataagcaagccaacaaggtgacacaagaaatgggggaggaagtgattgcaata gcagagggtgaggaaatcctgggcgtcattcttgttttatcgagaaacaggatgaacaga ggccagatctgctcaaccctccttcagctgagagagagaccagctttgaaggaggatgtt ttttcccctgcagggaagttctggcacatcgctgacctgcaccttgaccctgactacaag gtatccaaagaccccttccaggtgtgcccatcagctggatcccagccagtgcccgacgca ggcccctggggtgactacctctgtgattctccctgggccctcatcaactcctccatctat gccatgaaggagattgagccagagccagacttcattctctggactggtgatgacacgcct catgtgcccgatgagaaactgggagaggcagctgtactggaaattgtggaacgcctgacc aagctcatcagagaggtctttccagactcatga