GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:56:17 Sequence gi568815584f:96163683_96364741 : 201059 bp : 44.26% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 10208 10266 59 1 2 76 116 43 0.978 6.93 1.02 Intr + 10708 10807 100 1 1 47 81 74 0.960 2.71 1.03 Intr + 12068 12382 315 1 0 64 97 124 0.865 7.16 1.04 Intr + 15285 15398 114 2 0 78 80 28 0.613 1.64 1.05 Intr + 20086 20234 149 0 2 72 51 96 0.017 3.33 1.06 Intr + 25878 26023 146 0 2 56 53 81 0.005 1.33 1.07 Intr + 43882 44038 157 2 1 46 115 95 0.049 7.07 1.08 Intr + 50753 50871 119 2 2 56 85 19 0.024 -1.49 1.09 Term + 52913 53184 272 0 2 35 47 258 0.802 12.05 1.10 PlyA + 53802 53807 6 1.05 2.00 Prom + 57312 57351 40 -4.66 2.01 Sngl + 59426 59665 240 1 0 70 48 249 0.937 14.18 2.02 PlyA + 60591 60596 6 1.05 3.00 Prom + 70670 70709 40 -4.56 3.01 Init + 71131 71177 47 0 2 85 40 73 0.031 2.36 3.02 Intr + 76721 77818 1098 2 0 102 53 1931 0.003 180.05 3.03 Intr + 82197 82249 53 0 2 -9 73 168 0.026 4.15 3.04 Intr + 82750 82777 28 2 1 83 80 43 0.012 0.17 3.05 Intr + 99991 100966 976 1 1 125 47 998 0.235 90.36 3.06 Term + 105791 105952 162 1 0 69 38 106 0.158 1.64 3.07 PlyA + 106212 106217 6 1.05 4.23 PlyA - 110315 110310 6 1.05 4.22 Term - 122303 122073 231 2 0 124 45 174 0.994 13.17 4.21 Intr - 126166 125974 193 0 1 43 78 146 0.337 8.59 4.20 Intr - 128416 128347 70 2 1 123 47 66 0.301 4.24 4.19 Intr - 131878 131800 79 1 1 71 62 45 0.246 -0.68 4.18 Intr - 138426 138325 102 0 0 79 72 56 0.868 3.47 4.17 Intr - 139573 139379 195 1 0 65 95 48 0.738 2.81 4.16 Intr - 142133 141984 150 2 0 55 5 183 0.623 7.06 4.15 Intr - 143234 143032 203 0 2 68 90 114 0.998 8.60 4.14 Intr - 145912 145771 142 1 1 56 95 134 0.996 10.83 4.13 Intr - 147605 147435 171 2 0 80 87 43 0.803 3.54 4.12 Intr - 153635 153463 173 0 2 53 100 45 0.806 1.86 4.11 Intr - 154173 154016 158 2 2 97 45 65 0.860 2.75 4.10 Intr - 158572 158430 143 1 2 63 95 54 0.973 2.75 4.09 Intr - 159053 158858 196 1 1 46 98 110 0.941 7.22 4.08 Intr - 160316 160214 103 0 1 51 113 75 0.936 5.53 4.07 Intr - 162240 161967 274 0 1 74 91 148 0.993 10.81 4.06 Intr - 164853 164665 189 0 0 90 119 19 0.868 4.98 4.05 Intr - 165062 164992 71 0 2 45 110 -31 0.455 -6.30 4.04 Intr - 170191 170006 186 2 0 75 91 133 0.851 11.96 4.03 Intr - 183659 183497 163 2 1 107 77 51 0.740 5.45 4.02 Intr - 199391 199133 259 1 1 -30 86 249 0.575 10.37 4.01 Intr - 200494 200188 307 2 1 107 49 197 0.752 13.21 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 50727 50871 145 2 1 71 85 49 0.811 3.18 S.002 Sngl + 76728 77822 1095 2 0 95 53 1903 0.913 184.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:96163683_96364741|GENSCAN_predicted_peptide_1|476_aa MGQRPLQAQGSWWRLGSGERGLLGEKEDPAEETDRQRLVRQEAWRGKSFRKDKTLANASL MAATKFAPEIEALTHWCARAPHGMRQKLGSNQTTCLAFSTCCILLPLPPFSWEYSPVNHL SKNPTSGSALRSMNFHVPRTMMGVVVEKEPVNPVLTISMLSNTNDKSKQVVCLQSSRPAP KLQLGETHRALDPPVQLEPGHGSDPTLITEMTKVTWRIMECHHPDKLNGSLQDCYVRARD CKPVLTLGSQAGFSGRSDTSAKVARGPSWGRPIVSALANVEVLEGQGGADGHRTKCPSPR VNPDENYGLWVMMMCQRRFINYNKLTALVGDVSNGGGYVCVGTGDICPPAGDNAPYPTSG DLGSDRRPHTYSPSLSDRQSWDLNTPAWATEQDPVSTKEKEERRGRKKEEKRKKKEEEEE GGGRRRKRKEEEEGGGGGRRRRRRKEEEEEEEEVGEEEGGGGGGDPTALFGRLWEN >gi568815584f:96163683_96364741|GENSCAN_predicted_CDS_1|1431_bp atgggtcagcggccgctgcaggcccaggggagctggtggcgcctgggctcaggggaaagg ggtcttcttggcgagaaggaagatccagcagaggagacagacaggcagcggctcgtgagg caggaagcctggagagggaagagcttccggaaagacaagaccttggccaatgccagcctc atggctgccacaaagtttgccccggagatagaggcactaactcattggtgcgccagagcc ccccatgggatgaggcagaagctgggctccaaccagaccacctgcttagctttttccacc tgctgtatcctgcttcctttgccccccttttcatgggagtactccccagtaaatcatctg agcaagaatccaacctcaggctccgctttaagatctatgaattttcatgtgccaagaacc atgatgggggtggtggtggagaaagagccagtgaatccagtactaacaatatcgatgctg agcaacacaaatgataaatccaagcaagttgtttgcctgcagagttcccggcctgctcca aagttgcagctgggggaaacccacagggctctagacccacccgtgcagctggaacctgga catggcagtgacccaactctaatcacagaaatgaccaaggtgacctggagaataatggag tgccaccaccctgacaagctgaatggctcgctccaggactgttacgtgagagcgagagat tgcaagccagtcctcacattgggcagtcaggcaggcttctctgggagaagtgacacttca gccaaagtggccaggggaccgagctgggggaggcccatcgtcagtgccctagcgaatgtg gaagtgctcgagggacagggaggggccgacggccatcgcaccaaatgtccatcaccaaga gtgaaccctgatgaaaactatggactttgggtgatgatgatgtgtcaacgcaggttcatc aattacaacaaattgactgctcttgtgggggatgtcagtaacgggggaggctacgtatgt gtaggcacaggggatatatgccctcctgccggggacaatgctccttatcccacaagcggg gacctgggctctgacaggaggccacacacttacagtccctcccttagtgaccggcagagc tgggatttgaacacccctgcctgggcaacagagcaagatcctgtttcaacaaaagaaaaa gaagagagaagaggaagaaagaaggaggagaagaggaagaagaaggaggaggaggaggaa ggaggaggaaggaggaggaagaggaaggaggaggaggaaggaggaggaggaggaaggagg aggaggaggaggaaggaggaggaggaggaggaggaggaagtaggagaagaagaaggagga ggaggaggaggagatcctacagcactgtttggaagattatgggaaaattag >gi568815584f:96163683_96364741|GENSCAN_predicted_peptide_2|79_aa MSHKQIHYSDRYDDEEFEYRHVMLPKDIAKLVPKTHLMSESEWRNLGVQQSQGWVHNMIH ESEPHILLFRRPPPKKPKK >gi568815584f:96163683_96364741|GENSCAN_predicted_CDS_2|240_bp atgtcacacaaacaaattcactattcggacagatacgacgacgaggagtttgagtatcga catgtcatgctgcccaaggacatagccaagctggtccctaaaacccatttgatgtctgaa tctgaatggaggaatcttggcgttcagcagagtcagggatgggtccataatatgatccat gaatcagaacctcacatcttgctgttccggcgcccaccacccaagaagccaaagaaatga >gi568815584f:96163683_96364741|GENSCAN_predicted_peptide_3|787_aa MEASEETLDTSPMTELADMLNVTLQGPTLNGTFAQSKCPQVEWLGWLNTIQPPFLWVLFV LATLENIFVLSVFCLHKSSCTVAEIYLGNLAAADLILACGLPFWAITISNNFDWLFGETL CRVVNAIISMNLYSSICFLMLVSIDRYLALVKTMSMGRMRGVRWAKLYSLVIWGCTLLLS SPMLVFRTMKEYSDEGHNVTACVISYPSLIWEVFTNMLLNVVGFLLPLSVITFCTMQIMQ VLRNNEMQKFKEIQTERRATVLVLVVLLLFIICWLPFQISTFLDTLHRLGILSSCQDERI IDVITQIASFMAYSNSCLNPLVYVIVGKRFRKKSWEVYQGVCQKGGCRSEPIQMENSMGT LRTSISVERQIHKLQDWAGSRHFDDDDDDDDDDDGGGVGTSWRNEKKSRSLCMASSWPPL ELQSSNQSQLFPQNATACDNAPEAWDLLHRVLPTFIISICFFGLLGNLFVLLVFLLPRRQ LNVAEIYLANLAASDLVFVLGLPFWAENIWNQFNWPFGALLCRVINGVIKANLFISIFLV VAISQDRYRVLVHPMASRRQQRRRQARVTCVLIWVVGGLLSIPTFLLRSIQAVPDLNITA CILLLPHEAWHFARIVELNILGFLLPLAAIVFFNYHILASLRTREEVSRTRCGGRKDSKT TALILTLVVAFLVCWAPYHFFAFLEFLFQVQAVRGCFWEDFIDLGLQLANFFAFTNSSLN PVIYVFVGRLFRTKLRPLPKKGLTAGLQSAQCSRSEVQVSESHMPLFQSSLHHRMGLADH RNSASHL >gi568815584f:96163683_96364741|GENSCAN_predicted_CDS_3|2364_bp atggaggccagcgaggaaacgctggacacgtcccccatgactgagctcgccgacatgctc aatgtcaccttgcaagggcccactcttaacgggacctttgcccagagcaaatgcccccaa gtggagtggctgggctggctcaacaccatccagccccccttcctctgggtgctgttcgtg ctggccaccctagagaacatctttgtcctcagcgtcttctgcctgcacaagagcagctgc acggtggcagagatctacctggggaacctggccgcagcagacctgatcctggcctgcggg ctgcccttctgggccatcaccatctccaacaacttcgactggctctttggggagacgctc tgccgcgtggtgaatgccattatctccatgaacctgtacagcagcatctgtttcctgatg ctggtgagcatcgaccgctacctggccctggtgaaaaccatgtccatgggccggatgcgc ggcgtgcgctgggccaagctctacagcttggtgatctgggggtgtacgctgctcctgagc tcacccatgctggtgttccggaccatgaaggagtacagcgatgagggccacaacgtcacc gcttgtgtcatcagctacccatccctcatctgggaagtgttcaccaacatgctcctgaat gtcgtgggcttcctgctgcccctgagtgtcatcaccttctgcacgatgcagatcatgcag gtgctgcggaacaacgagatgcagaagttcaaggagatccagacagagaggagggccacg gtgctagtcctggttgtgctgctgctattcatcatctgctggctgcccttccagatcagc accttcctggatacgctgcatcgcctcggcatcctctccagctgccaggacgagcgcatc atcgatgtaatcacacagatcgcctccttcatggcctacagcaacagctgcctcaaccca ctggtgtacgtgatcgtgggcaagcgcttccgaaagaagtcttgggaggtgtaccaggga gtgtgccagaaagggggctgcaggtcagaacccattcagatggagaactccatgggcaca ctgcggacctccatctccgtggaacgccagattcacaaactgcaggactgggcagggagc agacactttgatgatgatgatgatgatgatgatgatgatgatggtggtggtgttggcaca agctggcgcaatgagaagaagtccaggtcactgtgcatggcatcatcctggccccctcta gagctccaatcctccaaccagagccagctcttccctcaaaatgctacggcctgtgacaat gctccagaagcctgggacctgctgcacagagtgctgccaacatttatcatctccatctgt ttcttcggcctcctagggaacctttttgtcctgttggtcttcctcctgccccggcggcaa ctgaacgtggcagaaatctacctggccaacctggcagcctctgatctggtgtttgtcttg ggcttgcccttctgggcagagaatatctggaaccagtttaactggcctttcggagccctc ctctgccgtgtcatcaacggggtcatcaaggccaatttgttcatcagcatcttcctggtg gtggccatcagccaggaccgctaccgcgtgctggtgcaccctatggccagccggaggcag cagcggcggaggcaggcccgggtcacctgcgtgctcatctgggttgtggggggcctcttg agcatccccacattcctgctgcgatccatccaagccgtcccagatctgaacatcaccgcc tgcatcctgctcctcccccatgaggcctggcactttgcaaggattgtggagttaaatatt ctgggtttcctcctaccactggctgcgatcgtcttcttcaactaccacatcctggcctcc ctgcgaacgcgggaggaggtcagcaggacaaggtgcgggggccgcaaggatagcaagacc acagcgctgatcctcacgctcgtggttgccttcctggtctgctgggccccttaccacttc tttgccttcctggaattcttattccaggtgcaagcagtccgaggctgcttttgggaggac ttcattgacctgggcctgcaattggccaacttctttgccttcactaacagctccctgaat ccagtaatttatgtctttgtgggccggctcttcaggaccaagctcaggcccctgcccaag aagggcctcaccgcggggctgcagagtgcacagtgcagccggtcagaggtccaggtctct gagtcacacatgccactgttccagtcctcactccaccaccgcatgggcctggccgaccac cgaaactcggcatcccatctgtaa >gi568815584f:96163683_96364741|GENSCAN_predicted_peptide_4|1252_aa XVTSQKAEIRLRRLPEQPKFFTQVPAGSGAFSISQGMQFCPQMLRKPKFPDRCYDTPRIK RAYARSALQTARQTATKKLPQKQYLNEAGAEELADNGHRSMLGLRPRRGLEPESLPRGRS RRNPQPQSAQLPAVTMPWPFSESIKKRACRYLLQRYLGHFLQEKLSLEQLSLDLYQGTGS LAQVPLDKWCLNEILESADAPLEVTEGFIQSISLSVPWGSLLQDNCALEVRGLEMVFRPR PRPENSSKIGLANKDRKNRPMQQEDEYRIQMELNRYYLRKDSLSVGVSSEQSFYETETAR TPSSREETGSHSPVCLQLHYKHSENRGPQGNQARLSSVPHKAELQIKLNPVCCELDISIV DRLNSLLQPQKLATVEMMASHMYTSYNKHISLHKAFTEVFLDDSHSPANCRISVQVATPA LNLSVRFPIPDLRSDQERGPWFKKSLQKEILYLAFTDLEFKTEFIGGSTPEQIKLELTFR ELIGSFQEEKGDPSIKFFHVSSGVDGDTTSSDDFDWPRIVLKINPPAMHSILERIAAEEE EENDGHYQEEEEGGAHSLKDVCDLRRPAPSPFSSRRVMFENEQMVMPGDPVEMTEFQDKA ISNSHYVLELTLPNIYVTLPNKSFYEKLYNRIFNDLLLWEPTAPSPVETFENISYGIGLS VASQLINTFNKDSFSAFKSAVHYDEESGSEEETLQYFSTVDPNYRSRRKKKLDSQNKNSQ SFLSVLLNINHGLIAVFTDVKTEPRFELHCSSDVVHIRTCSDSCAALMNLIQYIASYGDL QTPNKADMKPGAFQRRSKVDSSGRSSSRGPVLPEADQQMLRDLMSDAMEEIDMQQGTSSV KPQANGVLDEKSQIQEPCCSDLFLFPDESGNVSQESGPTYASFSHHFISDAMTGVPTEND DFCILFAPKAAMQEKEEEPVIKIMVDDAIVIRDNYFSLPVNKTDTSKAPLHFPIPVIRYV VKEVKFQHEVYPPCKPDCDSSLSEHPVSRQVFIVQDLEIRDRLATSQMNKFLYLYCSKEM PRKAHSNMLTVKALHVCPESGRSPQECCLRVSLMPLRLNIDQDALFFLKDFFTSLSAEVE LQMTPDPEENLDSRQKFPFDLIIMANMYQWIRLSSNSFLPFDKMIQAAAETAYDMVSPGT LSIEPKKTKRFPHHRLAHQPVDLREGVAKAYSVVKEGITDTAQTIYETAAREHESRGVTG AVGEVLRQIPPAVVKPLIVATEATSNVLGGMRNQIRPDVRQDESQKWRHGDD >gi568815584f:96163683_96364741|GENSCAN_predicted_CDS_4|3759_bp ngagttacctcccaaaaggcagaaatacgattgcgcagactgccagaacagcccaagttc ttcacccaggttccggcggggtcaggagcctttagcatttctcaaggaatgcagttttgc ccacagatgcttcgaaaacccaagtttcctgatcgttgctatgacacacccaggattaag cgagcgtatgcgagaagcgctctgcaaaccgccaggcaaacggcaacgaagaagctgcct cagaaacaatacctgaatgaggcgggggccgaagaacttgctgacaatgggcaccggtca atgctcgggcttaggccccgccgcggcctggagccggagtcgctaccccgaggccggagc cgtcgcaacccgcagccgcagtcagcccagctgccagcagtcactatgccttggccgttt tcggagtccatcaagaagagggcctgccggtacctcctgcagaggtacctgggccacttt ctgcaggagaagctgagcctggagcagctcagcctggacctgtaccaaggcaccgggtcc ctcgcccaggtccccttggacaaatggtgtctcaatgagatcttggagtcagcagatgca cccttagaagtcactgaaggattcattcagtcaatttccctgtcagttccatggggctct ttactgcaggataattgtgcactggaagtgagaggattagaaatggtcttccggcctaga cctcgcccagaaaattctagcaaaatagggttagctaataaagataggaaaaatcgaccc atgcagcaggaagacgagtatcgaattcagatggaattaaaccggtattatttgagaaaa gattccctctctgtgggtgtatcttcagagcaaagcttttatgagacagaaacagctcgt acaccttctagccgtgaagaaactggttcccattcccctgtgtgtcttcagcttcattat aagcattctgagaatagagggccccagggtaatcaagcaagacttagttcagttcctcac aaggcagaattgcaaattaaattaaatccagtgtgttgtgagctggatatcagtattgtg gacaggttaaattccttgcttcaaccacagaaacttgccacagtagagatgatggcatcc cacatgtatacttcatataataaacatattagtctgcacaaggctttcactgaagtgttt ctagatgattcacatagtcctgcaaattgtcggatatcagtacaagttgccacaccagca ttaaacctttctgttcgcttcccaatacctgatcttcgatctgatcaagaaagaggacca tggtttaagaagtcacttcagaaggagatcctttatttagccttcacagatctagaattt aagactgaatttataggaggatcaaccccagaacaaattaaattggaacttacctttaga gaactaattggatcgttccaggaagagaaaggagatccatctattaagtttttccatgtg tctagtggagtagatggagatacaacatcgtcagatgactttgactggccacgaattgta ctgaaaataaatccaccagccatgcattccattttggagagaattgcagctgaagaagaa gaggagaatgatggtcactaccaggaggaagaggaaggaggtgctcattccttgaaagat gtttgtgatctaagaagaccagccccatctcctttttcttctcgtagagtaatgtttgaa aatgaacagatggtgatgccaggagaccctgtagaaatgacagaatttcaggataaagca atcagcaattctcactatgtgctggaacttacgttaccaaatatttatgtaacactacct aataagagcttttatgagaagctttataataggatctttaatgacttgctactgtgggaa ccaacagctccttcaccagtggagacattcgagaatatttcctatggcattgggctttca gtagccagtcagctcattaatactttcaacaaagatagttttagtgcatttaaatctgca gttcactatgatgaggaaagtggatctgaggaggagactttgcagtatttttccactgtt gatcccaactatcgttctcgcaggaaaaaaaaattagactctcagaacaagaactctcag agttttctctcagttcttctgaatattaatcatggattaatagcagtgttcacagatgtg aagactgagccccgctttgagttacactgttccagcgatgttgtccatatcagaacgtgc tcagactcttgtgctgcgttaatgaatctcattcagtacattgcaagctatggtgacttg cagacacctaacaaggcagatatgaagcctggagcctttcaaagaaggtctaaggtagat tccagtggtcgatcatcctcacgtggtccagtacttcctgaagcagatcaacaaatgtta cgagatctgatgagtgatgctatggaggagatcgacatgcaacaaggcacctcgtcagta aaaccacaggctaatggtgttttggatgaaaaatctcaaattcaggagccatgttgttca gacctcttcctgtttcctgacgagagtgggaatgtatcccaggagtccggccccacctat gcctcattctctcaccatttcatcagtgatgcaatgacaggtgtgcccactgagaatgat gacttttgcattctttttgcaccaaaagcagccatgcaggagaaggaagaagaaccagtt ataaaaatcatggttgatgatgcaattgtgataagagacaattatttcagtctgcccgtt aataagaccgatacgagcaaagcccccttacactttcccattcctgtgattcgctatgtg gtgaaggaggtgaagtttcagcatgaagtctacccgccatgcaaacctgattgtgattcc agcctctcagaacacccagtctcccggcaggtgttcattgttcaggatcttgagattcga gatcgtttggcaacatcacaaatgaataaatttttatacctgtattgcagtaaagaaatg cctcgaaaagctcactccaacatgttgacagtgaaagccttacacgtgtgtccagaatct ggcaggtccccacaggagtgctgcttgagagtgtcgctgatgccgctccgcctcaatatt gaccaggatgctttgttcttcctgaaggatttcttcacaagtctttctgcagaagtagag cttcaaatgactccagatccagaagagaatttagattcacgtcagaagttcccattcgac ttgattatcatggcaaacatgtatcaatggatcagactttcttctaattcatttcttcct tttgacaaaatgatccaggcagctgcagagactgcttatgatatggtgtctcctggtacc ctttctatcgagcccaagaagaccaaaaggtttcctcatcaccggttagcccaccagcca gtagacctgagggaaggtgtggccaaggcctacagtgttgtgaaagagggaatcacagac acggctcagaccatttatgaaactgcggctcgagaacacgagagcagaggggtgactggt gccgtgggcgaggttctgcgccagattcctccggcagtggtgaaacctctgattgttgcc acagaagcaacgtcaaacgtgctgggtggcatgagaaaccaaattaggccagatgtccgg caagacgagtcacagaaatggcgccacggggatgactga