GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:39:42 Sequence gi568815587r:10414919_10634158 : 219240 bp : 43.59% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 36026 36125 100 1 1 50 117 81 0.254 6.88 1.02 Intr + 36245 36346 102 0 0 90 99 38 0.853 5.25 1.03 Intr + 40468 40530 63 2 0 65 61 67 0.252 0.39 1.04 Intr + 46597 46822 226 2 1 86 76 403 0.828 35.94 1.05 Intr + 63608 63812 205 2 1 84 74 180 0.523 15.40 1.06 Intr + 67145 67307 163 1 1 115 80 197 0.979 21.15 1.07 Intr + 69902 70121 220 0 1 93 87 387 0.958 36.46 1.08 Intr + 72227 72446 220 2 1 84 100 148 0.800 13.90 1.09 Intr + 78431 78720 290 1 2 87 33 418 0.571 32.24 1.10 Intr + 79750 79912 163 1 1 80 25 125 0.647 5.38 1.11 Intr + 79981 80112 132 0 0 107 113 114 0.999 16.64 1.12 Intr + 80652 80815 164 2 2 67 83 279 0.996 24.07 1.13 Intr + 81894 82020 127 0 1 69 92 109 0.737 10.08 1.14 Intr + 85168 85331 164 0 2 60 96 294 0.998 26.17 1.15 Intr + 86552 86672 121 2 1 121 86 158 0.998 19.40 1.16 Intr + 87803 87976 174 1 0 73 116 100 0.950 11.44 1.17 Intr + 89631 89741 111 2 0 -52 80 181 0.655 3.88 1.18 Term + 90790 90966 177 0 0 94 42 181 0.999 11.79 1.19 PlyA + 92185 92190 6 1.05 2.12 PlyA - 94205 94200 6 1.05 2.11 Term - 96272 96206 67 1 1 91 38 48 0.242 -2.49 2.10 Intr - 104223 104116 108 2 0 48 115 66 0.427 4.70 2.09 Intr - 110455 110274 182 1 2 88 71 63 0.693 3.27 2.08 Intr - 115833 115725 109 2 1 131 100 32 0.733 8.99 2.07 Intr - 139934 139849 86 2 2 8 116 61 0.007 -0.48 2.06 Intr - 144379 144220 160 0 1 95 9 185 0.019 11.29 2.05 Intr - 144976 144898 79 2 1 116 97 11 0.971 3.41 2.04 Intr - 145882 145577 306 2 0 90 84 65 0.550 2.72 2.03 Intr - 149161 149022 140 0 2 84 105 20 0.975 3.41 2.02 Intr - 149456 149285 172 0 1 66 94 122 0.985 9.60 2.01 Init - 153614 153530 85 2 1 110 100 84 0.998 10.78 2.00 Prom - 154189 154150 40 -4.96 3.20 PlyA - 156339 156334 6 1.05 3.19 Term - 161657 161414 244 1 1 74 49 47 0.324 -5.23 3.18 Intr - 165671 165537 135 1 0 125 75 205 0.894 22.58 3.17 Intr - 167068 166949 120 0 0 94 84 124 0.972 12.21 3.16 Intr - 176694 176630 65 0 2 79 106 7 0.563 -0.88 3.15 Intr - 178681 178574 108 1 0 103 111 -7 0.778 3.58 3.14 Intr - 179277 179228 50 1 2 92 91 -2 0.755 -1.10 3.13 Intr - 186141 186000 142 0 1 126 95 188 0.952 23.23 3.12 Intr - 188333 188202 132 2 0 60 109 123 0.987 12.44 3.11 Intr - 189627 189469 159 0 0 102 48 127 0.599 10.28 3.10 Intr - 190768 190622 147 1 0 75 75 31 0.648 0.93 3.09 Intr - 191854 191824 31 0 1 91 107 20 0.964 2.33 3.08 Intr - 194909 194810 100 0 1 68 93 107 0.670 8.37 3.07 Intr - 208938 208860 79 0 1 49 100 60 0.586 2.42 3.06 Intr - 211665 211048 618 0 0 100 90 439 0.859 37.81 3.05 Intr - 211906 211760 147 1 0 44 93 45 0.508 1.03 3.04 Intr - 212845 212798 48 1 0 104 76 37 0.654 2.98 3.03 Intr - 214793 214687 107 0 2 112 97 253 0.700 28.53 3.02 Intr - 217143 217073 71 2 2 80 82 15 0.865 -1.07 3.01 Intr - 219153 219050 104 0 2 78 92 77 0.638 6.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 126144 126432 289 0 1 74 53 152 0.873 5.15 S.002 Term - 144379 144193 187 0 1 95 37 214 0.980 13.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:10414919_10634158|GENSCAN_predicted_peptide_1|973_aa CPAAPLRSAQRVPFAPALRPSLSAGGMALSSEPEAVALTPVRSLALSPVPVPVPTPASAV ARAPAASGGVAESSQRSELEAHVGAVSGSEMPRQFPKLNISEVDEQVRLLAEKVFAKVLR EEDSKDALSLFTVPEDCPIGQKEAKERELQKELAEQKSVETAKRKKSFKMIRSQSLSLQM PPQQDWKGPPAASPAMSPTTPVVTGATSLPTPAPYAMPEFQRVTISGDYCAGITLEDYEQ AAKSLAKALMIREKYARLAYHRFPRITSQYLGHPRADTAPPEEGLPDFHPPPLPQEDPYC LDDAPPNLDYLVHMQGGILFVYDNKKMLEHQEPHSLPYPDLETYTVDMSHILALITDGPT QGWPLQMLEASKLESSMRLNYGLSPSPTCRKTYCHRRLNFLESKFSLHEMLNEMSEFKEL KSNPHRDFYNVRKVDTHIHAAACMNQKHLLRFIKHTYQTEPDRTVAEKRGRKITLRQVFD GLHMDPYDLTVDSLDVHAVSELLLQCRQQTAALAGDSAPWKATMIVLAGRWGLCCSFTGG LAKGSQADRVRIHGPGAQEFLRFHARIDIEDVFRCGHTEGRLVKGRQTFHRFDKFNSKYN PVGASELRDLYLKTENYLGGEYFARMVKEVARELEESKYQYSEPRLSIYGRSPEEWPNLA YWFIQHKVYSPNMRWIIQVPRIYDIFRSKKLLPNFGKMLENIFLPLFKATINPQDHRELH LFLKYVTGFDSVDDESKHSDHMFSDKSPNPDVWTSEQNPPYSYYLYYMYANIMVLNNLRR ERGLSTFLFRPHCGEAGSITHLVSAFLTADNISHGLLLKKSPVLQYLYYLAQIPIAMSPL SNNSLFLEYSKNPLREFLHKGLHVSLSTDDPMQFHYTKEALMEEYAIAAQVWKLSTCDLC EIARNSVLQSGLSHQEKQKFLGQNYYKEGPEGNDIRKTNVAQIRMAFRYETLCNELSFLS DAMKSEEITALTN >gi568815587r:10414919_10634158|GENSCAN_predicted_CDS_1|2922_bp tgcccagcagccccgctccgctctgcccagcgcgtcccctttgctccagccctgcggccg tccctttcggccggcggcatggccctgtcgtccgaacccgaggcggtggcgctgactccg gtccgatcccttgccctgtcccctgttcctgtccctgtccctacccctgcctctgcggtg gcccgagccccagcggcctcaggaggagtggcagagtccagccagcgctcggagctggag gcccacgtgggagcagtgagcggctctgagatgccgcggcagtttcccaagctgaacatc tctgaagtggatgagcaagtccggctcctggcggagaaggtgtttgctaaagtgctccga gaagaggacagcaaagatgccctgtccctgttcactgtcccagaggactgccccatcggg caaaaggaagccaaggagagggagctgcagaaggagctggcagagcagaagtctgtggag accgcaaaaagaaagaaaagtttcaagatgattcggtcccagtccctgtctctgcaaatg ccgccacagcaagattggaagggccccccggcagccagtccggccatgtctcccacaacc cctgtggtcactggagccacttccctgcccacgccagcaccctatgccatgcctgagttc cagcgggtcaccatcagcggagattactgtgccgggatcactttggaggactatgagcag gcagccaagagtctggccaaggccctaatgatccgggagaagtatgcgcggctcgcctac caccgcttcccgcggatcacatcccagtacctgggtcatccgcgggcggatactgcacct ccggaagagggccttccagacttccaccctcctccactgccccaggaagacccctactgc ctggatgatgcaccccccaacctggattacttggtccacatgcaggggggcatcctcttt gtgtatgataacaagaagatgctggagcaccaggagccgcacagcctaccctaccccgac ctggagacctacacggtggacatgagccacatcctggctctcatcaccgatggccccacc cagggttggcccctgcagatgctggaggcctccaaactggagagctcgatgcgactcaac tatggtctctccccatctccaacttgcaggaaaacctattgtcaccggcgactgaacttt ctggaatccaagttcagccttcatgagatgttaaacgaaatgtccgagttcaaagagttg aagagtaacccccaccgggacttctataacgtgagaaaggtggacacacacatccatgcg gccgcctgcatgaaccaaaagcatctgctgcgcttcatcaagcacacataccagacggag cctgacaggactgtggcagagaagcggggccggaagatcaccctgcggcaggtgtttgac ggcctgcacatggacccctacgacctcactgtggactcactggatgtccacgcggtgagt gagcttctgctccagtgccgccagcagacagcagccctggctggggactcagccccctgg aaagccaccatgattgtgcttgccgggaggtggggcctctgctgttccttcacagggggc cttgccaaaggatcccaagctgaccgagtgaggatccatggtcctggtgctcaggagttt ctaaggtttcatgcaagaattgacatagaagatgtctttcggtgtggccatacagaaggc cgcctcgtaaagggccggcagacattccaccgctttgacaagttcaactccaaatacaac cctgtgggggccagtgagctgcgtgacctgtatttgaaaactgaaaactatctgggagga gagtactttgctcggatggtcaaggaggttgcccgggagctggaggagagcaagtaccag tactcagagccacggctctccatctacggccgcagtcctgaggagtggcccaacctggcc tactggttcatccagcacaaggtctactctcccaacatgcgctggatcatccaggtgccc cggatttatgacatatttaggtcaaagaagctgctgccaaactttgggaagatgctggag aacatcttcctgccccttttcaaggccactatcaacccccaagatcatcgagagcttcac ctcttccttaaatatgtgacggggtttgacagcgtggatgatgagtccaagcacagcgac cacatgttttccgacaagagcccaaacccggacgtctggaccagtgagcagaacccaccc tacagctactacctgtactacatgtatgccaacatcatggtgctcaacaacctccgcagg gagcgcggcctgagcacgttcctgttccggccgcactgtggggaagccggctccatcacc cacctggtgtctgccttcctcactgctgacaacatttcccacgggctgctcctcaagaag agtccggtattgcagtatctctactaccttgctcagatccccattgccatgtctcctctt agcaacaacagtttgttcctcgaatattccaagaaccctctgagggaattcctacacaag ggactgcatgtttctctttccaccgatgaccccatgcagttccactacacgaaggaagca cttatggaagaatatgccattgcagctcaagtgtggaagctgagcacctgcgacctgtgt gagatcgccaggaacagcgtgctgcagagcggcctctcgcatcaggaaaagcaaaagttt ctgggacaaaattattataaagaaggacctgaaggaaatgatattcgaaagacaaatgtg gctcagatccggatggcattccgatatgagaccttatgcaatgagctcagcttcctgtct gatgctatgaaatcagaagagatcaccgccttgaccaactag >gi568815587r:10414919_10634158|GENSCAN_predicted_peptide_2|497_aa MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEA CRLLGLSLAGKDQVETALKASFETCSYGWVGDGFVVISRISPNPKCGKNGVGVLIWKVPV SRQFAAYCYNSSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPA PTTTPPAPASTSIPRRKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTAL LVLALLFFGAAAGLGFCYVKRYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKT DKNPEESKSPSKTTKNSRALFYLCSHTTTTVINSITKELNDKRTAKVASGQEKHLLFEVQ PGSDSSAFWKVVVRVVCTKINKSSGIVEASRIMNLYQFIQLYKDITSQAAGVLAQSSTSE EPDENSSSVTSCQASLWMGRVKQLTDEEECCICMDGRADLILPCAHSFCQKCIDKWQQTS TVHSPGVVDPYPNRQQK >gi568815587r:10414919_10634158|GENSCAN_predicted_CDS_2|1494_bp atggccaggtgcttcagcctggtgttgcttctcacttccatctggaccacgaggctcctg gtccaaggctctttgcgtgcagaagagctttccatccaggtgtcatgcagaattatgggg atcacccttgtgagcaaaaaggcgaaccagcagctgaatttcacagaagctaaggaggcc tgtaggctgctgggactaagtttggccggcaaggaccaagttgaaacagccttgaaagct agctttgaaacttgcagctatggctgggttggagatggattcgtggtcatctctaggatt agcccaaaccccaagtgtgggaaaaatggggtgggtgtcctgatttggaaggttccagtg agccgacagtttgcagcctattgttacaactcatctgatacttggactaactcgtgcatt ccagaaattatcaccaccaaagatcccatattcaacactcaaactgcaacacaaacaaca gaatttattgtcagtgacagtacctactcggtggcatccccttactctacaatacctgcc cctactactactcctcctgctccagcttccacttctattccacggagaaaaaaattgatt tgtgtcacagaagtttttatggaaactagcaccatgtctacagaaactgaaccatttgtt gaaaataaagcagcattcaagaatgaagctgctgggtttggaggtgtccccacggctctg ctagtgcttgctctcctcttctttggtgctgcagctggtcttggattttgctatgtcaaa aggtatgtgaaggccttcccttttacaaacaagaatcagcagaaggaaatgatcgaaacc aaagtagtaaaggaggagaaggccaatgatagcaaccctaatgaggaatcaaagaaaact gataaaaacccagaagagtccaagagtccaagcaaaactaccaaaaactcccgagccttg ttttacctctgctctcacaccaccacaacagtcatcaactcaataacaaaagaactcaat gacaaaagaacggctaaagtggcttctggccaggaaaaacatcttctctttgaggtacaa cctgggtctgattcctctgctttttggaaagtggttgtacgggtggtctgtaccaagatt aacaaaagcagtggcattgtggaggcatcacggatcatgaatttataccagtttattcaa ctttataaagatatcacaagtcaagcagcaggagtattggcacagagctccacctctgaa gaacctgatgaaaactcatcctctgtaacatcttgtcaggctagtctttggatgggaagg gtgaagcagctgaccgatgaggaggagtgttgtatctgtatggatgggcgggctgacctc atcctgccttgtgctcacagcttttgtcagaagtgtattgataaatggcagcagaccagt actgtccacagcccaggggttgtggacccctaccccaacagacagcaaaaataa >gi568815587r:10414919_10634158|GENSCAN_predicted_peptide_3|868_aa GPPAAGVSCSPTPTIVLTGDATSPEGETDKNLANRVHSPHKRLSHRHLKVSTASLTSVDP AGHIIDLVNDQLPDISISEEDKKKNLALLEEAKLQKGDEADVSSPHPGEPLLSLGKFLAG FGLQVPFSCLFDKPRAEALSFHIVHSYPIHTYTNGPLGKNVPKGLADRKQNDQRKVSQGR LAPRPPPVEKSKEIAIEQKENFDPLQYPETTPKGLAPVTNSSGKMALNSPQPGPVESELG KQLLKTGWEGSPLPRSPTQDAAGVGPPASQGRGPAGEPMGPEAGSKAELPPTVSRPPLLR GLSWDSGPEEPGPRLQKVLAKLPLAEEEKRFAGKAGGKLAKAPGLKDFQIQVQPVRMQKL TKLREEHILMRNQNLVGLKLPDLSEAAEQEKAIEEEESKSGLDVMPNISDVLLRKLRVHR SLPGSAPPLTEKEVEMYMDTALGMVEIKQAFKSDSLRLNPGSVTPCHLEQVNDRSGLGLF THKQNVFVQLSLAFRNDSYTLESRINQAERERNLTEENTEKELENFKASITVIVEPDSSA SLWHHCEHRETYQKLLEDIAVLHRLAARLSSRAEVVGAVRQEKRMSKATEVMMQYVENLK RTYEKDHAELMEFKKLANQNSSRSCGPSEDGVPRTARSMSLTLGKNMPRRRVSVAVVPKF NALNLPGQTPSSSSIPSLPALSESPNGKGSLPVTSALPALLENGKTNGDPDCEASAPALT LSCLEELSQETKARMEEEAYSKGFQEGLKKTKELQDLKEEEEEQKSESPEEPEEVEETEE EEKGPRSSKLEELVHFLQVMYPKLCQHWQVIWMMAAVMLVLTVVLGLYNSYNSCAEQADG PLGRSTCSAAQRDSWWSSGLQHEQPTEQ >gi568815587r:10414919_10634158|GENSCAN_predicted_CDS_3|2607_bp ggtcctcctgccgcaggagtatcttgcagtccaactcccacgattgtcctgactggggat gccacttcaccagaaggagaaaccgacaaaaacctggccaacagagttcacagtccccac aagaggctttctcaccgacacttgaaggtgtccactgcctccctgacatctgtggacccc gcggggcacatcattgacctggtgaatgaccagctgccagacatcagcatctcagaggag gacaagaagaaaaacctggcgctgctggaagaagccaagttgcagaagggggatgaggcc gacgtctcttcacctcaccctggcgagcctctcctatcccttgggaaattcttggctggt tttggattgcaggttccattttcctgtctgtttgacaaacctcgggcagaagccctctcc ttccacatcgtccactcataccccatacacacttatacaaatgggccccttggcaagaac gtccccaaagggctagctgacaggaagcagaatgaccagaggaaagtgtctcagggcagg ctggctcctcgtcctcctccagttgagaagtccaaagagattgcaatagaacaaaaggaa aacttcgatcccctccagtaccccgagaccacacccaaaggcctagctcctgttacaaac agcagtgggaaaatggccctgaacagccctcagcctggccccgtggagagcgagctgggg aagcagctcttgaaaacgggctgggagggcagccctctgccgagaagtccaacccaggat gcggcaggagtgggtcccccagcctcccaggggagaggcccagctggagagccgatgggg cccgaggctggctccaaagctgagcttccacccactgtgtcccggcccccgctgctgcga gggctctcctgggacagtggccctgaagaacctggcccccggctgcagaaagtgcttgcc aagctgccactggcagaggaagaaaagcgttttgcaggcaaggccggcggcaagctggcc aaggcccctggtctcaaagactttcagatacaagtgcagcccgtgcggatgcagaaactg accaagctccgagaggagcacatcctgatgagaaatcagaacttagtggggctcaagctt ccagaccttagtgaagcagctgagcaggaaaaagctattgaggaagaagagtcaaagagt ggcttagatgtcatgcctaatatttctgatgtgctgctgcgcaaactgcgggtccacagg agtctccctggaagtgcccctccactcactgaaaaggaagttgagatgtatatggacaca gctctcggcatggtggaaataaaacaggctttcaaatcagacagtctgaggttaaatcct ggctccgttactccgtgtcaccttgaacaagttaatgaccgttctggtcttggcctcttt acccataaacagaacgtgtttgtgcaactgtccttggcctttagaaatgacagctacact ctggaatctagaattaaccaggctgaaagggaacgcaacctgacagaggagaacactgag aaagaactggaaaacttcaaagcttccattacggtaattgtggagcctgattcctcagct tcactctggcaccactgtgagcaccgggaaacctaccagaagttgctggaggacatcgct gtcctgcaccgcctggctgcccgcctctccagccgagctgaggtggtaggcgccgtccgc caggaaaagcgcatgtcgaaagcaacggaagtgatgatgcagtatgtggagaatctaaag aggacgtatgagaaggaccatgcggagctcatggagtttaaaaagcttgcaaatcagaat tcaagccgcagctgtggcccctctgaagatggggtccctcgcacggcacggtccatgtcc ctcacgctgggaaagaatatgcctcgccggagggtcagcgttgctgtggttcctaagttt aatgccctgaatctgcctggccaaactcccagctcatcatccattccctccttaccagcc ttgtcggaatcacccaatgggaaaggcagcctacctgtcacttcagcactgcctgcactt ttggaaaatggaaagacaaatggggacccagattgtgaagcctctgctcctgcgctgacc ctgagctgcctggaggagcttagtcaggagaccaaggccaggatggaggaagaagcctac agcaagggattccaagaaggtctaaagaagaccaaagaacttcaagacctgaaggaggag gaggaagaacagaagagtgagagtcctgaggaacctgaagaggtagaagaaactgaggaa gaggaaaagggcccaagaagcagcaaacttgaagaattggtccatttcttacaagtcatg tatcccaaactgtgtcagcactggcaagtgatctggatgatggctgcagtgatgctggtc ttgactgttgtgctggggctctacaattcctataactcttgtgcagagcaggctgatggg ccccttggaagatccacttgctcggcagcccagagggactcctggtggagctcaggactc cagcatgagcagcctacagagcagtag